On Mon, Nov 17, 2003 at 11:29:23PM +0100, Luc Van Oostenryck wrote:
I don't know how much browser are non-UTF-8
compliant,
but I'm sure there will problems.
My point of view is more like the following:
why should we need UTF-8 on fr when 99.9999% of text are fairly happy with
ISO8859-1 (the charset that most european use and french in particular)
and named or numeric HTML entities are OK for the rest.
Why should we need ISO 8859-N with named or numeric HTML entities
when 100% of text works with UTF-8 ? The number is much smaller than 99.9999%.
It's probably much smaller than 99% too.
I worked with articles about Japanese language when Polish Wikipedia
used ISO 8859-2 and it wasn't fun. I wasted lot of time trying to locate
which &-entity corresponds to which character to fix a typo or make some
other change. In the end I installed local mirror Wikipedia with UTF-8,
and make a bunch of Perlscripts that converted from ISO 8859-2 + &codes;
to readable representation.