Thanks a lot, Tim. I am glad to get an answer so quickly.
Best regards,
Alexander Prudnikov.
> Can you explain to me in a few words how the Wiki engine
> performs full-text search in UTF-8 encoded articles?
>
TS> The handling depends on the language. The basic UTF-8 handling is to
TS> convert to lower case using an internal table, then to encode any
TS> non-ASCII characters as hexadecimal using bin2hex(). The Chinese and
TS> Japanese language files have special routines to insert spaces into
TS> strings, since MySQL uses a word search and those languages don't
TS> usually use spaces.
TS> The relevant functions are doUpdate() in includes/SearchUpdate.php, and
TS> stripForSearch() in languages/LanguageUtf8.php .
TS> -- Tim Starling
TS> _______________________________________________
TS> Wikitech-l mailing list
TS> Wikitech-l(a)Wikipedia.org
TS> http://mail.wikipedia.org/mailman/listinfo/wikitech-l