Hello,
I totally rewrote the compiler to works with the XML format of
download.wikipedia.org. You will need to download two files from
download.wikipedia.org to use it: xxx-all-titles-in-ns0 and
xxx-pages-articles.xml.
After giving this two files as arguments of Analyzer, you will have the
compiled output for query tools.
With this modification, you will be able to use wikipedia-suggest on any
project using MediaWiki. I downloaded the last dump and updated my
website for the ten most popular languages of Wikipedia.
I released these modifications as wikipedia-suggest-0.4, here is the
change log :
== Version 0.4, released 15-Aug-2006 ==
* Rewrote Analyzer (trie compiler) in order to read files from
download.wikipedia.org instead of home made xml format.
You can now easily use wikipedia-suggest for project that use MediaWiki.
* Replaced libicu by glib to improve performance (glib is really faster for
utf-8 case conversions).
* Included Last contribution of Nick Jenckins :
http://files.nickj.org/MediaWiki/wikipedia-suggest-0.31-diff.txt
* Replaced escape by encodeURI because escape use %uXXX synthax which is
not understood by MediaWiki (for exemple %u044 and MediaWiki wait %C5%85)
I hope you will find these modifications useful.
Best Regards.
Julien Lemoine