Hi,
I added support for the German Wiktionary; it is available in the newest
version. There is a quick test script that should get you 300k+
translations from the German Wiktionary in less than 15 minutes.
The dictionaries in 50 languages built using wikt2dict and other resources
(parallel and comparable corpora) are available here:
Please let me know if you find parsing errors.
I understand that DBpedia Wiktionary does a lot more than wikt2dict and I
do not plan to compete with that. However, adding 35+ Wiktionaries would
have been nearly impossible for me. This is a quick (and dirty) way to
extract the translations.
Cheers,
Judit
2013/7/12 Judit, Ács <acs.judit(a)sztaki.hu>
Hi All,
I created a tool to extract translations from different editions of
Wiktionary. Right now it supports 39 different Wiktionaries. It only
extracts translations and ignores the rest.
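To illustrate what translation-only extraction can look like, here is a
minimal sketch. It is not wikt2dict's actual code: the function name
extract_translations and the sample entry are made up for illustration, and
the pattern assumes the {{t|lang|word}} / {{t+|lang|word}} templates used by
the English Wiktionary. Other editions mark up translations differently,
which is presumably why each one needs its own configuration.

```python
import re

# Assumption: English-Wiktionary-style translation templates such as
# {{t|hu|kutya}} or {{t+|fr|chien|m}}. Other editions use other markup.
TRANSLATION_RE = re.compile(r"\{\{t[+-]?\|([^|}]+)\|([^|}]+)")


def extract_translations(title, wikitext):
    """Return (source word, language code, translation) triples.

    Everything in the article that is not a translation template
    (definitions, etymology, pronunciation, ...) is simply ignored.
    """
    return [
        (title, lang.strip(), word.strip())
        for lang, word in TRANSLATION_RE.findall(wikitext)
    ]


# Hypothetical excerpt of a translation section for the entry "dog".
sample = """
* French: {{t+|fr|chien|m}}
* German: {{t+|de|Hund|m}}
* Hungarian: {{t|hu|kutya}}
"""

for triple in extract_translations("dog", sample):
    print(triple)
```

A single regular expression like this only covers the simplest case; real
articles contain nested templates and irregular formatting, so parsing
errors are to be expected.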
Supported Wiktionaries:
Azerbaijani, Bulgarian, Catalan, Czech, Danish, Greek, English, Esperanto,
Spanish, Estonian, Basque, Finnish, French, Galician, Hebrew, Croatian,
Hungarian, Indonesian, Italian, Georgian, Latin, Lithuanian, Malagasy,
Dutch, Norwegian, Occitan, Polish, Portuguese, Romanian, Russian, Slovak,
Slovenian, Serbian, Swedish, Swahili, Turkish, Ukrainian, Vietnamese and
Chinese.
Adding a new Wiktionary is done via a configuration file.
Right now the beta version is available for download at:
https://github.com/juditacs/wikt2dict
Documentation is in progress; until then, the README should be enough to
get started.
Please test it and send me your feedback and bug reports.
Thanks,
Judit Ács