Well, the method seems interesting, now I would be interested to see
some concrete translations, if you have some links.
What do you call small languages?
Le 15/09/2017 à 18:14, C. Scott Ananian a écrit :
We're tracking source/destination pairs generated
by the
ContentTranslation tool, right? Could someone point me to that
dataset? (I'm playing around with some machine translation stuff to
see if i can prototype a suggester tool that would translate edits on
wiki A to corresponding edits on wiki B.)
--scott
PS. There's some cool work being done on "zero-shot translation"; aka
bootstrapping translation tools for small languages by pre-training
them on a related language (or even an unrelated language).
Apparently that works! (Cf
https://arxiv.org/pdf/1611.04558.pdf) It
can greatly reduce the amount of data required to build a translation
model for the small language.
Is there a candidate "small wiki" that's been wanting to use
ContentTranslation which would be a good candidate for experimentation?
--
(
http://cscott.net)
_______________________________________________
Mediawiki-i18n mailing list
Mediawiki-i18n(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/mediawiki-i18n