"Magnus Manske" wrote
> Sounds like there is an interesting exercise for
a statistician here: by sampling from the larger Wikipedias, estimate the total number of
article topics in them, taken collectively.
Easy:
all articles on en.wikipedia
+ all articles on other wikipedias that do *not* have an interlanguage
link to en.wikipedia (substract doublettes that are connected in an
"interwiki web" not including en.wikipedia; count these as 1)
There, all done :-)
You have a touching faith that interwiki is 100% efficient.
I think you've found a plausible upper bound, though.
Charles
-----------------------------------------
Email sent from
www.virginmedia.com/email
Virus-checked using McAfee(R) Software and scanned for spam