Hi all,
the first phase of Wikidata will help to centralize many of the
Wikipedia language links. We did a small analysis to figure out the
possible impact of this step.
Here are a few highlights:
* there are more than 240 million language links in the Wikipedias
* they are responsible for about 5 GB of wikitext
* on average, 33% of a Wikipedia language edition's wikitext is due to
language links - in several Wikipedias, more than 75% of the wikitext
are just language links!
The full data set is available here:
<http://simia.net/languagelinks/>
It would be interesting to also figure out how many of the edits are
due to changes to the language links. If anyone wants to try that...?
Enjoy the data,
Denny
--
Project director Wikidata
Wikimedia Deutschland e.V. | Obentrautstr. 72 | 10963 Berlin
Tel. +49-30-219 158 26-0 |
http://wikimedia.de
Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e.V.
Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg
unter der Nummer 23855 B. Als gemeinnützig anerkannt durch das
Finanzamt für Körperschaften I Berlin, Steuernummer 27/681/51985.