Jeremy Dunck wrote:
On 10/17/05, Jakob Voss <jakob.voss(a)nichtich.de>
wrote:
Hi!
Once there was the size of all Wikipedia database dumps at
download.wikipedia.org.
I was just pondering this yesterday. Samuel is master with 6x73 GB=
438 GB. Of course, that's not in dump form.
Jakob, this ties in with the earlier request for wikipedia-by-mail. I
was thinking of doing a
Fundable.org drive for an array so that I
could serve those requests, but perhaps using the Tool server makes
more sense...
Samuel's InnoDB data files are about 290 GB, but it's likely most of
that is free space. There's also about 100 GB distributed across our
external storage DBs; the hypothesized free space in samuel is because
we moved a lot of the text out of it and into external storage. It's all
compressed with gzip.
The current total size of all the pages_full.xml.bz2 files from the
latest dump is 14 GB. In total, the wikipedia directory on the download
server is using 236 GB, thanks mostly to image tarballs and poorly
compressed copies of the text.
-- Tim Starling