[Xmldatadumps-admin-l] [Xmldatadumps-l] 2010-03-11 01:10:08: enwiki Checksumming pages-meta-history.xml.bz2 :D

Felipe Ortega glimmer_phoenix at yahoo.es
Thu Mar 11 16:38:56 UTC 2010


--- El jue, 11/3/10, Tomasz Finc <tfinc at wikimedia.org> escribió:

> De: Tomasz Finc <tfinc at wikimedia.org>
> Asunto: Re: [Xmldatadumps-l] 2010-03-11 01:10:08: enwiki Checksumming pages-meta-history.xml.bz2 :D
> Para: "Wikimedia developers" <wikitech-l at lists.wikimedia.org>, xmldatadumps-admin-l at lists.wikimedia.org, Xmldatadumps-l at lists.wikimedia.org
> Fecha: jueves, 11 de marzo, 2010 09:42
> Tomasz Finc wrote:

> We now have an md5sum for
> enwiki-20100130-pages-meta-history.xml.bz2.
> 
> "65677bc275442c7579857cc26b355ded"
> 
> Please verify against it before filing issues.
> 
> --tomasz
> 

One question, Tomasz: did you use pbzip2 to compress the file? 

If so, then we can decompress the 280GB file with pbzip2 more efficiently (since it compresses the data in individual chunks that can be sent to different cores/CPUs). Otherwise, plain bzip2 is preferred.

Thanks in advance.

Regards,
Felipe.

> 
> _______________________________________________
> Xmldatadumps-l mailing list
> Xmldatadumps-l at lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l
> 


      



More information about the Xmldatadumps-admin-l mailing list