[Xmldatadumps-admin-l] New enwiki-Dump

Anthony wikimail at inbox.org
Thu Apr 8 21:54:53 UTC 2010


Okay:

a at A-PC:~/data/enwiki/20100130$ curl -r -20000000
http://download.wikimedia.org/enwiki/20100312/enwiki-20100312-pages-meta-history.xml.bz2>temp.bz2
a at A-PC:~/data/enwiki/20100130$ bzip2recover temp.bz2
a at A-PC:~/data/enwiki/20100130$ bunzip2 rec01414temp.bz2
a at A-PC:~/data/enwiki/20100130$ tail rec01414temp
a at A-PC:~/data/enwiki/20100130$ tail -n1 rec01414temp
:::::You might want to check if the other users that you agreed with are
still editing. Some of them tend to come online recently merely to ask for
me to be banned, and the others might have been a at A-PC
:~/data/enwiki/20100130$

It's broken.

On Thu, Apr 8, 2010 at 5:49 PM, Anthony <wikimail at inbox.org> wrote:

> Hmm...
>
> a at A-PC:~/data/enwiki/20100130$ curl -r -20000000
> http://download.wikimedia.org/enwiki/20100312/enwiki-20100312-pages-meta-history.xml.bz2>temp.bz2
> a at A-PC:~/data/enwiki/20100130$ bzip2recover
> temp.bz2                            bzip2recover 1.0.5: extracts blocks from
> damaged .bz2 files.
> bzip2recover: searching for block boundaries ...
> bzip2recover: sorry, I couldn't find any block boundaries.
>
>
> On Thu, Apr 8, 2010 at 5:38 PM, Anthony <wikimail at inbox.org> wrote:
>
>> Anyone know how to download just the tail end (few megabytes) of a file?
>> If so, do that, bzip2recover, and read the tail end to see if it ends with
>> </mediawiki> for a quick check.
>>
>>
>> On Thu, Apr 8, 2010 at 2:50 PM, Andreas Meier <andreasmeier80 at gmx.de>wrote:
>>
>>> Hello,
>>>
>>> a new enwiki-dump is ready, but izs soze is only 178.7 GB, the dump
>>> before had a size of 280.3 GB. Do you have changed the compression or is
>>> the new dump corrupted?
>>>
>>> Best regards
>>>
>>> Andreas
>>>
>>> _______________________________________________
>>> Xmldatadumps-admin-l mailing list
>>> Xmldatadumps-admin-l at lists.wikimedia.org
>>> https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-admin-l
>>>
>>
>>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.wikimedia.org/pipermail/xmldatadumps-admin-l/attachments/20100408/1846f738/attachment-0001.htm 


More information about the Xmldatadumps-admin-l mailing list