Hoi,
Is this dump going to be cleaned up? Will the next dump be good? Why did
this go wrong?
Thanks,
GerardM
On 21 October 2014 17:02, Lukas Benedix <lukas.benedix(a)fu-berlin.de> wrote:
Different keys can still be found in the actual xml
dump
wikidatawiki-20141009-pages-articles.xml.bz2.
This bug/feature
is also present in the current dump with history.
page_id wd_id keys
111 Q15 ['aliases', 'claims', 'descriptions',
'id', 'labels',
'sitelinks', 'type']
137 Q24 ['aliases', 'claims', 'description',
'entity',
'label', 'links']
31500 Q28119 ['aliases', 'description', 'entity',
'label', 'links']
225144 ? ['entity', 'redirect']
3916689 P6 ['aliases', 'claims', 'datatype',
'descriptions',
'id', 'labels', 'type']
3916937 P10 ['aliases', 'claims', 'datatype',
'description',
'entity', 'label']
Lukas
Am Do 09.10.2014 19:32, schrieb Lydia Pintscher:
On Thu, Oct 9, 2014 at 3:19 PM, Magnus Manske
<magnusmanske(a)googlemail.com> wrote:
> I managed to do the task at hand by switching to JSON dumps (because
that's
> the new, officially supported,
long-term-stable Wikidata dump format,
right?
> Right???), so no hurry there.
>
> Maybe the XML dump process was run in the middle of the switch to the
new
format,
or got a stale cache for some items?
It looks like the switch happened in the middle of a dump creation so
this one is half old and half new format mixed. The ones after that
should be all new format. And yay for switching to JSON!
Cheers
Lydia
_______________________________________________
Wikidata-l mailing list
Wikidata-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l