It's certainly an idea to keep track of what's in Wikidata, and what
types of categories and infoboxes have or have not had information
transferred.
There are some queries to give top-level round numbers for the UK and
Ireland at
https://www.wikidata.org/wiki/Wikidata:WikiProject_UK_and_Ireland#Stats
that could be adapted straightforwardly to other countries; plus queries to
examine some of the more obvious gaps and anomalies at
https://www.wikidata.org/wiki/Wikidata_talk:WikiProject_UK_and_Ireland#To-d…
Looking through the results of the Autolist queries shows that some
really quite odd subclasses are getting into the trees of these
top-level things -- in particular the subclass tree of "event" looks to
need some considerable cleaning up.
I've been meaning to take the counts down a few more levels, to see which
of the immediate sub-classes of the top-level classes seem to be most
populated (and/or most under-populated), but haven't had a moment to
do that yet.
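As a rough illustration of what "taking the counts down a level" could
mean, here is a minimal Python sketch. It is not the Autolist queries
themselves: the function name and the toy data are hypothetical, and it
assumes the subclass ("subclass of", P279) and membership ("instance of",
P31) statements have already been pulled out of Wikidata as plain pairs.

```python
from collections import defaultdict


def count_by_immediate_subclass(top, subclass_of, instance_of):
    """Count the items that fall under each immediate subclass of `top`.

    subclass_of: iterable of (child_class, parent_class) pairs
    instance_of: iterable of (item, class) pairs
    Both inputs are hypothetical stand-ins for data exported from Wikidata.
    """
    children = defaultdict(set)
    for child, parent in subclass_of:
        children[parent].add(child)

    members = defaultdict(set)
    for item, cls in instance_of:
        members[cls].add(item)

    def items_under(cls):
        # Walk the subclass tree below `cls`; `seen` guards against the
        # cycles that an untidy subclass tree can contain.
        seen, stack, items = set(), [cls], set()
        while stack:
            current = stack.pop()
            if current in seen:
                continue
            seen.add(current)
            items |= members.get(current, set())
            stack.extend(children.get(current, set()))
        return items

    return {sub: len(items_under(sub)) for sub in sorted(children.get(top, set()))}


# Toy data, invented purely for illustration.
toy_subclass_of = [
    ("festival", "event"),
    ("election", "event"),
    ("by-election", "election"),
    ("music festival", "festival"),
]
toy_instance_of = [
    ("item A", "election"),
    ("item B", "by-election"),
    ("item C", "music festival"),
]

counts = count_by_immediate_subclass("event", toy_subclass_of, toy_instance_of)
```

Sorting the resulting counts would then show which immediate sub-classes
are most (or least) populated, and an unexpectedly large or tiny count is
a hint that an odd subclass has crept into the tree.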
-- J.
On 11/03/2015 14:08, Markus Krötzsch wrote:
Hi Andrew,
This is a great idea! It would help data consumers to know what to
expect and community members to know what to put in (or where help with
imports would be appreciated). Moreover, the discussion about this list
would be a great way to structure our work in general (have documented
discussions about our goals for certain types of data). I feel that the
bot rights approval process is not the best place to decide if we strive
to have all streets or all lighthouses in.
For things that are not complete in Wikidata (yet or ever), it would
further help to provide pointers to other, more complete data sources
(and the properties we might have to link to them).
The question is how to best organise this list. Your initial example
setup already shows that this tends to become very diverse (not to say:
chaotic). One could link this from the related class items (e.g.,
lighthouses or paintings), but having this as another extra load on the
talk page would maybe not be so ideal either. After all, this could be one
of the first things that newbies to Wikidata want to get an idea about.
Cheers,
Markus
On 11.03.2015 14:07, Andrew Gray wrote:
...
I wonder if it would be useful to have a centralised list of "classes
of things in Wikidata". For example:
Things entirely in Wikidata
* MEPs
* County-level administrative divisions of all countries
* All artworks by the following people (list)
* Cultural heritage sites in the following countries (list)
* All people listed in the following biographical databases (list)
* (etc)
Things not yet entirely in Wikidata (but probably will be eventually)
* All national-level elected representatives
* All species
* Lighthouses
* All artworks by the following people (list)
* Cultural heritage sites in the following countries (list)
* All people listed in the following biographical databases (list)
Things which will never be complete in Wikidata
* All local politicians
* Streets worldwide
* All businesses
This would be a very useful adjunct to the notability page, as it
would give concrete examples to work from for the sort of things we
feel are appropriate.
_______________________________________________
Wikidata-l mailing list
Wikidata-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l