Hello, you can take a look to project templatetiger (from Stefan Kühn
and me) that extract all templates from some wikis by parsing the dumps.
Example Template:Information from Commons:
Disadvantage: The update-intervall is only 2-3 months.
With Toolserver-account you would have access to the database of the tool.
With Toolserver-account you could also ask the commons-DB to give you
e.g. all images with Template:PD-old back.
I'm waiting also for years that the Foundation solve this problem of
structual data.
Some activities from Wikimedia seems to be in
but I'm not sure if they will come to a result.
Greetings Kolossos
Am 19.08.2011 22:58, schrieb Primavera De Filippi:
Hi all,
I write to you on behalf of the public domain working grouf of the
Open Knowledge Foundation. We are currently developing an automated
system to identify the legal status of different types of works (i.e.
to determine whether or not they are in public domain). In order to do
this, we need to gather the necessary metadata to determine the legal
status of these works. This includes information such as title,
author, date of publication, etc.
You can find more information about the project on our site
http://publicdomain.okfn.org/calculators.
A preliminary implementation of the project can be seen at
www.publicdomainworks.net (site still under development).
Incorporating the metadata from the Wikimedia Commons archive into our
database would be extremely useful both for us, since it would greatly
increase the quality of our results.
eg. in the case of
http://commons.wikimedia.org/wiki/File:Cyphoma_signatum_(Fingerprint_Cowry_…
- we would like to retrieve the information from the Summary section
If I understood correctly, the metadata regarding the works of the
archive is primarily text/html based.
Hence, I would like to know (a) whether there exists a database where
this metadata can be retrieved, or alternatively (b) whether would you
be interested in switching to a more structured database contained all
the relevant metadata about those works?
Looking forward to your answer,
Primavera