Maarten Dammers wrote:
> On 19-5-2010 13:45, Lars Aronsson wrote:
>> I tried to parse the year, I was successful for
>> 3.5 million files.
>> (Maybe I didn't try very hard.)
> I guess you used a regex. Which one
> exactly? Or did you publish your
> code somewhere?
No, I did not publish my code or regex, and I don't intend to.
This was a quick hack, and I know I might have missed lots
of files. For example, just one random image from the huge
Bundesarchiv image donation has a "Date=0-00-00",
http://commons.wikimedia.org/wiki/File:Bundesarchiv_Bild_147-0435,_Wolfgang…
(It's a mystery to me why this is displayed as "november 1999".)
Then again, another random Bundesarchiv image has
"Date=1950-07-05", which should be covered by my hack,
http://commons.wikimedia.org/wiki/File:Bodo_Uhse.jpg
We would have far fewer images from the 1950s if it weren't
for this donation.
I want to encourage others to invent their own regex
and see whether they get results different from mine. My numbers
are posted on the talk page of the graph.
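
As a starting point for such an experiment, a minimal sketch in Python
might look like the following. It is not the unpublished hack discussed
above: the pattern, the function name extract_year, and the accepted
year range are all assumptions made for illustration, and real file
descriptions will contain many date formats it does not cover.

```python
import re

# Hypothetical pattern (an assumption, not the regex used for the graph):
# a "Date=" field, case-insensitive, optional spaces around "=",
# followed by exactly four digits for the year.
DATE_FIELD = re.compile(r"\bdate\s*=\s*(\d{4})\b", re.IGNORECASE)


def extract_year(wikitext, min_year=1000, max_year=2010):
    """Return the first plausible year found in a Date= field, or None.

    The year range is an arbitrary sanity check chosen for this sketch.
    """
    for match in DATE_FIELD.finditer(wikitext):
        year = int(match.group(1))
        if min_year <= year <= max_year:
            return year
    return None


if __name__ == "__main__":
    for sample in ("Date=1950-07-05", "Date=0-00-00"):
        print(sample, "->", extract_year(sample))
```

A field like "Date=1950-07-05" yields a year here, while "Date=0-00-00"
is skipped because it has no four-digit year, so files like that one
would be missing from the count.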
--
Lars Aronsson (lars(a)aronsson.se)
Aronsson Datateknik -
http://aronsson.se