Emmanuel Engelhart wrote:
Is there a chance to get a dump of all pictures
(or/and thumbnailed
versions) used in the french wikipedia if :
- I give a list of pictures
- I give the script to get this list of pictures
- I give the script to store the files in a directory tree.
I currently work to propose a offline-reader prototype (with the french
wikipedia content) working without (mysql, apache, mediawiki) , and the lack
of pictures is almost the last problem I can't solve.
Sure, we could do a temporary one for you until a more permanent solution is in
place.
Note that you can put together a list of used local and commons images
relatively easily:
1) Grab 'image', 'imagelinks' tables for frwiki and 'image' table
for
commonswiki from
download.wikimedia.org
2) The set of unique il_to values in imagelinks gives you the list of all base
filenames.
3) Intersect the list with the img_name values in frwiki's image table. Those
which match have files in wikipedia/fr.
4) Take the leftovers and intersect them with commonswiki's image table. Those
which match have files in wikipedia/commons.
5) Throw away any left; those are references to nonexistent files.
These lists will skip any unused image files.
-- brion vibber (brion @
pobox.com)