Hi, everyone.

Can someone explain what procedure you use to add (some) images to the dump before packaging a ZIM file?

I am preparing a fresh Hebrew Wikipedia ZIM file, and would like to test the integration of images as well as recent improvements to Kiwix.

So far I have found Wikix, but no specific instructions on including images in Emmanuel's ZIM-building script. I'm guessing it is enough to download the images and integrate them into the local Wikipedia server. Is that right?

My questions are:

1. How do you know which images are referenced by the local Wikipedia?  I see Wikix extracts this information into a bunch of shell script files, but maybe there's another/better way?  What do you use?

2. Given a list of images, what is the best way to retrieve them without pounding the Wikimedia servers?  Is there an accepted way?  Should I coordinate it with anyone?  The shell scripts generated by Wikix don't seem to make any provision for delays or anything, and I'm afraid running them would get me banned.  Again, what do you use?

3. What if we want only the thumbnail/low-res versions that appear in the articles themselves, and not the full-resolution versions from Commons etc.?

4. Once you have a local tree of the image files (in directories 0, 1, 2,..., f), what else do you need to do to get Emmanuel's buildZim....pl script to include them in the ZIM file?
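
To make questions 2 and 4 a bit more concrete, here is a rough Python sketch of what I have in mind. It assumes the standard MediaWiki hashed upload layout (first hex digit of the MD5 of the file name, then the first two digits, then the name), which I believe is what the 0..f directories are. The function names, the `fetch` callback and the one-second delay are my own placeholders, not anything taken from Wikix or from Emmanuel's script, and I don't know whether one second is considered polite enough.

```python
import hashlib
import time
from typing import Callable, Iterable, List


def hashed_path(filename: str) -> str:
    """Relative path under the assumed MediaWiki hashed upload layout."""
    # MediaWiki stores names with underscores and an upper-case first letter.
    name = filename.replace(" ", "_")
    name = name[:1].upper() + name[1:]
    digest = hashlib.md5(name.encode("utf-8")).hexdigest()
    return f"{digest[0]}/{digest[:2]}/{name}"


def fetch_politely(
    filenames: Iterable[str],
    fetch: Callable[[str], None],
    delay_seconds: float = 1.0,  # placeholder value, not an official limit
    sleep: Callable[[float], None] = time.sleep,
) -> List[str]:
    """Call fetch(path) once per file, pausing between consecutive requests."""
    paths = []
    for i, filename in enumerate(filenames):
        if i > 0:
            sleep(delay_seconds)  # wait before every request except the first
        path = hashed_path(filename)
        fetch(path)  # hypothetical downloader supplied by the caller
        paths.append(path)
    return paths
```

For a dry run, something like `fetch_politely(["Example.jpg"], fetch=print, delay_seconds=0)` just prints the relative path that would be requested, without touching the network.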

Many thanks in advance,

  Asaf Bartov
  Wikimedia Israel

--
Asaf Bartov <asaf@forum2.org>