* Khalida BEN SIDI AHMED wrote:
I need an Html dump of Wikipedia because I have written
a java code which
extract text from an html content and I would like to apply it on this
dump. In fact I need to extract the first sentence of a list of articles
(<200) and I don' know how to do it on other dumps. If you have any idea of
other solutions, I will be pleased if you share them with me.
If you just need a few articles, you can simply use the online version.
There are any number of tools that would help you to batch the requests
without hitting the server too much, `wget` and `curl` are popular ones.
--
Björn Höhrmann · mailto:bjoern@hoehrmann.de ·
http://bjoern.hoehrmann.de
Am Badedeich 7 · Telefon: +49(0)160/4415681 ·
http://www.bjoernsworld.de
25899 Dagebüll · PGP Pub. KeyID: 0xA4357E78 ·
http://www.websitedev.de/