On 1/23/08, Carl Beckhorn wrote:
The wikitext dumps are in XML format and can be
parsed pretty easily as
if they were plain text files.
Cool. Looks like the current dump is 3 Gb though, is there a subset available?
Steve
1) Choose the current flavout to avoid having all history.
2) Get the dump of a smaller wikipedia.