[Mediawiki-l] Help Reqd: Extract Portal:ComputerScience from en wikipedia dump
Pavan
yara.pavankumar at gmail.com
Tue Sep 12 04:23:56 UTC 2006
Hi All,
I want to build a mediawiki site(version:1.6.8) with all computer
science related articles from wikipedia dump.
I downloaded pages_articles.xml.bz2 sql dump from http://download.wikipedia.org.
I looked at both mediawiki and wikipedia documentation for doing this.
What i could get from the wikipedia download documentation is that
extraction of a particular namespace is possible by perl script.
How do i extract only Computer Science Portal related articles?
Any help and pointers in this is very much appreciated.
--
Thanks and Best Regards,
Pavan
More information about the MediaWiki-l
mailing list