[Mediawiki-l] Help Reqd: Extract Portal:ComputerScience from en wikipedia dump

Pavan yara.pavankumar at gmail.com
Tue Sep 12 04:23:56 UTC 2006


Hi All,

I want to build a MediaWiki site (version 1.6.8) containing all the
computer science related articles from the Wikipedia dump.
I downloaded the pages_articles.xml.bz2 XML dump from http://download.wikipedia.org.

I looked at both the MediaWiki and the Wikipedia documentation for doing this.
What I gathered from the Wikipedia download documentation is that
extracting a particular namespace is possible with a Perl script.
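
For illustration, here is a minimal sketch of what namespace extraction
from the XML dump might look like, written in Python rather than Perl.
It assumes the dump follows the usual MediaWiki export layout (<page>
elements containing <title> and <text>) and that namespace membership
can be read off the title prefix, e.g. "Portal:". The names local_name,
iter_pages and open_dump are hypothetical helpers, not part of any
MediaWiki tool. Note that this only selects pages by title; it does not
by itself collect the articles a portal links to.

```python
import bz2
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip the '{namespace-uri}' prefix that ElementTree puts on tags."""
    return tag.rsplit("}", 1)[-1]


def iter_pages(stream, title_prefix):
    """Yield (title, text) for each <page> whose title starts with title_prefix.

    Assumption: the stream is a MediaWiki XML export with <page>, <title>
    and <text> elements. The export schema URI differs between versions,
    so tags are matched by local name only.
    """
    for _event, elem in ET.iterparse(stream, events=("end",)):
        if local_name(elem.tag) != "page":
            continue
        title, text = None, ""
        for child in elem.iter():
            name = local_name(child.tag)
            if name == "title":
                title = child.text or ""
            elif name == "text":
                text = child.text or ""
        if title is not None and title.startswith(title_prefix):
            yield title, text
        # Drop the page's children so a large dump does not pile up in memory.
        elem.clear()


def open_dump(path):
    """Open a .bz2-compressed dump as a binary stream without unpacking it."""
    return bz2.open(path, "rb")
```

Usage would be along the lines of
`for title, text in iter_pages(open_dump("pages_articles.xml.bz2"), "Portal:"): ...`,
writing out whatever subset is needed for a later import.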

How do I extract only the articles related to the Computer Science portal?
Any help or pointers would be very much appreciated.
-- 
Thanks and Best Regards,
Pavan


