The replacement in the XML dump was done automatically, and I did not want to
remove the timestamp from all revisions (even the current ones), because the
wiki is quite large (about 9000 articles). So I wrote my own tool to touch the
timestamps of such revisions:
https://github.com/Dmitri-Sintsov/MwDumpProcessor/commit/079cc194215632db3e…
It's not a complete solution (no real parser, and no support for extra fields
such as those LiquidThreads inserts into the dump), but it is enough for
ordinary NS_MAIN pages.
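
To illustrate what "touching" a revision timestamp means, here is a minimal
Python sketch. It is not the MwDumpProcessor code; the function name
touch_revision and the regex approach are assumptions made for illustration,
and like the tool it uses no real parser. It assumes the revision block holds
a single plain <timestamp> element in the usual dump format.

```python
import re
from datetime import datetime, timezone

# Matches a plain <timestamp>...</timestamp> element (no attributes assumed).
TIMESTAMP_RE = re.compile(r"<timestamp>[^<]*</timestamp>")


def touch_revision(revision_xml: str, when: datetime) -> str:
    """Return revision_xml with its first <timestamp> set to `when` (UTC).

    Hypothetical helper: it only rewrites the timestamp element and leaves
    every other field of the revision untouched.
    """
    # XML dumps write timestamps as ISO 8601 in UTC, e.g. 2014-08-11T05:14:00Z.
    stamp = when.astimezone(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")
    return TIMESTAMP_RE.sub(
        "<timestamp>" + stamp + "</timestamp>", revision_xml, count=1
    )


if __name__ == "__main__":
    rev = (
        "<revision><id>1</id>"
        "<timestamp>2010-01-01T00:00:00Z</timestamp>"
        "<text>edited text</text></revision>"
    )
    print(touch_revision(rev, datetime.now(timezone.utc)))
```

Only the revisions whose text was actually changed by the replacement would be
passed through such a function; all other revisions keep their original
timestamps.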
It's strange that the dump importer itself does not compare the base36 sha1,
either to warn about altered content or to import only the manually altered
revisions.
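
The kind of check I mean could look like the Python sketch below. It assumes
the <sha1> element of a revision holds the SHA-1 of the UTF-8 revision text,
written in base 36 and left-padded with zeros to 31 characters. The names
mw_sha1_base36 and is_altered are hypothetical, and the revision is taken as a
standalone XML fragment without a namespace declaration.

```python
import hashlib
import xml.etree.ElementTree as ET

BASE36_DIGITS = "0123456789abcdefghijklmnopqrstuvwxyz"


def mw_sha1_base36(text: str) -> str:
    """SHA-1 of the UTF-8 text as a base-36 string, zero-padded to 31 chars."""
    n = int(hashlib.sha1(text.encode("utf-8")).hexdigest(), 16)
    out = ""
    while n:
        n, r = divmod(n, 36)
        out = BASE36_DIGITS[r] + out
    # 31 base-36 digits are enough to hold any 160-bit value.
    return out.rjust(31, "0")


def is_altered(revision_xml: str) -> bool:
    """True if the recorded <sha1> does not match the revision's <text>.

    Assumption: a revision without a <sha1> element cannot be checked and is
    reported as not altered.
    """
    rev = ET.fromstring(revision_xml)
    recorded = rev.findtext("sha1")
    if recorded is None:
        return False
    text = rev.findtext("text") or ""
    return recorded.strip() != mw_sha1_base36(text)


if __name__ == "__main__":
    good = "<revision><text>Hello</text><sha1>%s</sha1></revision>" % (
        mw_sha1_base36("Hello")
    )
    bad = good.replace("Hello", "Hello, world")
    print(is_altered(good), is_altered(bad))
```

An importer doing this comparison could either warn about every mismatch or
import only the revisions for which is_altered returns True.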
Dmitriy
On Mon, Aug 11, 2014 at 5:14 AM, Benjamin Lees <emufarmers(a)gmail.com> wrote:
If you simply remove the timestamp from a revision in a dump, the importer
appears to happily insert it with the current time as the timestamp. This
may also cause cancer, summon Cthulhu, etc.
In addition to pywikibot, there's the Replace Text extension[0], which
ought to be able to handle what you want to do.
[0] https://www.mediawiki.org/wiki/Extension:Replace_Text