[WikiEN-l] Copyright Violation Bot

Chris Picone ccool2ax at gmail.com
Thu Nov 23 15:55:32 UTC 2006


I've been seeing a rise in in-article copyvios. Last night I got one
in [[Content managment system]]. I know that only some paragrahs have
these copyvios, and not entire articles, so complete rewirtes aren't
necessary. Thus, I'm attempting to write a script that
(a) opens tabs with "Special:Random" on them
(b) select the first setence from each paragraph (line break)
(c) Google the sentence
(d) If there are any exact matches not from en.wikipedia.org, put up a
little message for me to check and remove the copyvio.
(e) repeat.

Problem is, all I know is Applescript. If any of you Perl or
pywikipedia or AWB-types have another way of writing this, can someone
write it so the general community can use it to remove copyvios? (or
is this possible with AWB?)

Chris (Ccool2ax)



More information about the WikiEN-l mailing list