When writing the script, don't limit the exclusion to
en.wikipedia.org; filter out any result that mentions "wikipedia" at
all. And yes, it's good that you're setting it up to alert you instead
of immediately removing the text, since Google can return false
positives.
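A rough sketch of that filter in Python, in case it helps whoever picks
this up. It assumes the search results have already been fetched into a
list of dicts with "url" and "title" keys; that shape and the names
is_wikipedia_hit and flag_for_review are made up for illustration and do
not come from any real search API. It only builds alert messages and
never edits anything.

```python
def is_wikipedia_hit(hit):
    """True if the hit's URL or title mentions 'wikipedia' (case-insensitive).

    `hit` is a hypothetical dict with optional "url" and "title" keys.
    """
    text = (hit.get("url", "") + " " + hit.get("title", "")).lower()
    return "wikipedia" in text


def flag_for_review(sentence, hits):
    """Return one alert string per hit that is not Wikipedia or an obvious mirror.

    Nothing is removed here; a human still has to check each alert,
    because an exact match can be a false positive.
    """
    return [
        f"CHECK: {sentence!r} also appears at {hit['url']}"
        for hit in hits
        if not is_wikipedia_hit(hit)
    ]
```

Matching on the bare string "wikipedia" also drops other-language
Wikipedias and mirrors that credit Wikipedia in the page title, at the
cost of skipping any unrelated page that happens to mention the word.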
On 11/23/06, Chris Picone <ccool2ax(a)gmail.com> wrote:
I've been seeing a rise in in-article copyvios. Last night I got one
in [[Content management system]]. I know that only some paragraphs have
these copyvios, and not entire articles, so complete rewrites aren't
necessary. Thus, I'm attempting to write a script that
(a) opens tabs with "Special:Random" on them
(b) selects the first sentence from each paragraph (line break)
(c) Googles the sentence
(d) if there are any exact matches not from en.wikipedia.org, puts up
a little message for me to check and remove the copyvio
(e) repeats.
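
Steps (b) and (c) above could look roughly like this in Python. This is
only a sketch under assumptions: the article text is already available as
a plain string with one paragraph per line, a "sentence" is taken to end
at the first ".", "!" or "?" followed by whitespace, and the function
names are invented for illustration. It builds the quoted search string
but does not contact Google.

```python
import re
from urllib.parse import quote_plus


def first_sentences(article_text):
    """Return the first sentence of each non-empty paragraph (line)."""
    sentences = []
    for line in article_text.split("\n"):
        paragraph = line.strip()
        if not paragraph:
            continue  # skip blank lines between paragraphs
        # Naive rule: shortest prefix ending in . ! or ? before whitespace or end.
        match = re.match(r"(.+?[.!?])(?:\s|$)", paragraph)
        sentences.append(match.group(1) if match else paragraph)
    return sentences


def exact_match_query(sentence):
    """URL-encode the sentence wrapped in double quotes for an exact-phrase search."""
    return quote_plus(f'"{sentence}"')
```

The sentence rule is deliberately crude: abbreviations such as "e.g."
will cut a sentence short, so a real script would need something
smarter before the results can be trusted.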
Problem is, all I know is AppleScript. If any of you Perl or
pywikipedia or AWB-types have another way of writing this, can someone
write it so the general community can use it to remove copyvios? (or
is this possible with AWB?)
Chris (Ccool2ax)
_______________________________________________
WikiEN-l mailing list
WikiEN-l(a)Wikipedia.org
To unsubscribe from this mailing list, visit:
http://mail.wikipedia.org/mailman/listinfo/wikien-l