[WikiEN-l] Copyright Violation Bot

James Hare messedrocker at gmail.com
Thu Nov 23 15:59:21 UTC 2006


When writing the script, don't limit it to excluding en.wikipedia.org --
in fact, filter out any result that mentions "wikipedia". And yes, it's
good that you're setting it up to alert you instead of immediately removing
the text, since Google can return false positives.
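
A rough sketch of the filtering idea in Python, in case it helps whoever picks this up. Everything here is an assumption for illustration: the function names are made up, paragraphs are taken to be one per line, the sentence splitter is deliberately naive, and the search results are assumed to arrive as plain dicts with "url", "title" and "snippet" keys from whatever search tool ends up being used. It only reports candidates; it never edits anything.

```python
import re


def first_sentences(article_text):
    """Return the first sentence of each non-empty paragraph (one paragraph per line)."""
    sentences = []
    for paragraph in article_text.splitlines():
        paragraph = paragraph.strip()
        if not paragraph:
            continue
        # Naive split: cut at the first ., ! or ? that is followed by whitespace
        # or the end of the paragraph. Abbreviations will fool it.
        match = re.match(r"(.+?[.!?])(?:\s|$)", paragraph)
        sentences.append(match.group(1) if match else paragraph)
    return sentences


def mentions_wikipedia(result):
    """True if any field of a search result mentions 'wikipedia' (case-insensitive)."""
    return any("wikipedia" in str(value).lower() for value in result.values())


def suspicious_hits(sentence, results):
    """Return URLs of results that quote the sentence exactly and never mention Wikipedia."""
    # Collapse whitespace and lowercase so trivial formatting differences don't matter.
    needle = " ".join(sentence.split()).lower()
    hits = []
    for result in results:
        if mentions_wikipedia(result):
            continue  # skips en.wikipedia.org and anything else that names Wikipedia
        snippet = " ".join(result.get("snippet", "").split()).lower()
        if needle in snippet:
            hits.append(result.get("url", ""))
    return hits
```

Anything that suspicious_hits returns is only a candidate for a human to look at, since an exact match can just as easily be a site that copied from us without saying so.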

On 11/23/06, Chris Picone <ccool2ax at gmail.com> wrote:
>
> I've been seeing a rise in in-article copyvios. Last night I got one
> in [[Content managment system]]. I know that only some paragraphs have
> these copyvios, and not entire articles, so complete rewrites aren't
> necessary. Thus, I'm attempting to write a script that
> (a) opens tabs with "Special:Random" on them
> (b) selects the first sentence from each paragraph (line break)
> (c) Googles the sentence
> (d) If there are any exact matches not from en.wikipedia.org, put up a
> little message for me to check and remove the copyvio.
> (e) repeat.
>
> Problem is, all I know is AppleScript. If any of you Perl or
> pywikipedia or AWB-types have another way of writing this, can someone
> write it so the general community can use it to remove copyvios? (or
> is this possible with AWB?)
>
> Chris (Ccool2ax)
> _______________________________________________
> WikiEN-l mailing list
> WikiEN-l at Wikipedia.org
> To unsubscribe from this mailing list, visit:
> http://mail.wikipedia.org/mailman/listinfo/wikien-l
>


