On Nov 26, 2007 12:13 PM, Anthony <wikimail(a)inbox.org> wrote:
On Nov 25, 2007 9:59 PM, Steve Bennett
<stevagewp(a)gmail.com> wrote:
Meanwhile my watchlist is totally useless as
it's full of pages that I
lost interest in over a year ago. There's no way to trim the fat
either - I can either manually remove every single old page, or I can
clear the entire lot. But how do I clear just the pages I haven't
edited in more than six months?
Sounds like a trivial query for anyone with toolserver access to run.
Someone might have already set up a script to output all pages that
haven't been edited in X days. Then download the list at
http://en.wikipedia.org/wiki/Special:Watchlist/raw , run "sort file1
file2 | uniq -u", and import the new list using
http://en.wikipedia.org/wiki/Special:Watchlist/raw
Err, that command was wrong. You want set subtraction, not symmetric
difference. Apparently the correct command is:
$ sort -m file1 file2 | uniq -c | grep "^ *1" | tr -s " " | cut -f3
-d" " > IP1_symmetric_diff_IP2
(
http://accu.org/index.php/journals/1410)
But I don't know enough about tr or cut to say if that's right.