I'm working on the cleanup right now.
I wrote a script for it.
The CSV files are cleaned up now.
The HTML files still contain private wiki data.
This week new counts have been generated from recent dumps (as so often,
wp:en: is lagging behind).
I hope to start the reports job today to produce clean, up-to-date HTML files.
I'll post on wikitech when all is finished.
Some background on what happened:
Someone created a new wiki and forgot to mark it as confidential.
The Wikistats job processes all wikis listed in several *.dblist files in
/home/wikipedia/commons
So the new wiki was included automatically.
Hence confidential article titles were browsable through the ZeitGeist
feature.
(Brion, wikimania2007 is in the wikipedia dblist; I'd expect it in
special.dblist. Anyway, all mania dumps are now excluded.)
I'd rather not do this cleanup job a second time. From now on I'll use
copies of the dblists and sync updates to those manually every now and then.
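
To illustrate the idea, here is a rough sketch in Python (not the actual Wikistats code) of working from local dblist copies. It assumes each dblist holds one wiki name per line and that '#' starts a comment. The function names, directory arguments and exclusion set are made up for this example.

```python
from pathlib import Path


def read_dblist(path):
    """Return the set of wiki names in one dblist file (one name per line)."""
    names = set()
    for line in Path(path).read_text().splitlines():
        name = line.split("#", 1)[0].strip()  # assumption: '#' starts a comment
        if name:
            names.add(name)
    return names


def read_all(directory):
    """Union of the names in every *.dblist file found in a directory."""
    names = set()
    for path in sorted(Path(directory).glob("*.dblist")):
        names |= read_dblist(path)
    return names


def plan_sync(live_dir, local_dir, excluded):
    """Compare the live dblists with the local copies.

    Returns (to_process, needs_review):
      to_process   - wikis in the local copies, minus the excluded ones
      needs_review - wikis that appear only in the live lists and should be
                     checked by hand before they are copied over
    """
    excluded = set(excluded)
    live = read_all(live_dir)
    local = read_all(local_dir)
    to_process = local - excluded
    needs_review = live - local - excluded
    return sorted(to_process), sorted(needs_review)
```

With this approach a newly created wiki only shows up in needs_review. It is not processed until someone has looked at it and added it to the local copy.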
Erik Zachte
-----Original Message-----
From: Brion Vibber [mailto:brion@pobox.com]
Sent: Sunday, 19 November 2006 18:43
To: wikipedia-l(a)Wikimedia.org
Subject: Re: [Wikipedia-l] Wikipedia Statistics
Brion Vibber wrote:
Parker Conrad wrote:
> Hi -- there used to be (as of a month or two ago) a very useful website at
> included historical Wikipedia growth statistics -- very helpful for those
> of us who are trying to study the phenomenal growth of this community.
> Unfortunately, it appears to have been taken down, or perhaps moved. Does
> anyone know where I can access it?
These pages are down pending removal of some accidentally added private
info.
I'll go ahead and try to fix these tonight; hopefully it won't be that
hard...
-- brion vibber (brion @ pobox.com)
_______________________________________________
Wikipedia-l mailing list
Wikipedia-l(a)Wikimedia.org
http://mail.wikipedia.org/mailman/listinfo/wikipedia-l