Would there happen to be a dataset of that available
somewhere?
Data is available on public labs replicas but sql is complicated to write
and likely to time out due the volume of data that is combing. Data is also
available on Hadoop Data Lake which is not public yet (it is our plan to
make it so). This data has already been used to gather such a stats. See:
https://phabricator.wikimedia.org/T149021
On Sun, Aug 13, 2017 at 10:10 AM, Morten Wang <nettrom(a)gmail.com> wrote:
Hello everyone,
I'm currently working gathering data for the Autoconfirmed article
creation trial project[1]. One of the measures we're interested in is the
number of new articles, both surviving and deleted, that is created per
day. I know that recent data is logged through EventBus, but if possible
I'd would also like to have historic stats on this (e.g. going back a
handful of years). Would there happen to be a dataset of that available
somewhere?
References:
1:
https://meta.wikimedia.org/wiki/Research:Autoconfirmed_
article_creation_trial
Cheers,
Morten
_______________________________________________
Analytics mailing list
Analytics(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics