Lars,
I am not sure we have the data you are looking for. The data we get from
searches is only available for 60 days or less while it gets processed, and
it is deleted after that. Aggregated pageview data is kept long term; search
data is not.
So, what would be the process to request access to the raw data and what
would be the conditions for such access?
Access to raw data is normally restricted to research projects. You could
perhaps request a one-time query but, as I was saying, the data you are
looking for is not available long term.
You can read about data access here:
https://meta.wikimedia.org/wiki/Research:FAQ
Thanks,
Nuria
On Sun, Sep 4, 2016 at 4:38 AM, Lars Noodén <lars.nooden(a)gmail.com> wrote:
Thanks, Dan and Nuria, for the responses.
I see that the 'webrequest' table [1] with the current schema would have
the field with raw header containing a superset of the data I am looking
for with regard to the Wikibook:
    referer    string    Referer header of request
but I don't think I would be able to propose a generic database query
that would produce sufficiently sanitized data. At this point, I'm
looking for only the search strings.
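
As a rough illustration of what reducing a referer to only its search
string could look like, here is a minimal Python sketch. It assumes the
referring search engine puts the terms in a `q` or `query` URL parameter,
which is an assumption and not something the webrequest schema guarantees.
The helper name extract_search_terms is hypothetical.

```python
from urllib.parse import urlsplit, parse_qs

# Assumption: search terms are carried in one of these query parameters.
SEARCH_PARAMS = ("q", "query")


def extract_search_terms(referer):
    """Return the search string found in a referer URL, or None if absent."""
    if not referer:
        return None
    # Keep only the query part of the URL; host and path are discarded.
    params = parse_qs(urlsplit(referer).query)
    for name in SEARCH_PARAMS:
        values = params.get(name)
        if values:
            # parse_qs already decodes percent-escapes and '+' signs.
            return values[0].strip()
    return None
```

Calling extract_search_terms on a referer with no recognised parameter
returns None, so such rows could simply be dropped before any grouping by
book.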
I'm also not sure of the contents of uri_path or uri_query to know which
one would restrict the search to specific Wikibooks.
So, what would be the process to request access to the raw data and what
would be the conditions for such access? If I were to pursue that, as
far as a general interest research project goes, the referred search
terms could be grouped by Featured Book (plus the one, non-featured book
I am aiming for). There are about 200 English language Featured Books
[2] at the moment.
Regards,
Lars
[1] https://wikitech.wikimedia.org/wiki/Analytics/Data/Webrequest#Current_Schema
[2] https://en.wikibooks.org/wiki/Wikibooks:Featured_books#Featured_books