Hello,
Apologies for this being a repeat; I was just informed that the original ended
up being read as part of an existing thread.
I'd like to request access to anonymized Apache logs of Wikipedia user
activity.
I am creating a browsing interface for Wikipedia that requires clustered user
data (in that sense it is akin to finding articles using the Amazon
recommendation system or the earlier MovieLens recommendation system).
For this I need access to user page requests over time, preferably stored in a
database. I can provide a script that translates users' IP addresses into
unique signatures so that the users themselves remain anonymous, loads the data
into a reasonably space-efficient MySQL table, etc.
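
A minimal sketch of what such an anonymizing step might look like, assuming the
logs are in Apache Common Log Format and that a keyed hash (HMAC-SHA256) is an
acceptable way to derive the signature. The names SECRET_KEY, anonymize_ip and
parse_line are hypothetical, and the key shown is only a placeholder:

```python
import hashlib
import hmac
import re

# Hypothetical secret; it would need to be generated once and kept private,
# otherwise signatures could be reversed by hashing every possible address.
SECRET_KEY = b"replace-with-a-private-random-key"

# Matches the leading fields of an Apache Common Log Format line
# (assumed format; the real log layout may differ).
LOG_LINE = re.compile(r'^(\S+) \S+ \S+ \[([^\]]+)\] "(\S+) (\S+)[^"]*" (\d{3})')


def anonymize_ip(ip, key=SECRET_KEY):
    """Map an IP address to a stable signature using a keyed hash."""
    digest = hmac.new(key, ip.encode("ascii"), hashlib.sha256).hexdigest()
    return digest[:16]  # truncated to keep the stored column compact


def parse_line(line, key=SECRET_KEY):
    """Return (signature, timestamp, path) for a log line, or None if it does not parse."""
    m = LOG_LINE.match(line)
    if m is None:
        return None
    ip, timestamp, _method, path, _status = m.groups()
    return (anonymize_ip(ip, key), timestamp, path)
```

The same address always maps to the same signature, so page requests can still
be grouped per user, and the resulting tuples could then be bulk-inserted into
a MySQL table with whichever client library is available.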
I was told that I might need to talk to Kate about the feasibility of doing
this. Are there any existing objections to retaining anonymized Apache log data
for research purposes?
Tony Pryor