[Wikitech-l] Wikipedia full dump (English) broken link?

27 Nov 2003

Hi,

Here at CU we work with corpora of text to train models that 'understand' 
language (see, e.g., LSA.colorado.edu). We wanted to use Wikipedia to 
create a copyright-free corpus of text that anyone in the scientific 
community could use. To do that we downloaded the DB dumps a while ago 
(about 2 billion words), but due to a computer problem, we lost them.

I have noticed that the link to the full English database (2280MB):
http://download.wikipedia.org/archives/en/20031125_old_table.sql.bz2

doesn't work anymore; it returns a Forbidden error saying that
you don't have permission to access 
/archives/en/20031125_old_table.sql.bz2 on this server.

Could you please grant us access to the file?

Thanks a lot in advance,
-Jose

-- 
Jose Quesada, PhD.

quesadaj(a)psych.colorado.edu             Research associate
http://lsa.colorado.edu/~quesadaj       Institute of Cognitive Science
					University of Colorado (Boulder)

Muenzinger psychology building          Phone:303 492 1522
office D447A						Fax:  303 492 7177
Campus Box 344
University of Colorado at Boulder
Boulder, CO 80309-0344
