[Wikipedia-l] Analyzing and Visualizing the Semantic Coverage of Wikipedia

Frederick "FN" Noronha fredericknoronha at gmail.com
Fri Sep 1 18:06:43 UTC 2006


http://mangalorean.com/news.php?newstype=local&newsid=32933
Wikipedia for more Indian languages on its website

By Frederick Noronha/IANS

Bangalore, Sep 1 (IANS) More and more Indians can contribute to the
volunteer-edited Wikipedia encyclopedia project, now rated among the
top 20 websites globally.

So says Jimmy "Jimbo" Wales, founder of the Web-based free content
multilingual project.

During a visit to India, Wales noted that volunteer contributions to
the Kannada Wikipedia had been growing 22 percent and Bengali 35
percent a month.

"These growth rates are fairly high. Of course, they're growing from a
small base. But Kannada already has over 5,000 articles and is still
growing. That's really exciting. Bengali too has a growth rate of 35
percent," he said.

"It's not as bad as it was a year ago. We had almost nothing then.
Now, languages like Bengali, Kannada, Marathi are in the 3,000-5,000
article range. Hindi, Assamese over 1,000. But Hindi, a very large
language, has only 1,500 entries. That's a little surprising," Wales
told IANS.

"We still have an enormous amount of work left to do. India has 23
official languages. English has more than 10,000 articles. We aim to
have 200,000 articles for every language spoken by a million people,"
he said.

Currently, Japanese is the only non-European language among the 'big
10' of Wikipedia.

Farsi, Arabic, Korean, Thai, Chinese and Bhasa Indonesia are among
those with over 10,000 articles. In the same category are Urdu,
Bangla, Hindi, Kannada, Marathi, Tamil and Telugu.

Wales, who took part in a conference on open content in India, said he
had just started a new project called the Wikiversity.

"We want to provide a free encyclopaedia to every single person in the
planet. We also want to provide all the tools to become literate," he
said.

"Our mission reaches far beyond the Internet - even people who don't
have access to electricity. There are over one million articles in
English. And English is less than one-third of the total work of
Wikipedia.

"Unless you have a thousand articles, I don't count it as a fully
active community," he told audiences in India, urging them to do more
in this regard.

With just four full-time employees, this volunteer-driven project has
become the 17th most visited website in the world, and currently has
some 200 servers across the globe.

Wales said Wikipedia was drawing more hits than the "BBC and the CNN
combined". He cited tests that show Wikipedia had only four mistakes
per article as against the Britannica's three.

"People should have this idea that Wikipedia is pretty good and that
it's getting better all the time. I think what was surprising for most
people was not that Wikipedia had four errors per article but that
Britannica had three (per article)!" he added.

"The focus of my effort is to see how to get the initial communities
going," he said, explaining that he planned to test-hire his fifth
employee in India to encourage content to be built up.

Wales referred to the Wikipedia on a CD or on mobile phones.

Webaroo.com, a project founded by IITian Rakesh Mathur that offers
software for users to download highly compressed web content, has also
offered portions of the Wikipedia through this format.

In keeping with its free nature, anyone could copy Wikipedia content,
print it and even sell it, Wales said.

Indian techies who interacted with him suggested the time was "right"
for Indian language content creation, since the tools for doing this
had been created, including projects such as IndLinux.

Wikipedia is run on a website that allows any visitor to edit its
content. It is written collaboratively by volunteers, allowing most
articles to be changed by almost anyone with access to the website.
Wikipedia's main servers are in Tampa, Florida, with additional
servers in Amsterdam and Seoul.

Wikipedia started as an English language project Jan 15, 2001, as a
complement to the now defunct Nupedia.

As of August 2006, Wikipedia has over five million articles in many
languages, including more than 1.3 million in the English-language
version. There are 229 language editions of Wikipedia, 16 of which
have over 50,000 articles each.

There has been controversy over Wikipedia's reliability and accuracy,
with the site receiving criticism for its susceptibility to vandalism,
uneven quality and inconsistency, systemic bias, and preference for
consensus over credentials.

IANS
-
-------------------------------------------------------------------------------
Frederick Noronha http://fn.goa-india.org  9822122436 +91-832-240-9490
http://fredericknoronha.wordpress.com fredericknoronha at gmail.com



More information about the Wikipedia-l mailing list