Re: [Wikitech-l] Ultimate Wiktionary and design decisions

27 Jul 2005

On Monday 25 July 2005 11:28, Gerard Meijssen wrote:
...
  Nikola Smolenski wrote:
 On Monday 25 July 2005 09:02, Andrew Dunbar
wrote:
 On 7/24/05, Gerard Meijssen
&lt;gerard.meijssen(a)gmail.com&gt; wrote:
accent according to the offical orthographical rules, Russian (and some
other Cyrllic script languages) can optionally indicate where the stress
is and in some contexts it is the norm. With Hebrew and most 
Yes, and one of the places where it is the norm is in a dictionary ;)

Regardless of how is this resolved in the end, it would make sense to
 built in at least some ability of determining such things automatically.
 You don't want to duplicate entire Russian corpus (with inflections, it
 could easily rise to ten million words), so that you could have each one
 of them with and without diacritics. It makes sense to have only
 canonical spellings in the dictionary, and a bit of code to offer nearest
 match when someone tries to retrieve a word spelled in a different way. 
 As a matter a fact I do want all inflections even if they are ten
 million words. Now I do not expect to have all these inflections to 
Of course, but do you want ten million words with diacritics, and then ten 
million of same words without diacritics?

...
   From a database point of view a Word has one
Spelling. That is given
 the ERD very much technical and non negotialbe. It is the Spelling that
 is validated by a Spelling Authority. 
Simply change relation spelling.spellingID-word.spellingID from 1-inf to 
inf-inf. That way each word could have multiple spellings, each validated by 
a spelling authority.

By the way, what is table "table" (related to "pronunciation") for?

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

2003

2002

Re: [Wikitech-l] Ultimate Wiktionary and design decisions