Lars Aronsson wrote:
Gerard Meijssen wrote:
As the Dutch spelling will change in August 2006,
we added
things like spelling authorities.
Great, then you can introduce a new authority for every century of
a language, so the 19th century Swedish can be told apart from
20th century Swedish. But how are spelling authorities different
from languages? Couldn't "19th century Swedish" and "Dutch after
2006" be treated as languages of their own?
No, an authority is just that. When the Spelling is marked as
depreciated, you change the record in ValidSpelling. The reason for this
extra table is, that the word "paardenbloem" will be depreciated and the
older version of "paardebloem" is to be appreciated again. The Spelling
does give a date, that one is about when it was introduce. A spelling
authority is an organisation that decides on a specific spelling. The
NTU is such an organisation for the Dutch language, the change in 2006
follows the change of 1996.
As I said in a previous mail, this was created because I will have to
accomodate this change in a contemporary language. When 19th century
Swedish changed to a more modern version, the old spelling can be
depreciated by adding the ValidUntil in a ValidSpelling record. The old
spelling and the new Spelling are related by a Relation record.
So the design will be set for some time allowing
us to learn
what it is we have. It will be changed when we know what to
change and why. The requirements for content will be minimal and
improvements will happen because of a collective effort.
Have you started to populate the database yet? What cycle span do
you plan for between evaluations and redesigns?
At this moment I am working on the database design, Erik is working on
Wikidata and when he has finished his part, we will work on the
implementation as described on Meta. As to data, there are several many
that will be included into the Ultimate Wiktionary. The content of the
Wiktionaries is important among these. We have a wordlist in
Stellingwerfs of 18.865 waiting in the wings enough to rank it as the
sixth biggest Wiktionary. I have been informed that another version will
have some 20K+ articles .. We have some 222.930 correctly spelled Dutch
words that we may use (only spelling with hypenations). We have the
content of several glossaries and thesauri.
So at this stage questions on the database stuff are very welcome. We
do not expect to start including data until somewhere September. But
when we do, it will be available for everybody to play with it.
Thanks,
Gerard