How would you construct language dictionary database?

User avatar
Liquid
Posts:149
Joined:2006-07-23, 10:30
Real Name:Marcin Koziol
Gender:male
Location: Szczecin
Country:PLPoland (Polska)
How would you construct language dictionary database?

Postby Liquid » 2008-07-19, 20:42

Hi,

I'm thinking of the technical side of language dictionary creation. Have you ever tried to create one or heard how those are structured in the database.

I guess simple dictionary needs to have the following:
- two-ways translation for a word
- reference to other words
- several meanings of the word basing on the context,
- several examples of usage of the word
- grammar informations (verb/noun, number, gender, countable or not)
- reference to the root noun/verb form

I'm interested how such thing can be put into the sql database
Marcin

Polish - native, Spanish - estudiando todos los dias, English - still trying to speak properly

User avatar
hashi
Posts:9191
Joined:2008-11-02, 2:39
Gender:male
Country:NZNew Zealand (New Zealand / Aotearoa)
Contact:

Re: How would you construct language dictionary database?

Postby hashi » 2008-11-26, 11:03

I'm currently working on such a thing for my constructed language. It's fairly basic at the moment, but it certainly is possible.
(en-nz)(ja)(sv)(it)(mi)(et)

Sono ancora qui (a volte), ma probabilmente non ti voglio parlare.

User avatar
proycon
Server Administrator
Posts:2700
Joined:2002-06-20, 20:22
Real Name:Maarten van Gompel
Gender:male
Location:Eindhoven
Country:NLThe Netherlands (Nederland)
Contact:

Re: How would you construct language dictionary database?

Postby proycon » 2008-11-27, 8:28

I have very extensively thought about this for UniLang and came up with the so-called "Universal Language Resource" (ULR) format, which is used for all UniLang vocabulary lists, phrasebooks, stories, site translations, fill-in-exercises etc.. It's not aimed at dictionaries in particular but they can also be stored in it. The format supports a certain degree of linguistic annotation (gender, case, number, transliteration, usage example, phonetic description etc.. ) and attaching multimedia (audio, imagery, video). The foundation of the format is the idea of multi-way translation between languages. Synonyms and dialectal/scriptural variants are also supported.

This is XML based rather than SQL though. For a real large-size dictionary SQL with a proper search index might be mode adequate.
You can read a bit about ULR here: devdoc.php?page=rescore .

If you're purely interested in dictionaries, you might want to have a look at XDXF instead: http://xdxf.sourceforge.net/ (also XML-based)
Language is the dress of thought -- Samuel Johnson
my homepage

User avatar
hashi
Posts:9191
Joined:2008-11-02, 2:39
Gender:male
Country:NZNew Zealand (New Zealand / Aotearoa)
Contact:

Re: How would you construct language dictionary database?

Postby hashi » 2008-11-29, 2:25

This is what I have built: http://nith.4fd.us/d/

Take a look and let me know what you think :)
(en-nz)(ja)(sv)(it)(mi)(et)

Sono ancora qui (a volte), ma probabilmente non ti voglio parlare.


Return to “General Topics”

Who is online

Users browsing this forum: No registered users and 16 guests