Language data format
Posted: Sun Oct 12, 2014 10:54 am
Hello all!
I am currently investigating existing data formats to store information about languages (phonology, phonotactics, morphology, syntax, etc.). I was hoping I could find something that I could use to make a couple of tools while still being able to use other tools found on the internet.
It turns out I couldn't find such standard in the conlanging community. I investigated the Computerized Conlang Creator Project but there's very few and what there is on the website is more focused on file format than data format (the part on data format is more of a first draf proposal than a usable format).
I then ran into the Lexical Markup Framework. An academic effort that seem very extensive and complete. However, as with most academic work, no proper useable tool is available for it (I know about COLDIC but man do academic people have no sense of practicity and usability).
Do you know any other data format for describing languages? Do you use one?
I'd be curious to know if anyone has tried to use LMF. If so, please share your experience. It seems very complete but also possibly too focused on natural languages. I couldn't find a descriptor for the way sounds are produced, for example.
Any insight or experience sharing on the matter would be appreciated.
I am currently investigating existing data formats to store information about languages (phonology, phonotactics, morphology, syntax, etc.). I was hoping I could find something that I could use to make a couple of tools while still being able to use other tools found on the internet.
It turns out I couldn't find such standard in the conlanging community. I investigated the Computerized Conlang Creator Project but there's very few and what there is on the website is more focused on file format than data format (the part on data format is more of a first draf proposal than a usable format).
I then ran into the Lexical Markup Framework. An academic effort that seem very extensive and complete. However, as with most academic work, no proper useable tool is available for it (I know about COLDIC but man do academic people have no sense of practicity and usability).
Do you know any other data format for describing languages? Do you use one?
I'd be curious to know if anyone has tried to use LMF. If so, please share your experience. It seems very complete but also possibly too focused on natural languages. I couldn't find a descriptor for the way sounds are produced, for example.
Any insight or experience sharing on the matter would be appreciated.