Help me build lexical databases for romance and english

Discussion of natural languages, or language in general.
Post Reply
Radagast the Third
Sanci
Sanci
Posts: 17
Joined: Sun Oct 16, 2016 10:08 am

Help me build lexical databases for romance and english

Post by Radagast the Third »

Hi folks, I am back in a new incrarnaiton to ask for some help with menial labor in elaborating some comparative wordlists that will serve for all kinds of purposes - most specifically as a baseline for various attempts to create software that can classify languages. The database will be open for anyone to use, so if you have any thing you might use something like this for, you can benefit from the work when it is done.

I am beginning a hobby project of creating a program that can evaluate similarities and differences between related languages. But to make the program work I need to have databases that I can use to evaluate how accurate it is. To do this I was hoping you all might help me crowd source wordlists for Romance languages and English dialects to use as a baseline.

If you would like to participate you can follow these links to a couple of google spreadsheets.

Please type in a broad IPA based transcription - preferably phonemic. Feel free to add varieties if you have good data for them.

Please also mention which source you are working from by typing it into the first row where the language name is.

This one is for English dialects:
https://docs.google.com/spreadsheets/d/ ... p=sharing

This one is for Romance languages:
https://docs.google.com/spreadsheets/d/ ... sp=sharing

User avatar
mèþru
Smeric
Smeric
Posts: 1984
Joined: Thu Oct 29, 2015 6:44 am
Location: suburbs of Mrin
Contact:

Re: Help me build lexical databases for romance and english

Post by mèþru »

Welcome back! Have some pickles and tea!
ImageImage
This might interest you.
ìtsanso, God In The Mountain, may our names inspire the deepest feelings of fear in urkos and all his ilk, for we have saved another man from his lies! I welcome back to the feast hall kal, who will never gamble again! May the eleven gods bless him!
kårroť

User avatar
Ser
Smeric
Smeric
Posts: 1542
Joined: Sat Jul 19, 2008 1:55 am
Location: Vancouver, British Columbia / Colombie Britannique, Canada

Re: Help me build lexical databases for romance and english

Post by Ser »

Eh, only 216 items, I'll help you filling out the Spanish, French and Latin columns.

I'll use accusative forms of words for Latin though, e.g. nigrum for 'black' instead of niger, since that's the form that Romance words (almost always) come from.

I sent you a request for getting editing powers.

Radagast the Third
Sanci
Sanci
Posts: 17
Joined: Sun Oct 16, 2016 10:08 am

Re: Help me build lexical databases for romance and english

Post by Radagast the Third »

Yes, only 216 items, so it is doable. If you think specific items are missing that would be illuminating you are free to add them as well.

Thanks for wanting to help, I have granted your access request.

User avatar
Salmoneus
Sanno
Sanno
Posts: 3197
Joined: Thu Jan 15, 2004 5:00 pm
Location: One of the dark places of the world

Re: Help me build lexical databases for romance and english

Post by Salmoneus »

If you don't mind my asking, why does the English one focus on the relatively minor differences between the accents of different parts of the US, while ignoring the dialect differences in the UK (outside of London)? I know most of those dialects don't have that many millions of speakers, relatively speaking... but then neither does Jamaican English.

You may also want to clarify exactly what you mean by terms like 'Cockney' and 'RP'. It's debatable whether anybody speaks either of them any more. So for 'RP', do you mean actual RP, or do you mean modern SSBE? And for 'Cockney' do you mean 19th and early 20th century cockney proper, or modern MLE (the sociological successor to cockney), or the subdialects of MLE that retain most of their cockney heritage and most avoid 'contamination' from black speakers?

I'd be happy to give you SSBE pronunciations, if you want.

However, you may want to note that all of that (and more) is already on wikipedia.
Blog: [url]http://vacuouswastrel.wordpress.com/[/url]

But the river tripped on her by and by, lapping
as though her heart was brook: Why, why, why! Weh, O weh
I'se so silly to be flowing but I no canna stay!

User avatar
Ser
Smeric
Smeric
Posts: 1542
Joined: Sat Jul 19, 2008 1:55 am
Location: Vancouver, British Columbia / Colombie Britannique, Canada

Re: Help me build lexical databases for romance and english

Post by Ser »

I also don't think it's feasible to have a column for Vulgar Latin. Vulgar Latin basically refers to the spoken sociolects of Latin across various centuries (especially between the 1st c. BC and roughly the 8th century) and places, a concept particularly useful when a word is attested in Romance in multiple places (especially early on) but it's not in written Latin, like Old Spanish señero/a and Old Galico-Portuguese senlleiro/a, from vulg. Lat. *singularius/a/um...

Radagast the Third
Sanci
Sanci
Posts: 17
Joined: Sun Oct 16, 2016 10:08 am

Re: Help me build lexical databases for romance and english

Post by Radagast the Third »

Good points.

As for English I want to have the most divergent varieties represented. I did want to include west-country dialects, geordie, etc. But I am not sure there is enough data available. If you have data for other UK dialects you are more than welcome to add them.

RP is of course not spoken by a lot of people anymore, but there is a lot of data - but I guess SSBE would be equally useful (so please do add it Salmoneus). For Cockney I would prefer the most divergent forms (i.e. probably 18th/19th century).

It is in each list a case of the more the better. The more varieties that can be included the better. We want to have maximum diversity in the lists for them to be able to work well as a baseline for comparison with other language groups.


By the way, here is a lot of data for Romance.http://www.soundcomparisons.com/#/en/Ro ... ht/Lgs_All

User avatar
Salmoneus
Sanno
Sanno
Posts: 3197
Joined: Thu Jan 15, 2004 5:00 pm
Location: One of the dark places of the world

Re: Help me build lexical databases for romance and english

Post by Salmoneus »

To clarify, are you interested in phonology only, or also lexicon?
Blog: [url]http://vacuouswastrel.wordpress.com/[/url]

But the river tripped on her by and by, lapping
as though her heart was brook: Why, why, why! Weh, O weh
I'se so silly to be flowing but I no canna stay!

Radagast the Third
Sanci
Sanci
Posts: 17
Joined: Sun Oct 16, 2016 10:08 am

Re: Help me build lexical databases for romance and english

Post by Radagast the Third »

Both, definitely. And not too fine phonetic distinctions, only distinctions that can reasonably be considered phonemic.

User avatar
Znex
Lebom
Lebom
Posts: 226
Joined: Sun Aug 25, 2013 2:45 am
Location: Australia

Re: Help me build lexical databases for romance and english

Post by Znex »

Alright, I'll add in Australian English stuff.
Native: English || Pretty decent: Ancient Greek || Alright: Ancient Hebrew || Eh: Welsh || Basic: Mandarin Chinese || Very basic: French, Latin, Nisuese, Apsish
Conlangs: Nisuese, Apsish, Kaptaran, Pseudo-Ligurian

User avatar
Salmoneus
Sanno
Sanno
Posts: 3197
Joined: Thu Jan 15, 2004 5:00 pm
Location: One of the dark places of the world

Re: Help me build lexical databases for romance and english

Post by Salmoneus »

Editing of the document seems to be lost. I'd have to create a google account, tell you about my google account, and ask you to give my google account permission to edit the document, and then log into the google account in order to do anything...
Blog: [url]http://vacuouswastrel.wordpress.com/[/url]

But the river tripped on her by and by, lapping
as though her heart was brook: Why, why, why! Weh, O weh
I'se so silly to be flowing but I no canna stay!

Radagast the Third
Sanci
Sanci
Posts: 17
Joined: Sun Oct 16, 2016 10:08 am

Re: Help me build lexical databases for romance and english

Post by Radagast the Third »

You edited something and it was lost? Or the editing function is not working for you? I didnt think a google account was necessary when someone shares the editing link. I can try and share a personal link with you in a private message.

Post Reply