PART 2: MORPHOLOGY, NOMINAL CATEGORIES, NOMINAL SYNTAX
---
I'd posted this over at the CBB, but I had a feeling I should probably crosspost it here as well. So here you go.
---
So I recently found out about CALS, the conlang world's answer to WALS. And as I noticed that all the categories were the same, and that the numbers of catalogued languages on each side were pretty similar, I started thinking that someone should go through and compare things.
And, by the great tradition of "someone should" => "I should", here we are.
For each of the features in both the WALS and CALS databases, I've converted them to percentages, then subtracted the CALS number from the WALS number. Nothing too mathematically profound, but it should still give us some interesting data. In effect, a value of +10% means that 10% of conlangs have that feature and "shouldn't" have that feature, while -15% means that 15% of conlangs don't have that feature and "should". (Pretty heavy inverted commas there - I'm not trying to be prescriptive - but still a good way of picturing things.)
And because it gives us a *lot* of data, much of it interesting, I've decided to split things up into a few posts so people don't get bogged down in numbers. In general, I've chosen the features with the most extreme positive or negative values, plus a few that I just find interesting. If anyone's curious about any features I've missed off, let me know and I'll throw together another graph.
Just comparing percentages doesn't give you the full picture by any means - an extra 10% on a feature that 75% of natlangs have will show up the same as an extra 10% on a feature that pretty much never happens. But it's not a bad start. Maybe I'll delve into some more complex statistical stuff at a later date.
PART 1: PHONOLOGY
Consonant Inventories

It seems that there's a tendency towards average-sized (19-25) consonant inventories here.
People seem to be shying away more from the very small (6-14) than the very large (34+) inventories - maybe they're not seen as as interesting. ("Just one more phoneme...")
Vowel Inventories

But for vowels, unlike consonants, there's a tendency away from the average. Possibly some of the huge interesting Indo-European vowel systems are pulling people away from the mean, towards larger (7+) inventories.
Voicing in Plosives and Fricatives

There's a strong tendency here - 20% more conlangs have a voicing contrast throughout.
Front Rounded Vowels

And again, possibly fueled by the tendency towards larger vowel systems, we see more conlangers going for the most "interesting" options.
Tone

As people generally assume, lots of conlangs don't have tones. But what surprised me here was how well the languages with tone matched the natlang distribution of complexity of tone system - I'd expected the tonal-conlangers to have gone much more for big dramatic contours-and-sandhi systems over simple two-way contrasts. Maybe there are more pitch-accent langs than I thought...
Stress

Fixed stress seems unpopular. But though I'd have expected to see unpredictable stress as popular, I wouldn't have expected "Right-oriented: one of the last three" to have shown up quite so strongly.
Uncommon Consonants

English rears its ugly head again. Non-sibilant dental fricatives are pretty rare in natlangs, being less common than co-articulated /kp/ - but because English has them (and, if I'm honest, because they're quite a nice-sounding sound) they show up in 18.5% more conlangs than natlangs. Though, to be honest, I was expecting a larger number - the tendency away from tone was larger than this one.
COMING SOON: MORPHOSYNTAX


















