Linguistic reconstruction and maths
Posted: Mon Sep 10, 2012 12:03 pm
It appears to me that there's quite a clear relationship between linguistic reconstruction and some kind of information-theory maths.
Clearly there could be defined a function rec(mutter, moder, mother, moeder, móðir, móðir, muter, mither, mader, (whatever it is in gothic)) which spits out *mōdēr, the requisite sound changes from *mōdēr to the derived names and maybe a number of other guesses assigning some kind of probability to each? It could also provide some kind of null-change with a "reject" value or whatever, if, say, we had included a word that likely isn't a cognate.
Hence, the output would be
a string (the reconstructed form), a set1 of sets2 ordered sets3 of sets4 strings.
1) one for each language providing a cognate
2) for different potential paths
3) ordered by order they've been applied in
4) where each set contains those for whom order they've been applied in is not further deducable
Continuing from that, we could probably come up with a function that combines the sets of sets of sets and compares them. Meanwhile, it wouldn't be recursive, or rather, recursivity would be sort of invasive - dunno how to express this:
rec(rec(Mutter, moeder, mother, ...), rec(rec(móðir, móðir,moder), moder, moder)), could have an outer function alter the likelihoods of different reconstructions one level in?
Ok, I am rambling. The question is basically: what mathematical formalisms and functions on strings would be relevant for reconstruction?
Clearly there could be defined a function rec(mutter, moder, mother, moeder, móðir, móðir, muter, mither, mader, (whatever it is in gothic)) which spits out *mōdēr, the requisite sound changes from *mōdēr to the derived names and maybe a number of other guesses assigning some kind of probability to each? It could also provide some kind of null-change with a "reject" value or whatever, if, say, we had included a word that likely isn't a cognate.
Hence, the output would be
a string (the reconstructed form), a set1 of sets2 ordered sets3 of sets4 strings.
1) one for each language providing a cognate
2) for different potential paths
3) ordered by order they've been applied in
4) where each set contains those for whom order they've been applied in is not further deducable
Continuing from that, we could probably come up with a function that combines the sets of sets of sets and compares them. Meanwhile, it wouldn't be recursive, or rather, recursivity would be sort of invasive - dunno how to express this:
rec(rec(Mutter, moeder, mother, ...), rec(rec(móðir, móðir,moder), moder, moder)), could have an outer function alter the likelihoods of different reconstructions one level in?
Ok, I am rambling. The question is basically: what mathematical formalisms and functions on strings would be relevant for reconstruction?