Try using this rule:
Code: Select all
a/à/_/à…_
on these words
Code: Select all
akakakakakaka
akkakkakkakkakkakka
If it's working as designed, it would be good to note.
Code: Select all
a/à/_/à…_
Code: Select all
akakakakakaka
akkakkakkakkakkakka
Yeah, but it's more complicated than that, not to mention that in general, JavaScript has a Unicode problem.vec wrote:Just wanted to pop in and complain that the SCA2 doesn't fully support Unicode combining characters and tends to treat them as separates rather than as a part of the previous glyph. Is this something that can be fixed? I have a large number of vowel qualities, each marking nasality and stress and I'd rather not add in dozens of rewrite rules. I figure the combining characters spectrum in Unicode is quite limited so it ought to be easy enough to just say: if 0300–036F, 1AB0–1AFF, 1DC0–1DFF, 20D0–20FF or FE20–FE2F, treat as part of previous glyph, right?
I found that problem already!KathAveara wrote:Salmoneus, I've found the problem. In the Portugese stuff, click after 'districtus' and 'secundus'. You'll then see why there are extra k's.
Nice!zompist wrote:OK, should be fixed.
1. Insertions should work at the end of the word.
2. Blanks at the end of vocabulary items are removed.
I haven't tried it, but there's this.GreenBowTie wrote:is there a de-SCA somewhere? a Sound Change Reverser? it would be nice to be able to insert the rules and the results and get back a list of possible origins
Thanks, great! I'll take a look at it alter today.zompist wrote:All right, I made several changes to gen
Well, it's already a bit messy, but I think I can live with that :)I though the UI would get a bit too messy if I added percentage boxes.
I don't envy you :).I will look at the SCA2 problem later. Enough Javascript wrangling for tonight. :)
Code: Select all
V=aeioɨāēīōȳ (ȳ represents /ɨ:/, there's a rewrite rule for it)
Ā=āēȳī
Ē=ēīāî
-------
Ā/Ē/_/V…_# (that's definitely an ellipsis character, not three periods)