LovecraftHP
Programmer
I have an input file with a lemma, the cvc structure of that lemma, and the pronunciation of that lemma, eg
abide,[V][CVVC],[@][baId]
abolish,[V][CV][CVC],[@][bO][lIS]
What I would like to do is convert this to an output that gives me the original lemma together with an onset-nucleus-coda structure, eg
abide,=,=,=,=,@,=,b,aI,d
abolish,=,@,=,b,O,=,l,I,S
I'm only interested in the final 3 syllables of each lemma, so the output for eg
accomodate,[V][CV][CV][CVVC],[@][kO][m@][deIt]
should only read
accomodate,k,O,=,m,@,=,d,eI,t
If there are only one or two syllables, the output should have ='s for the empty spaces, as in the example of "abide" above.
I've tried to get it to work but I can't understand how to relate the cvc structure to the pronunciation, or get the prog to disregard the square brackets.
Any help would be greatly appreciated.
abide,[V][CVVC],[@][baId]
abolish,[V][CV][CVC],[@][bO][lIS]
What I would like to do is convert this to an output that gives me the original lemma together with an onset-nucleus-coda structure, eg
abide,=,=,=,=,@,=,b,aI,d
abolish,=,@,=,b,O,=,l,I,S
I'm only interested in the final 3 syllables of each lemma, so the output for eg
accomodate,[V][CV][CV][CVVC],[@][kO][m@][deIt]
should only read
accomodate,k,O,=,m,@,=,d,eI,t
If there are only one or two syllables, the output should have ='s for the empty spaces, as in the example of "abide" above.
I've tried to get it to work but I can't understand how to relate the cvc structure to the pronunciation, or get the prog to disregard the square brackets.
Any help would be greatly appreciated.