Syllable segmentation of Cornish text
Tags: kernewekPythonnatural language processingcornish language27 Mar 2015 - MawKernewek
I will put a write up of the my solar eclipse pictures on this blog, dreckly. In the meantime find the pictures on my Google+ album.
For a while now I have been working on, from time to time a Python program to segment Cornish words into syllables using regular expressions. So far I've worked in Kernewek Kemmyn, but one of the uses I have in mind for this is to transliterate text from one orthography to another.
Here are a few test inputs. This is fairly early results and it will need some more testing. I will put the code online once I have the chance to debug it a little more.:
kath (cat)
An ger yw: kath
Niver a sylabelennow yw: 1
Hag yns i:
1 : kath
kathik (kitten)
An ger yw: kathik
Niver a sylabelennow yw: 2
Hag yns i:
1 : kath
2 : ik
kathigow (kittens)
An ger yw: kathigow
Niver a sylabelennow yw: 3
Hag yns i:
1 : kath
2 : ig
3 : ow
del (leaves)
An ger yw: del
Niver a sylabelennow yw: 1
Hag yns i:
1 : del
delenn (leaf)
An ger yw: delenn
Niver a sylabelennow yw: 2
Hag yns i:
1 : del
2 : enn
delennow (a countable collection of individual leaves)
An ger yw: delennow
Niver a sylabelennow yw: 3
Hag yns i:
1 : del
2 : enn
3 : ow
roes (net)
An ger yw: roes
Niver a sylabelennow yw: 1
Hag yns i:
1 : roes
roesweyth (network)
An ger yw: roesweyth
Niver a sylabelennow yw: 2
Hag yns i:
1 : roes
2 : weyth
kesroesweyth (Internet)
An ger yw: kesroesweyth
Niver a sylabelennow yw: 3
Hag yns i:
1 : kes
2 : roes
3 : weyth
jynn (engine/machine)
An ger yw: jynn
Niver a sylabelennow yw: 1
Hag yns i:
1 : jynn
jynn-ebrenn (aeroplane)
An ger yw: jynnebrenn
Niver a sylabelennow yw: 3
Hag yns i:
1 : jynn
2 : eb
3 : renn
jynnow-ebrenn (aeroplanes)
An ger yw: jynnowebrenn
Niver a sylabelennow yw: 4
Hag yns i:
1 : jynn
2 : ow
3 : eb
4 : renn
For a while now I have been working on, from time to time a Python program to segment Cornish words into syllables using regular expressions. So far I've worked in Kernewek Kemmyn, but one of the uses I have in mind for this is to transliterate text from one orthography to another.
Here are a few test inputs. This is fairly early results and it will need some more testing. I will put the code online once I have the chance to debug it a little more.:
kath (cat)
An ger yw: kath
Niver a sylabelennow yw: 1
Hag yns i:
1 : kath
kathik (kitten)
An ger yw: kathik
Niver a sylabelennow yw: 2
Hag yns i:
1 : kath
2 : ik
kathigow (kittens)
An ger yw: kathigow
Niver a sylabelennow yw: 3
Hag yns i:
1 : kath
2 : ig
3 : ow
del (leaves)
An ger yw: del
Niver a sylabelennow yw: 1
Hag yns i:
1 : del
delenn (leaf)
An ger yw: delenn
Niver a sylabelennow yw: 2
Hag yns i:
1 : del
2 : enn
delennow (a countable collection of individual leaves)
An ger yw: delennow
Niver a sylabelennow yw: 3
Hag yns i:
1 : del
2 : enn
3 : ow
roes (net)
An ger yw: roes
Niver a sylabelennow yw: 1
Hag yns i:
1 : roes
roesweyth (network)
An ger yw: roesweyth
Niver a sylabelennow yw: 2
Hag yns i:
1 : roes
2 : weyth
kesroesweyth (Internet)
An ger yw: kesroesweyth
Niver a sylabelennow yw: 3
Hag yns i:
1 : kes
2 : roes
3 : weyth
jynn (engine/machine)
An ger yw: jynn
Niver a sylabelennow yw: 1
Hag yns i:
1 : jynn
jynn-ebrenn (aeroplane)
An ger yw: jynnebrenn
Niver a sylabelennow yw: 3
Hag yns i:
1 : jynn
2 : eb
3 : renn
jynnow-ebrenn (aeroplanes)
An ger yw: jynnowebrenn
Niver a sylabelennow yw: 4
Hag yns i:
1 : jynn
2 : ow
3 : eb
4 : renn