Skrifenn Etek - Writing Eighteen - Tekstow Kernewe - Cornish Texts
Tags: kernewek, natural language processing, cornish
24 Oct 2010 - MawKernewek
I have recently acquired the book "Natural Language Processing with Python" and have begun to apply its principles to a few Cornish texts. I have downloaded a number of the traditional texts from http://corpus.kernewek.cymru247.net/ as well as a couple of modern texts (the short story Solempnyta by Benjamin Bruch and a translation of a chapter of the Lord of the Rings into Cornish from Keskewsel). If anyone reading this has any further texts in electronic form that they'd be willing to let me use, let me know.
So here are a few basic results:
['bmkk.txt', 'cwkdlkk.txt', 'omkkks.txt', 'pckk.txt', 'rdkk.txt', 'solempnyta_kk.txt', 'tolkien_kk.txt', 'tregkk.txt']
Text: Improved version 21 / 10 / 96 Bewnans... (Bewnans Meryasek)
Collocations: Building collocations list (Collocations are pairs of words that occur together more often than their individual frequencies would suggest. Finding them is a built-in function of Python's Natural Language Toolkit.)
pur wir; Comes venetensis; heb falladow; Tertius tortor; Secundus
tortor; Primus tortor; pub eur; Yesu Krist; Episcopus Kernow; Rag
kerensa; heb ahwer; kuv kolonn; wosa hemma; deun alemma; pur dhiogel;
pub termyn; heb namm; heb wow; Primus exulator; dha vodh
None
number of words = 26815
number of different words = 4664
Lengths of words in descending order of frequency [(3, 5094), (2, 4813), (4, 3857), (5, 3270), (1, 3078), (6, 2636), (7, 1697), (8, 1180), (9, 612), (10, 385), (11, 115), (12, 57), (13, 18), (14, 2), (18, 1)]
Top 50 words: ['a', 'y', 'n', 'dhe', 'ha', 'yn', 'an', 'ow', 'my', 'yw', 'c', 'ny', 'na', 're', 's', 'dha', 'omma', 'pur', 'ni', 'm', 'rag', 'meryasek', 'ma', 'sur', 'krist', 'yesu', 'bys', 'th', 'hwi', 'mar', 'heb', 'arloedh', 'oll', 'ev', 'vynn', 'gans', 'yma', 'dyw', 'vydh', 'lemmyn', 'vy', 'maria', 'den', 'ty', 'wir', 'dell', 'eus', 'meriadocus', 'dhymm', 'sertan']
Top 50 words of 4 or more letters: ['omma', 'meryasek', 'krist', 'yesu', 'arloedh', 'vynn', 'gans', 'vydh', 'lemmyn', 'maria', 'dell', 'meriadocus', 'dhymm', 'sertan', 'meur', 'dhymmo', 'dhyn', 'dhis', 'finit', 'episcopus', 'agas', 'comes', 'primus', 'secundus', 'nyns', 'yredi', 'orth', 'henna', 'prest', 'syrr', 'agan', 'devri', 'tortor', 'dhywgh', 'nevra', 'gweres', 'alemma', 'hanow', 'bydh', 'bynytha', 'deun', 'dhodho', 'epskop', 'hemma', 'lies', 'descendit', 'dhiso', 'lowena', 'mones', 'aredy']
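To make the idea of a collocation concrete, here is a minimal pure-Python sketch that ranks adjacent word pairs by pointwise mutual information (PMI). This is a simplified stand-in, not the method the Natural Language Toolkit uses for the lists in this post: the function names `tokenize` and `collocations`, the `min_count` threshold and the sample string are all assumptions made for illustration. (The stray "None" after each list is most likely the return value of the toolkit's collocations call being printed, since that call prints its result itself.)

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Split text into lower-cased word tokens (letters only, Unicode-aware)."""
    return re.findall(r"[^\W\d_]+", text.lower())


def collocations(tokens, min_count=2, top_n=20):
    """Rank adjacent word pairs by pointwise mutual information (PMI).

    PMI compares how often a pair actually occurs with how often it would
    occur if the two words were independent; min_count drops rare pairs.
    """
    word_freq = Counter(tokens)
    pair_freq = Counter(zip(tokens, tokens[1:]))
    n_words = len(tokens)
    n_pairs = max(n_words - 1, 1)

    scored = []
    for (w1, w2), count in pair_freq.items():
        if count < min_count:
            continue
        p_pair = count / n_pairs
        p_independent = (word_freq[w1] / n_words) * (word_freq[w2] / n_words)
        scored.append((math.log2(p_pair / p_independent), w1, w2))

    scored.sort(reverse=True)
    return [f"{w1} {w2}" for _, w1, w2 in scored[:top_n]]


# Hypothetical toy input built from words that appear in the lists above.
sample = "pur wir yw henna ha pur wir yw hemma ha my a lever pur wir"
sample_collocations = collocations(tokenize(sample))
print("; ".join(sample_collocations))
```

On a real text the `min_count` filter matters a great deal, because PMI otherwise favours pairs of words that each occur only once.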
Text: # ------------------------------------------------------------------------ # # The text of _Gwreans... (Gwreans an Bys)
Collocations: Building collocations list (note that I really should have trimmed out the comments that are in the file but are not part of the Cornish text)
KDL page; ### KDL; par dell; lever dhis; pub eur; pub tra; pur vras;
pub prys; wosa hemma; pur wir; Nyns eus; vynn mos; dhe vos; Der henna;
mos alemma; mar vras; heb falladow; warn ugens; FIRST DEVIL; myns eus
None
number of words = 15044
number of different words = 2539
Lengths of words in descending order of frequency [(3, 3450), (2, 3007), (4, 2420), (5, 1666), (1, 1663), (6, 1474), (7, 747), (8, 381), (9, 144), (10, 59), (11, 29), (12, 4)]
Top 50 words: ['a', 'ha', 'y', 'n', 'an', 'dhe', 'my', 'ow', 'yw', 'yn', 'ny', 'na', 'yth', 'adam', 'the', 'bys', 'pur', 'dha', 'ty', 'vydh', 'rag', 'hag', 'henna', 'm', 'pub', 'ma', 'oll', 'omma', 'kdl', 'page', 'th', 'dhymm', 'dyw', 'gans', 'ev', 'mar', 'tas', 'to', 'vynn', 'hwi', 'in', 'der', 'dhymmo', 'eus', 'heb', 'ms', 'and', 'eva', 'vy', 'gwrys']
Top 50 words of 4 or more letters: ['adam', 'vydh', 'henna', 'omma', 'page', 'dhymm', 'gans', 'vynn', 'dhymmo', 'gwrys', 'dhis', 'bras', 'dell', 'father', 'lemmyn', 'nevra', 'serpent', 'dout', 'genev', 'kaym', 'seth', 'keth', 'vras', 'nyns', 'rakhenna', 'hemma', 'meur', 'cain', 'prys', 'orth', 'abel', 'alemma', 'fydh', 'hwath', 'lever', 'yndella', 'ynwedh', 'sertan', 'woer', 'heaven', 'plas', 'agan', 'gwel', 'mayth', 'ragdho', 'wartha', 'genes', 'lavar', 'maga', 'wydhenn']
Text: # --------- ORIGO MUNDI --------- # Keith Syed...
Collocations: Building collocations list
DEUS PATER; REX SAL; pur wir; Tas Dyw; heb falladow; Nyns eus; teyr
gwelenn; may hallo; nyns eus; heb fall; heb wow; Lavar dhymmo; dell
vynni; dres puptra; Arloedh ker; pub huni; pub eur; kollenwel bodh;
verr dermyn; war bayn
None
number of words = 15533
number of different words = 2608
Lengths of words in descending order of frequency [(3, 3375), (2, 3033), (4, 2396), (1, 1898), (5, 1673), (6, 1421), (7, 956), (8, 469), (9, 171), (10, 97), (12, 21), (11, 19), (13, 2), (14, 2)]
Top 50 words: ['a', 'ha', 'y', 'an', 'n', 'yn', 'dhe', 'ow', 'my', 'dha', 'rag', 'yw', 'na', 'ny', 'dyw', 'oll', 'm', 're', 'bys', 'th', 'vydh', 'arloedh', 'war', 'hag', 'heb', 'may', 'dell', 'ev', 'gans', 'mar', 'dhis', 'ty', 'ma', 'tas', 'i', 'wra', 'ni', 'dhymm', 'lemmyn', 'deus', 'dre', 'nev', 'adam', 'vynn', 'moyses', 'pan', 'pur', 'bos', 'eus', 'pater']
Top 50 words of 4 or more letters: ['vydh', 'arloedh', 'dell', 'gans', 'dhis', 'dhymm', 'lemmyn', 'deus', 'adam', 'vynn', 'moyses', 'pater', 'dhymmo', 'dhyn', 'agan', 'skon', 'bras', 'hware', 'nevra', 'dhodho', 'gwrys', 'nyns', 'sertan', 'dhiso', 'dhyw', 'meur', 'keffrys', 'kyns', 'orth', 'dhywgh', 'henna', 'leun', 'fydh', 'hweg', 'vynytha', 'bydh', 'abel', 'omma', 'onan', 'awos', 'kemmer', 'ellas', 'gwel', 'gwra', 'bones', 'deun', 'nuncius', 'pyth', 'ynwedh', 'bennath']
Text: PASSIO CHRISTI - KK Version made from Norris...
Collocations: Building collocations list
Mab Dyw; Princeps Annas; IVs Tortor; IIs Tortor; IIIs Tortor; pur wir;
heb lettya; tri dydh; kepar dell; dha vodh; IIs Doctor; Dydh Breus;
dhis lowena; Pur wir; dhe wruthyl; kettep onan; may hallo; Arloedh
ker; Myghtern Yedhewon; Nyns eus
None
number of words = 21260
number of different words = 3604
Lengths of words in descending order of frequency [(3, 4249), (2, 4152), (4, 3066), (5, 2425), (1, 2406), (6, 2115), (7, 1408), (8, 779), (9, 366), (10, 218), (11, 47), (12, 18), (13, 5), (14, 5), (16, 1)]
Top 50 words: ['a', 'y', 'n', 'yn', 'my', 'an', 'ha', 'dhe', 'ow', 'yw', 'et', 'rag', 'ny', 're', 'na', 'dha', 'ev', 'oll', 'hag', 'm', 'dyw', 'tortor', 'bys', 'war', 'gans', 'hic', 'mar', 'ihc', 'th', 'cayphas', 'ma', 'mab', 'may', 'ad', 'ihesu', 'dell', 'lemmyn', 'dhis', 'hwi', 'ty', 'heb', 'pan', 'tunc', 'vydh', 'wra', 'ni', 'arloedh', 's', 'den', 'dre']
Top 50 words of 4 or more letters: ['tortor', 'gans', 'cayphas', 'ihesu', 'dell', 'lemmyn', 'dhis', 'tunc', 'vydh', 'arloedh', 'pilatus', 'dhodho', 'sertan', 'meur', 'vynn', 'dicit', 'dhymm', 'henna', 'dhyn', 'mara', 'annas', 'skon', 'kyns', 'dhywgh', 'dhymmo', 'hware', 'agan', 'agas', 'lowena', 'ellas', 'gwas', 'gwir', 'mars', 'petrus', 'princeps', 'syrr', 'bras', 'hweg', 'lavar', 'worth', 'bydh', 'hayl', 'myghtern', 'dhiso', 'fydh', 'lever', 'nyns', 'yndella', 'awos', 'kettep']
Text: Resurrectio Domini - KK Version made from Norris...
Collocations: Building collocations list
pur wir; Spyrys Sans; fem .?}; verr spys; tressa dydh; kepar dell;
Arloedh ker; Penn vyghternedh; dhe dhasserghi; Kepar dell; vos
dasserghys; osculatur eos; IVs Miles; Ihesu Cryst; Mab Maria; tri
dydh; januis clausis; IIIs Miles; hakkra mernans; heb lettya
None
number of words = 16209
number of different words = 2635
Lengths of words in descending order of frequency [(3, 3314), (2, 3146), (4, 2310), (1, 2142), (5, 1864), (6, 1476), (7, 990), (8, 497), (9, 239), (10, 171), (11, 40), (12, 16), (13, 3), (14, 1)]
Top 50 words: ['a', 'y', 'n', 'yn', 'ha', 'dhe', 'an', 'ow', 'my', 'yw', 'ny', 'na', 'ev', 'rag', 'arloedh', 'dha', 'mar', 'ty', 'm', 'bys', 'dell', 'ni', 'oll', 'th', 'sur', 're', 'vydh', 'gans', 'meur', 'hag', 'pur', 'ihesu', 'lemmyn', 'nev', 'thomas', 'et', 'dre', 'ma', 'heb', 'bedh', 'pan', 'cryst', 'dhymmo', 'ns', 'dhymm', 'hwi', 'maria', 'dhis', 'dyw', 'neb']
Top 50 words of 4 or more letters: ['arloedh', 'dell', 'vydh', 'gans', 'meur', 'ihesu', 'lemmyn', 'thomas', 'bedh', 'cryst', 'dhymmo', 'dhymm', 'maria', 'dhis', 'dhyn', 'skon', 'bydh', 'korf', 'imperator', 'nyns', 'agan', 'miles', 'sertan', 'marow', 'henna', 'ellas', 'ynwedh', 'tortor', 'bras', 'dhywgh', 'lavar', 'vernona', 'agas', 'leun', 'drog', 'grys', 'myghtern', 'pilatus', 'genen', 'golonn', 'dasserghys', 'genev', 'krysi', 'vynn', 'dhodho', 'gwir', 'tunc', 'hedhyw', 'kepar', 'nevra']
Text: Solempnyta Blackheath . 17 Metheven 1997 . Wel...
Collocations: Building collocations list (Note that this is quite a short text)
Pow Sows; yth esa; dro dhe; brassa rann; mos tre; Hag ythó; Pur dha;
rann anedha; Gov haâ; dell grysav; esov omma; eus saw; rag covhÃ; den
vyth; omma rag; dhe Loundres; ledhys gans; Nyns eus; nyns eus; Henry
ledhys
None
number of words = 1264
number of different words = 511
Lengths of words in descending order of frequency [(2, 279), (3, 229), (4, 171), (5, 138), (1, 129), (6, 118), (7, 82), (8, 51), (9, 27), (10, 20), (11, 18), (13, 1), (14, 1)]
Top 50 words: ['an', 'a', 'n', 'yn', 'yw', 'ha', 'y', 'dhe', 'sowsnek', 'ma', 'ow', 'rag', 'my', 'nyns', 'henry', 'mes', 'vy', 'gans', 're', 'hag', 'omma', 'hy', 'o', 'dell', 'shakespeare', 'sows', 'taves', 'war', 'yeth', 'yth', 'dhymm', 'le', 'na', 'nebes', 'ny', 'oll', 'pan', 'po', 'pow', 'yma', 'bos', 'genev', 'gov', 'gwari', 'heb', 'hi', 'may', 'tus', 'vyth', 'aga']
Top 50 words of 4 or more letters: ['sowsnek', 'nyns', 'henry', 'gans', 'omma', 'dell', 'shakespeare', 'sows', 'taves', 'yeth', 'dhymm', 'nebes', 'genev', 'gwari', 'vyth', 'avel', 'erel', 'hedhyw', 'henna', 'kernewek', 'orth', 'sowsneger', 'studhya', 'whath', 'wosa', 'anedha', 'aral', 'bedh', 'blackheath', 'clewes', 'dann', 'dherag', 'honan', 'ledhys', 'margh', 'martesen', 'mernans', 'nans', 'ogas', 'skila', 'tiwedh', 'vernans', 'yethow', 'arta', 'bothek', 'brassa', 'cales', 'cansblydhen', 'clappya', 'codhas']
Text: Osta karer Arloedh An Bysowyer ? Wel ,... (A chapter of Lord Of the Rings rendered in Cornish at www.keskewsel.com)
Collocations: Building collocations list
Yth esa; yth esa; dhe vos; haval orth; Unn Bysow; medh Gandalf; Bag
End; Nyns eus; dro dhe; leveris Gandalf; wovynnas Frodo; medh Frodo;
neb kas; fatell wrug; dell dybav; dann gel; res dhis; rewlya oll;
Parkow Gladen; dre dermyn
None
number of words = 11147
number of different words = 1966
Lengths of words in descending order of frequency [(2, 2309), (3, 2004), (1, 1526), (4, 1442), (5, 1342), (6, 976), (7, 772), (8, 395), (9, 185), (10, 129), (11, 36), (12, 10), (13, 9), (17, 5), (15, 3), (18, 2), (14, 1), (16, 1)]
Top 50 words: ['a', 'an', 'ev', 'y', 'yn', 'ha', 'n', 'dhe', 'hag', 'mes', 'o', 'ow', 'na', 'yw', 'ny', 'frodo', 'bysow', 'esa', 'vy', 'yth', 're', 'my', 'nyns', 'gans', 'wrug', 'dell', 'bos', 'rag', 'i', 'oll', 'gandalf', 'vos', 'bylbo', 'orth', 'po', 'mar', 'termyn', 'henna', 'dre', 'leveris', 'meur', 'dhodho', 'medh', 'aga', 'es', 'pan', 'pur', 'dres', 'ta', 'yma']
Top 50 words of 4 or more letters: ['frodo', 'bysow', 'nyns', 'gans', 'wrug', 'dell', 'gandalf', 'bylbo', 'orth', 'termyn', 'henna', 'leveris', 'meur', 'dhodho', 'medh', 'dres', 'arta', 'kever', 'nerth', 'dhymm', 'diworth', 'golum', 'shayr', 'tewl', 'haval', 'hobytow', 'hwir', 'nebes', 'wosa', 'henn', 'honan', 'lemmyn', 'yndella', 'arall', 'kyns', 'vydh', 'hwath', 'ganso', 'klywes', 'pyth', 'woer', 'drefenn', 'elfow', 'leverel', 'owth', 'ytho', 'dhis', 'nans', 'nevra', 'orto']
Text: THE TREGEAR HOMILIES KK Version made from Christopher...
Collocations: Building collocations list
dhe vos; kepar dell; Spyrys Sans; agan Savyour; katholik eglos; pub
eur; mar veur; heb diwedh; fatell wrug; Yesu Krist; dre reson; agan
honan; Savyour Yesu; res dhyn; agan Arloedh; Katholik Eglos; Sans
Eglos; dell wrug; dhe leverel; gan Savyour
None
number of words = 40897
number of different words = 5246
Lengths of words in descending order of frequency [(2, 8508), (3, 7334), (1, 5121), (4, 5001), (5, 4461), (6, 3516), (7, 2555), (8, 2112), (9, 1009), (10, 637), (11, 317), (12, 155), (13, 99), (14, 43), (15, 17), (16, 5), (17, 3), (19, 2), (18, 1), (20, 1)]
Top 50 words: ['a', 'ha', 'an', 'n', 'dhe', 'y', 'yn', 'yw', 'ow', 'ni', 'ev', 'ma', 'na', 'rag', 'krist', 'agan', 'wrug', 's', 'oll', 'dre', 'yma', 'eglos', 'dyw', 'gans', 'hag', 'bonner', 'fatell', 'henna', 'et', 'kepar', 'den', 'leverel', 'vos', 'aga', 'yth', 'mar', 'keth', 're', 'honan', 'dell', 'bos', 'i', 'in', 'vydh', 'folio', 'ny', 'o', 'de', 'homily', 'nyns']
Top 50 words of 4 or more letters: ['krist', 'agan', 'wrug', 'eglos', 'gans', 'bonner', 'fatell', 'henna', 'kepar', 'leverel', 'keth', 'honan', 'dell', 'vydh', 'folio', 'homily', 'nyns', 'dhyn', 'dhyw', 'ynwedh', 'korf', 'savyour', 'rakhenna', 'hemma', 'henn', 'dhiworth', 'katholik', 'onan', 'geryow', 'pyth', 'hwath', 'arloedh', 'peder', 'chaptra', 'gwrys', 'omma', 'yndella', 'skryptor', 'lemmyn', 'bobel', 'sans', 'arall', 'dhodho', 'goes', 'leveris', 'lies', 'spyrys', 'agas', 'powl', 'termyn']
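The word counts, word-length frequencies and top-word lists above can be sketched with nothing more than the standard library. This is a rough reconstruction under assumptions, not the script actually used for this post: the names `tokenize` and `summarise` and the toy sentence are hypothetical. Splitting on anything that is not a letter would also explain single-letter entries such as 'n', 'm' and 's' in the lists, if they come from forms written with an apostrophe.

```python
import re
from collections import Counter


def tokenize(text):
    """Split text into lower-cased word tokens (letters only, Unicode-aware)."""
    return re.findall(r"[^\W\d_]+", text.lower())


def summarise(text, top_n=50, min_len=4):
    """Return the basic statistics listed for each text in this post."""
    words = tokenize(text)
    word_freq = Counter(words)
    length_freq = Counter(len(w) for w in words)
    long_freq = Counter(w for w in words if len(w) >= min_len)
    return {
        "number of words": len(words),
        "number of different words": len(word_freq),
        # (length, count) pairs, most frequent length first
        "lengths": length_freq.most_common(),
        "top words": [w for w, _ in word_freq.most_common(top_n)],
        "top long words": [w for w, _ in long_freq.most_common(top_n)],
    }


# Hypothetical toy string; a real run would read one of the .txt files.
stats = summarise("An gath a wel an ki ha'n ki a wel an gath")
for name, value in stats.items():
    print(name, "=", value)
```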
Here we see what percentage of each text is made up of words of each length:
Text: Improved version 21 / 10 / 96 Bewnans...
3 letters : 14.05 %
2 letters : 13.27 %
4 letters : 10.63 %
5 letters : 9.020 %
1 letters : 8.490 %
6 letters : 7.271 %
7 letters : 4.681 %
8 letters : 3.255 %
9 letters : 1.688 %
10 letters : 1.062 %
11 letters : 0.317 %
12 letters : 0.157 %
13 letters : 0.049 %
14 letters : 0.005 %
18 letters : 0.002 %
Text: # ------------------------------------------------------------------------ # # The text of _Gwreans...
3 letters : 15.71 %
2 letters : 13.70 %
4 letters : 11.02 %
5 letters : 7.590 %
1 letters : 7.577 %
6 letters : 6.715 %
7 letters : 3.403 %
8 letters : 1.735 %
9 letters : 0.656 %
10 letters : 0.268 %
11 letters : 0.132 %
12 letters : 0.018 %
Text: # --------- ORIGO MUNDI --------- # Keith Syed...
3 letters : 16.67 %
2 letters : 14.98 %
4 letters : 11.83 %
1 letters : 9.376 %
5 letters : 8.264 %
6 letters : 7.020 %
7 letters : 4.722 %
8 letters : 2.316 %
9 letters : 0.844 %
10 letters : 0.479 %
12 letters : 0.103 %
11 letters : 0.093 %
13 letters : 0.009 %
14 letters : 0.009 %
Text: PASSIO CHRISTI - KK Version made from Norris...
3 letters : 15.49 %
2 letters : 15.14 %
4 letters : 11.18 %
5 letters : 8.844 %
1 letters : 8.774 %
6 letters : 7.713 %
7 letters : 5.135 %
8 letters : 2.841 %
9 letters : 1.334 %
10 letters : 0.795 %
11 letters : 0.171 %
12 letters : 0.065 %
13 letters : 0.018 %
14 letters : 0.018 %
16 letters : 0.003 %
Text: Resurrectio Domini - KK Version made from Norris...
3 letters : 15.66 %
2 letters : 14.86 %
4 letters : 10.91 %
1 letters : 10.12 %
5 letters : 8.808 %
6 letters : 6.974 %
7 letters : 4.678 %
8 letters : 2.348 %
9 letters : 1.129 %
10 letters : 0.808 %
11 letters : 0.189 %
12 letters : 0.075 %
13 letters : 0.014 %
14 letters : 0.004 %
Text: Solempnyta Blackheath . 17 Metheven 1997 . Wel...
2 letters : 16.55 %
3 letters : 13.59 %
4 letters : 10.14 %
5 letters : 8.189 %
1 letters : 7.655 %
6 letters : 7.002 %
7 letters : 4.866 %
8 letters : 3.026 %
9 letters : 1.602 %
10 letters : 1.186 %
11 letters : 1.068 %
13 letters : 0.059 %
14 letters : 0.059 %
Text: Osta karer Arloedh An Bysowyer ? Wel ,...
2 letters : 15.47 %
3 letters : 13.42 %
1 letters : 10.22 %
4 letters : 9.661 %
5 letters : 8.991 %
6 letters : 6.539 %
7 letters : 5.172 %
8 letters : 2.646 %
9 letters : 1.239 %
10 letters : 0.864 %
11 letters : 0.241 %
12 letters : 0.067 %
13 letters : 0.060 %
17 letters : 0.033 %
15 letters : 0.020 %
18 letters : 0.013 %
14 letters : 0.006 %
16 letters : 0.006 %
Text: THE TREGEAR HOMILIES KK Version made from Christopher...
2 letters : 16.19 %
3 letters : 13.95 %
1 letters : 9.745 %
4 letters : 9.517 %
5 letters : 8.489 %
6 letters : 6.691 %
7 letters : 4.862 %
8 letters : 4.019 %
9 letters : 1.920 %
10 letters : 1.212 %
11 letters : 0.603 %
12 letters : 0.294 %
13 letters : 0.188 %
14 letters : 0.081 %
15 letters : 0.032 %
16 letters : 0.009 %
17 letters : 0.005 %
19 letters : 0.003 %
18 letters : 0.001 %
20 letters : 0.001 %
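A minimal sketch of how such a length breakdown could be computed is given below; `length_percentages` and the short word list are hypothetical. One thing to watch is the choice of denominator: 5094 three-letter words out of the 26815 words counted for Bewnans Meryasek would be about 19 %, whereas 14.05 % is listed, so the figures above were presumably computed against a larger total (possibly every token, punctuation included). The sketch divides by the number of words only.

```python
from collections import Counter


def length_percentages(words):
    """Return (length, percentage of all words) pairs, most common first."""
    length_freq = Counter(len(w) for w in words)
    total = len(words)
    return [(length, 100.0 * count / total)
            for length, count in length_freq.most_common()]


# Hypothetical toy word list standing in for a tokenized text.
words = ["a", "an", "ha", "dhe", "pur", "wir", "gans", "yn"]
length_table = length_percentages(words)
for length, pct in length_table:
    print(f"{length} letters : {pct:.2f} %")
```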
Here we show what percentage of each text is made up by each of its 20 most frequent words.
Text: Improved version 21 / 10 / 96 Bewnans...
a : 3.233 %
y : 1.613 %
n : 1.431 %
dhe : 1.359 %
ha : 1.274 %
yn : 1.150 %
an : 1.051 %
ow : 1.006 %
my : 0.976 %
yw : 0.846 %
c : 0.822 %
ny : 0.689 %
na : 0.590 %
re : 0.571 %
s : 0.513 %
dha : 0.499 %
omma : 0.499 %
pur : 0.496 %
ni : 0.463 %
m : 0.449 %
Text: # ------------------------------------------------------------------------ # # The text of _Gwreans...
a : 3.781 %
ha : 1.581 %
y : 1.476 %
n : 1.435 %
an : 1.362 %
dhe : 1.316 %
my : 1.152 %
ow : 1.075 %
yw : 0.952 %
yn : 0.947 %
ny : 0.701 %
na : 0.651 %
yth : 0.542 %
adam : 0.492 %
the : 0.482 %
bys : 0.473 %
pur : 0.469 %
dha : 0.446 %
ty : 0.441 %
vydh : 0.428 %
Text: # --------- ORIGO MUNDI --------- # Keith Syed...
a : 4.890 %
ha : 1.753 %
y : 1.719 %
an : 1.570 %
n : 1.526 %
yn : 1.511 %
dhe : 1.383 %
ow : 1.373 %
my : 1.249 %
dha : 0.844 %
rag : 0.755 %
yw : 0.755 %
na : 0.666 %
ny : 0.666 %
dyw : 0.573 %
oll : 0.568 %
m : 0.543 %
re : 0.533 %
bys : 0.508 %
th : 0.464 %
Text: PASSIO CHRISTI - KK Version made from Norris...
a : 3.924 %
y : 2.337 %
n : 1.531 %
yn : 1.385 %
my : 1.316 %
an : 1.283 %
ha : 1.276 %
dhe : 1.050 %
ow : 1.013 %
yw : 0.846 %
et : 0.751 %
rag : 0.707 %
ny : 0.689 %
re : 0.652 %
na : 0.649 %
dha : 0.576 %
ev : 0.503 %
oll : 0.452 %
hag : 0.415 %
m : 0.404 %
Text: Resurrectio Domini - KK Version made from Norris...
a : 4.853 %
y : 2.216 %
n : 1.989 %
yn : 1.606 %
ha : 1.356 %
dhe : 1.327 %
an : 1.256 %
ow : 1.063 %
my : 1.001 %
yw : 0.954 %
ny : 0.907 %
na : 0.803 %
ev : 0.666 %
rag : 0.619 %
arloedh : 0.567 %
dha : 0.533 %
mar : 0.496 %
ty : 0.472 %
m : 0.448 %
bys : 0.430 %
Text: Solempnyta Blackheath . 17 Metheven 1997 . Wel...
an : 3.738 %
a : 3.323 %
n : 2.195 %
yn : 1.721 %
yw : 1.661 %
ha : 1.602 %
y : 1.186 %
dhe : 1.127 %
sowsnek : 1.127 %
ma : 1.008 %
ow : 1.008 %
rag : 0.890 %
my : 0.830 %
nyns : 0.830 %
henry : 0.712 %
mes : 0.652 %
vy : 0.652 %
gans : 0.593 %
re : 0.593 %
hag : 0.534 %
Text: Osta karer Arloedh An Bysowyer ? Wel ,...
a : 4.783 %
an : 2.599 %
ev : 2.512 %
y : 2.445 %
yn : 2.070 %
ha : 1.862 %
n : 1.587 %
dhe : 1.226 %
hag : 1.038 %
mes : 1.011 %
o : 0.884 %
ow : 0.824 %
na : 0.676 %
yw : 0.663 %
ny : 0.636 %
frodo : 0.596 %
bysow : 0.589 %
esa : 0.509 %
vy : 0.502 %
yth : 0.495 %
Text: THE TREGEAR HOMILIES KK Version made from Christopher...
a : 4.411 %
ha : 3.016 %
an : 2.925 %
n : 2.148 %
dhe : 1.899 %
y : 1.821 %
yn : 1.288 %
yw : 1.157 %
ow : 1.073 %
ni : 0.858 %
ev : 0.839 %
ma : 0.749 %
na : 0.698 %
rag : 0.679 %
krist : 0.664 %
agan : 0.616 %
wrug : 0.603 %
s : 0.527 %
oll : 0.464 %
dre : 0.439 %
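The per-word shares can be sketched in the same way. Again this is only an illustration under assumptions: `word_percentages` and the toy list are hypothetical, and the denominator here is the number of word tokens, which may not match the total used for the figures above.

```python
from collections import Counter


def word_percentages(words, top_n=20):
    """Return (word, percentage of all words) for the top_n commonest words."""
    freq = Counter(words)
    total = len(words)
    return [(word, 100.0 * count / total)
            for word, count in freq.most_common(top_n)]


# Hypothetical toy word list standing in for a tokenized text.
toy_words = ["a", "ha", "a", "an", "a", "ha", "dhe", "yn", "a", "an"]
word_table = word_percentages(toy_words, top_n=3)
for word, pct in word_table:
    print(f"{word} : {pct:.3f} %")
```

Expressing each word as a share of its own text is what makes texts of very different lengths, such as Solempnyta and the Tregear Homilies, comparable.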
So here are a few basic results:
['bmkk.txt', 'cwkdlkk.txt', 'omkkks.txt', 'pckk.txt', 'rdkk.txt', 'solempnyta_kk.txt', 'tolkien_kk.txt', 'tregkk.txt']
Text: Improved version 21 / 10 / 96 Bewnans... (Bewnans Meryasek)
Collocations: Building collocations list Collocations are words that tend to be more likely to occur together than would be suggested by their general frequency. This is a built in function of Python's Natural Language Toolkit
pur wir; Comes venetensis; heb falladow; Tertius tortor; Secundus
tortor; Primus tortor; pub eur; Yesu Krist; Episcopus Kernow; Rag
kerensa; heb ahwer; kuv kolonn; wosa hemma; deun alemma; pur dhiogel;
pub termyn; heb namm; heb wow; Primus exulator; dha vodh
None
number of words = 26815
number of different words = 4664
Lengths of words in descending order of frequency [(3, 5094), (2, 4813), (4, 3857), (5, 3270), (1, 3078), (6, 2636), (7, 1697), (8, 1180), (9, 612), (10, 385), (11, 115), (12, 57), (13, 18), (14, 2), (18, 1)]
Top 50 words: ['a', 'y', 'n', 'dhe', 'ha', 'yn', 'an', 'ow', 'my', 'yw', 'c', 'ny', 'na', 're', 's', 'dha', 'omma', 'pur', 'ni', 'm', 'rag', 'meryasek', 'ma', 'sur', 'krist', 'yesu', 'bys', 'th', 'hwi', 'mar', 'heb', 'arloedh', 'oll', 'ev', 'vynn', 'gans', 'yma', 'dyw', 'vydh', 'lemmyn', 'vy', 'maria', 'den', 'ty', 'wir', 'dell', 'eus', 'meriadocus', 'dhymm', 'sertan']
Top 50 words of 4 or more letters: ['omma', 'meryasek', 'krist', 'yesu', 'arloedh', 'vynn', 'gans', 'vydh', 'lemmyn', 'maria', 'dell', 'meriadocus', 'dhymm', 'sertan', 'meur', 'dhymmo', 'dhyn', 'dhis', 'finit', 'episcopus', 'agas', 'comes', 'primus', 'secundus', 'nyns', 'yredi', 'orth', 'henna', 'prest', 'syrr', 'agan', 'devri', 'tortor', 'dhywgh', 'nevra', 'gweres', 'alemma', 'hanow', 'bydh', 'bynytha', 'deun', 'dhodho', 'epskop', 'hemma', 'lies', 'descendit', 'dhiso', 'lowena', 'mones', 'aredy']
Text: # ------------------------------------------------------------------------ # # The text of _Gwreans... (Gwreans an Bys)
Collocations: Building collocations list (note that I really should have trimmed some of the comments out that are in the file but not part of the Cornish text)
KDL page; ### KDL; par dell; lever dhis; pub eur; pub tra; pur vras;
pub prys; wosa hemma; pur wir; Nyns eus; vynn mos; dhe vos; Der henna;
mos alemma; mar vras; heb falladow; warn ugens; FIRST DEVIL; myns eus
None
number of words = 15044
number of different words = 2539
Lengths of words in descending order of frequency [(3, 3450), (2, 3007), (4, 2420), (5, 1666), (1, 1663), (6, 1474), (7, 747), (8, 381), (9, 144), (10, 59), (11, 29), (12, 4)]
Top 50 words: ['a', 'ha', 'y', 'n', 'an', 'dhe', 'my', 'ow', 'yw', 'yn', 'ny', 'na', 'yth', 'adam', 'the', 'bys', 'pur', 'dha', 'ty', 'vydh', 'rag', 'hag', 'henna', 'm', 'pub', 'ma', 'oll', 'omma', 'kdl', 'page', 'th', 'dhymm', 'dyw', 'gans', 'ev', 'mar', 'tas', 'to', 'vynn', 'hwi', 'in', 'der', 'dhymmo', 'eus', 'heb', 'ms', 'and', 'eva', 'vy', 'gwrys']
Top 50 words of 4 or more letters: ['adam', 'vydh', 'henna', 'omma', 'page', 'dhymm', 'gans', 'vynn', 'dhymmo', 'gwrys', 'dhis', 'bras', 'dell', 'father', 'lemmyn', 'nevra', 'serpent', 'dout', 'genev', 'kaym', 'seth', 'keth', 'vras', 'nyns', 'rakhenna', 'hemma', 'meur', 'cain', 'prys', 'orth', 'abel', 'alemma', 'fydh', 'hwath', 'lever', 'yndella', 'ynwedh', 'sertan', 'woer', 'heaven', 'plas', 'agan', 'gwel', 'mayth', 'ragdho', 'wartha', 'genes', 'lavar', 'maga', 'wydhenn']
Text: # --------- ORIGO MUNDI --------- # Keith Syed...
Collocations: Building collocations list
DEUS PATER; REX SAL; pur wir; Tas Dyw; heb falladow; Nyns eus; teyr
gwelenn; may hallo; nyns eus; heb fall; heb wow; Lavar dhymmo; dell
vynni; dres puptra; Arloedh ker; pub huni; pub eur; kollenwel bodh;
verr dermyn; war bayn
None
number of words = 15533
number of different words = 2608
Lengths of words in descending order of frequency [(3, 3375), (2, 3033), (4, 2396), (1, 1898), (5, 1673), (6, 1421), (7, 956), (8, 469), (9, 171), (10, 97), (12, 21), (11, 19), (13, 2), (14, 2)]
Top 50 words: ['a', 'ha', 'y', 'an', 'n', 'yn', 'dhe', 'ow', 'my', 'dha', 'rag', 'yw', 'na', 'ny', 'dyw', 'oll', 'm', 're', 'bys', 'th', 'vydh', 'arloedh', 'war', 'hag', 'heb', 'may', 'dell', 'ev', 'gans', 'mar', 'dhis', 'ty', 'ma', 'tas', 'i', 'wra', 'ni', 'dhymm', 'lemmyn', 'deus', 'dre', 'nev', 'adam', 'vynn', 'moyses', 'pan', 'pur', 'bos', 'eus', 'pater']
Top 50 words of 4 or more letters: ['vydh', 'arloedh', 'dell', 'gans', 'dhis', 'dhymm', 'lemmyn', 'deus', 'adam', 'vynn', 'moyses', 'pater', 'dhymmo', 'dhyn', 'agan', 'skon', 'bras', 'hware', 'nevra', 'dhodho', 'gwrys', 'nyns', 'sertan', 'dhiso', 'dhyw', 'meur', 'keffrys', 'kyns', 'orth', 'dhywgh', 'henna', 'leun', 'fydh', 'hweg', 'vynytha', 'bydh', 'abel', 'omma', 'onan', 'awos', 'kemmer', 'ellas', 'gwel', 'gwra', 'bones', 'deun', 'nuncius', 'pyth', 'ynwedh', 'bennath']
Text: PASSIO CHRISTI - KK Version made from Norris...
Collocations: Building collocations list
Mab Dyw; Princeps Annas; IVs Tortor; IIs Tortor; IIIs Tortor; pur wir;
heb lettya; tri dydh; kepar dell; dha vodh; IIs Doctor; Dydh Breus;
dhis lowena; Pur wir; dhe wruthyl; kettep onan; may hallo; Arloedh
ker; Myghtern Yedhewon; Nyns eus
None
number of words = 21260
number of different words = 3604
Lengths of words in descending order of frequency [(3, 4249), (2, 4152), (4, 3066), (5, 2425), (1, 2406), (6, 2115), (7, 1408), (8, 779), (9, 366), (10, 218), (11, 47), (12, 18), (13, 5), (14, 5), (16, 1)]
Top 50 words: ['a', 'y', 'n', 'yn', 'my', 'an', 'ha', 'dhe', 'ow', 'yw', 'et', 'rag', 'ny', 're', 'na', 'dha', 'ev', 'oll', 'hag', 'm', 'dyw', 'tortor', 'bys', 'war', 'gans', 'hic', 'mar', 'ihc', 'th', 'cayphas', 'ma', 'mab', 'may', 'ad', 'ihesu', 'dell', 'lemmyn', 'dhis', 'hwi', 'ty', 'heb', 'pan', 'tunc', 'vydh', 'wra', 'ni', 'arloedh', 's', 'den', 'dre']
Top 50 words of 4 or more letters: ['tortor', 'gans', 'cayphas', 'ihesu', 'dell', 'lemmyn', 'dhis', 'tunc', 'vydh', 'arloedh', 'pilatus', 'dhodho', 'sertan', 'meur', 'vynn', 'dicit', 'dhymm', 'henna', 'dhyn', 'mara', 'annas', 'skon', 'kyns', 'dhywgh', 'dhymmo', 'hware', 'agan', 'agas', 'lowena', 'ellas', 'gwas', 'gwir', 'mars', 'petrus', 'princeps', 'syrr', 'bras', 'hweg', 'lavar', 'worth', 'bydh', 'hayl', 'myghtern', 'dhiso', 'fydh', 'lever', 'nyns', 'yndella', 'awos', 'kettep']
Text: Resurrectio Domini - KK Version made from Norris...
Collocations: Building collocations list
pur wir; Spyrys Sans; fem .?}; verr spys; tressa dydh; kepar dell;
Arloedh ker; Penn vyghternedh; dhe dhasserghi; Kepar dell; vos
dasserghys; osculatur eos; IVs Miles; Ihesu Cryst; Mab Maria; tri
dydh; januis clausis; IIIs Miles; hakkra mernans; heb lettya
None
number of words = 16209
number of different words = 2635
Lengths of words in descending order of frequency [(3, 3314), (2, 3146), (4, 2310), (1, 2142), (5, 1864), (6, 1476), (7, 990), (8, 497), (9, 239), (10, 171), (11, 40), (12, 16), (13, 3), (14, 1)]
Top 50 words: ['a', 'y', 'n', 'yn', 'ha', 'dhe', 'an', 'ow', 'my', 'yw', 'ny', 'na', 'ev', 'rag', 'arloedh', 'dha', 'mar', 'ty', 'm', 'bys', 'dell', 'ni', 'oll', 'th', 'sur', 're', 'vydh', 'gans', 'meur', 'hag', 'pur', 'ihesu', 'lemmyn', 'nev', 'thomas', 'et', 'dre', 'ma', 'heb', 'bedh', 'pan', 'cryst', 'dhymmo', 'ns', 'dhymm', 'hwi', 'maria', 'dhis', 'dyw', 'neb']
Top 50 words of 4 or more letters: ['arloedh', 'dell', 'vydh', 'gans', 'meur', 'ihesu', 'lemmyn', 'thomas', 'bedh', 'cryst', 'dhymmo', 'dhymm', 'maria', 'dhis', 'dhyn', 'skon', 'bydh', 'korf', 'imperator', 'nyns', 'agan', 'miles', 'sertan', 'marow', 'henna', 'ellas', 'ynwedh', 'tortor', 'bras', 'dhywgh', 'lavar', 'vernona', 'agas', 'leun', 'drog', 'grys', 'myghtern', 'pilatus', 'genen', 'golonn', 'dasserghys', 'genev', 'krysi', 'vynn', 'dhodho', 'gwir', 'tunc', 'hedhyw', 'kepar', 'nevra']
Text: Solempnyta Blackheath . 17 Metheven 1997 . Wel...
Collocations: Building collocations list (Note that this is quite a short text)
Pow Sows; yth esa; dro dhe; brassa rann; mos tre; Hag ythó; Pur dha;
rann anedha; Gov haâ; dell grysav; esov omma; eus saw; rag covhÃ; den
vyth; omma rag; dhe Loundres; ledhys gans; Nyns eus; nyns eus; Henry
ledhys
None
number of words = 1264
number of different words = 511
Lengths of words in descending order of frequency [(2, 279), (3, 229), (4, 171), (5, 138), (1, 129), (6, 118), (7, 82), (8, 51), (9, 27), (10, 20), (11, 18), (13, 1), (14, 1)]
Top 50 words: ['an', 'a', 'n', 'yn', 'yw', 'ha', 'y', 'dhe', 'sowsnek', 'ma', 'ow', 'rag', 'my', 'nyns', 'henry', 'mes', 'vy', 'gans', 're', 'hag', 'omma', 'hy', 'o', 'dell', 'shakespeare', 'sows', 'taves', 'war', 'yeth', 'yth', 'dhymm', 'le', 'na', 'nebes', 'ny', 'oll', 'pan', 'po', 'pow', 'yma', 'bos', 'genev', 'gov', 'gwari', 'heb', 'hi', 'may', 'tus', 'vyth', 'aga']
Top 50 words of 4 or more letters: ['sowsnek', 'nyns', 'henry', 'gans', 'omma', 'dell', 'shakespeare', 'sows', 'taves', 'yeth', 'dhymm', 'nebes', 'genev', 'gwari', 'vyth', 'avel', 'erel', 'hedhyw', 'henna', 'kernewek', 'orth', 'sowsneger', 'studhya', 'whath', 'wosa', 'anedha', 'aral', 'bedh', 'blackheath', 'clewes', 'dann', 'dherag', 'honan', 'ledhys', 'margh', 'martesen', 'mernans', 'nans', 'ogas', 'skila', 'tiwedh', 'vernans', 'yethow', 'arta', 'bothek', 'brassa', 'cales', 'cansblydhen', 'clappya', 'codhas']
Text: Osta karer Arloedh An Bysowyer ? Wel ,... (A chapter of Lord Of the Rings rendered in Cornish at www.keskewsel.com)
Collocations: Building collocations list
Yth esa; yth esa; dhe vos; haval orth; Unn Bysow; medh Gandalf; Bag
End; Nyns eus; dro dhe; leveris Gandalf; wovynnas Frodo; medh Frodo;
neb kas; fatell wrug; dell dybav; dann gel; res dhis; rewlya oll;
Parkow Gladen; dre dermyn
None
number of words = 11147
number of different words = 1966
Lengths of words in descending order of frequency [(2, 2309), (3, 2004), (1, 1526), (4, 1442), (5, 1342), (6, 976), (7, 772), (8, 395), (9, 185), (10, 129), (11, 36), (12, 10), (13, 9), (17, 5), (15, 3), (18, 2), (14, 1), (16, 1)]
Top 50 words: ['a', 'an', 'ev', 'y', 'yn', 'ha', 'n', 'dhe', 'hag', 'mes', 'o', 'ow', 'na', 'yw', 'ny', 'frodo', 'bysow', 'esa', 'vy', 'yth', 're', 'my', 'nyns', 'gans', 'wrug', 'dell', 'bos', 'rag', 'i', 'oll', 'gandalf', 'vos', 'bylbo', 'orth', 'po', 'mar', 'termyn', 'henna', 'dre', 'leveris', 'meur', 'dhodho', 'medh', 'aga', 'es', 'pan', 'pur', 'dres', 'ta', 'yma']
Top 50 words of 4 or more letters: ['frodo', 'bysow', 'nyns', 'gans', 'wrug', 'dell', 'gandalf', 'bylbo', 'orth', 'termyn', 'henna', 'leveris', 'meur', 'dhodho', 'medh', 'dres', 'arta', 'kever', 'nerth', 'dhymm', 'diworth', 'golum', 'shayr', 'tewl', 'haval', 'hobytow', 'hwir', 'nebes', 'wosa', 'henn', 'honan', 'lemmyn', 'yndella', 'arall', 'kyns', 'vydh', 'hwath', 'ganso', 'klywes', 'pyth', 'woer', 'drefenn', 'elfow', 'leverel', 'owth', 'ytho', 'dhis', 'nans', 'nevra', 'orto']
Text: THE TREGEAR HOMILIES KK Version made from Christopher...
Collocations: Building collocations list
dhe vos; kepar dell; Spyrys Sans; agan Savyour; katholik eglos; pub
eur; mar veur; heb diwedh; fatell wrug; Yesu Krist; dre reson; agan
honan; Savyour Yesu; res dhyn; agan Arloedh; Katholik Eglos; Sans
Eglos; dell wrug; dhe leverel; gan Savyour
None
number of words = 40897
number of different words = 5246
Lengths of words in descending order of frequency [(2, 8508), (3, 7334), (1, 5121), (4, 5001), (5, 4461), (6, 3516), (7, 2555), (8, 2112), (9, 1009), (10, 637), (11, 317), (12, 155), (13, 99), (14, 43), (15, 17), (16, 5), (17, 3), (19, 2), (18, 1), (20, 1)]
Top 50 words: ['a', 'ha', 'an', 'n', 'dhe', 'y', 'yn', 'yw', 'ow', 'ni', 'ev', 'ma', 'na', 'rag', 'krist', 'agan', 'wrug', 's', 'oll', 'dre', 'yma', 'eglos', 'dyw', 'gans', 'hag', 'bonner', 'fatell', 'henna', 'et', 'kepar', 'den', 'leverel', 'vos', 'aga', 'yth', 'mar', 'keth', 're', 'honan', 'dell', 'bos', 'i', 'in', 'vydh', 'folio', 'ny', 'o', 'de', 'homily', 'nyns']
Top 50 words of 4 or more letters: ['krist', 'agan', 'wrug', 'eglos', 'gans', 'bonner', 'fatell', 'henna', 'kepar', 'leverel', 'keth', 'honan', 'dell', 'vydh', 'folio', 'homily', 'nyns', 'dhyn', 'dhyw', 'ynwedh', 'korf', 'savyour', 'rakhenna', 'hemma', 'henn', 'dhiworth', 'katholik', 'onan', 'geryow', 'pyth', 'hwath', 'arloedh', 'peder', 'chaptra', 'gwrys', 'omma', 'yndella', 'skryptor', 'lemmyn', 'bobel', 'sans', 'arall', 'dhodho', 'goes', 'leveris', 'lies', 'spyrys', 'agas', 'powl', 'termyn']
Here we see what percentage of the text words of various lengths make up:
Text: Improved version 21 / 10 / 96 Bewnans...
3 letters : 14.05 %
2 letters : 13.27 %
4 letters : 10.63 %
5 letters : 9.020 %
1 letters : 8.490 %
6 letters : 7.271 %
7 letters : 4.681 %
8 letters : 3.255 %
9 letters : 1.688 %
10 letters : 1.062 %
11 letters : 0.317 %
12 letters : 0.157 %
13 letters : 0.049 %
14 letters : 0.005 %
18 letters : 0.002 %
Text: # ------------------------------------------------------------------------ # # The text of _Gwreans...
3 letters : 15.71 %
2 letters : 13.70 %
4 letters : 11.02 %
5 letters : 7.590 %
1 letters : 7.577 %
6 letters : 6.715 %
7 letters : 3.403 %
8 letters : 1.735 %
9 letters : 0.656 %
10 letters : 0.268 %
11 letters : 0.132 %
12 letters : 0.018 %
Text: # --------- ORIGO MUNDI --------- # Keith Syed...
3 letters : 16.67 %
2 letters : 14.98 %
4 letters : 11.83 %
1 letters : 9.376 %
5 letters : 8.264 %
6 letters : 7.020 %
7 letters : 4.722 %
8 letters : 2.316 %
9 letters : 0.844 %
10 letters : 0.479 %
12 letters : 0.103 %
11 letters : 0.093 %
13 letters : 0.009 %
14 letters : 0.009 %
Text: PASSIO CHRISTI - KK Version made from Norris...
3 letters : 15.49 %
2 letters : 15.14 %
4 letters : 11.18 %
5 letters : 8.844 %
1 letters : 8.774 %
6 letters : 7.713 %
7 letters : 5.135 %
8 letters : 2.841 %
9 letters : 1.334 %
10 letters : 0.795 %
11 letters : 0.171 %
12 letters : 0.065 %
13 letters : 0.018 %
14 letters : 0.018 %
16 letters : 0.003 %
Text: Resurrectio Domini - KK Version made from Norris...
3 letters : 15.66 %
2 letters : 14.86 %
4 letters : 10.91 %
1 letters : 10.12 %
5 letters : 8.808 %
6 letters : 6.974 %
7 letters : 4.678 %
8 letters : 2.348 %
9 letters : 1.129 %
10 letters : 0.808 %
11 letters : 0.189 %
12 letters : 0.075 %
13 letters : 0.014 %
14 letters : 0.004 %
Text: Solempnyta Blackheath . 17 Metheven 1997 . Wel...
2 letters : 16.55 %
3 letters : 13.59 %
4 letters : 10.14 %
5 letters : 8.189 %
1 letters : 7.655 %
6 letters : 7.002 %
7 letters : 4.866 %
8 letters : 3.026 %
9 letters : 1.602 %
10 letters : 1.186 %
11 letters : 1.068 %
13 letters : 0.059 %
14 letters : 0.059 %
Text: Osta karer Arloedh An Bysowyer ? Wel ,...
2 letters : 15.47 %
3 letters : 13.42 %
1 letters : 10.22 %
4 letters : 9.661 %
5 letters : 8.991 %
6 letters : 6.539 %
7 letters : 5.172 %
8 letters : 2.646 %
9 letters : 1.239 %
10 letters : 0.864 %
11 letters : 0.241 %
12 letters : 0.067 %
13 letters : 0.060 %
17 letters : 0.033 %
15 letters : 0.020 %
18 letters : 0.013 %
14 letters : 0.006 %
16 letters : 0.006 %
Text: THE TREGEAR HOMILIES KK Version made from Christopher...
2 letters : 16.19 %
3 letters : 13.95 %
1 letters : 9.745 %
4 letters : 9.517 %
5 letters : 8.489 %
6 letters : 6.691 %
7 letters : 4.862 %
8 letters : 4.019 %
9 letters : 1.920 %
10 letters : 1.212 %
11 letters : 0.603 %
12 letters : 0.294 %
13 letters : 0.188 %
14 letters : 0.081 %
15 letters : 0.032 %
16 letters : 0.009 %
17 letters : 0.005 %
19 letters : 0.003 %
18 letters : 0.001 %
20 letters : 0.001 %
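Word-length percentages like the ones above can be computed by counting token lengths and dividing by a total. This sketch is not the code behind the figures in this post: the function name `length_percentages` is hypothetical, the input is a toy token list, and dividing by every token (punctuation included) is an assumption about how the totals were taken.

```python
from collections import Counter


def length_percentages(tokens):
    """Return (length, percentage) pairs, most frequent length first.

    Only alphabetic tokens are measured, but the denominator is the
    number of all tokens, punctuation included (an assumption).
    """
    total = len(tokens)
    lengths = Counter(len(t) for t in tokens if t.isalpha())
    return [(length, 100.0 * count / total)
            for length, count in lengths.most_common()]


# Toy token list, not taken from any of the texts analysed here.
tokens = ["An", "den", "a", "wrug", "an", "eglos", ".",
          "Ha", "ev", "a", "dheuth", "."]
for length, pct in length_percentages(tokens):
    print(f"{length} letters : {pct:.2f} %")
```

With this choice of denominator the percentages add up to less than 100% whenever the token list contains punctuation. Dividing by the number of alphabetic tokens instead would make them sum to 100%.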
Here is the percentage of each text accounted for by each of its 20 most frequent words:
Text: Improved version 21 / 10 / 96 Bewnans...
a : 3.233 %
y : 1.613 %
n : 1.431 %
dhe : 1.359 %
ha : 1.274 %
yn : 1.150 %
an : 1.051 %
ow : 1.006 %
my : 0.976 %
yw : 0.846 %
c : 0.822 %
ny : 0.689 %
na : 0.590 %
re : 0.571 %
s : 0.513 %
dha : 0.499 %
omma : 0.499 %
pur : 0.496 %
ni : 0.463 %
m : 0.449 %
Text: # ------------------------------------------------------------------------ # # The text of _Gwreans...
a : 3.781 %
ha : 1.581 %
y : 1.476 %
n : 1.435 %
an : 1.362 %
dhe : 1.316 %
my : 1.152 %
ow : 1.075 %
yw : 0.952 %
yn : 0.947 %
ny : 0.701 %
na : 0.651 %
yth : 0.542 %
adam : 0.492 %
the : 0.482 %
bys : 0.473 %
pur : 0.469 %
dha : 0.446 %
ty : 0.441 %
vydh : 0.428 %
Text: # --------- ORIGO MUNDI --------- # Keith Syed...
a : 4.890 %
ha : 1.753 %
y : 1.719 %
an : 1.570 %
n : 1.526 %
yn : 1.511 %
dhe : 1.383 %
ow : 1.373 %
my : 1.249 %
dha : 0.844 %
rag : 0.755 %
yw : 0.755 %
na : 0.666 %
ny : 0.666 %
dyw : 0.573 %
oll : 0.568 %
m : 0.543 %
re : 0.533 %
bys : 0.508 %
th : 0.464 %
Text: PASSIO CHRISTI - KK Version made from Norris...
a : 3.924 %
y : 2.337 %
n : 1.531 %
yn : 1.385 %
my : 1.316 %
an : 1.283 %
ha : 1.276 %
dhe : 1.050 %
ow : 1.013 %
yw : 0.846 %
et : 0.751 %
rag : 0.707 %
ny : 0.689 %
re : 0.652 %
na : 0.649 %
dha : 0.576 %
ev : 0.503 %
oll : 0.452 %
hag : 0.415 %
m : 0.404 %
Text: Resurrectio Domini - KK Version made from Norris...
a : 4.853 %
y : 2.216 %
n : 1.989 %
yn : 1.606 %
ha : 1.356 %
dhe : 1.327 %
an : 1.256 %
ow : 1.063 %
my : 1.001 %
yw : 0.954 %
ny : 0.907 %
na : 0.803 %
ev : 0.666 %
rag : 0.619 %
arloedh : 0.567 %
dha : 0.533 %
mar : 0.496 %
ty : 0.472 %
m : 0.448 %
bys : 0.430 %
Text: Solempnyta Blackheath . 17 Metheven 1997 . Wel...
an : 3.738 %
a : 3.323 %
n : 2.195 %
yn : 1.721 %
yw : 1.661 %
ha : 1.602 %
y : 1.186 %
dhe : 1.127 %
sowsnek : 1.127 %
ma : 1.008 %
ow : 1.008 %
rag : 0.890 %
my : 0.830 %
nyns : 0.830 %
henry : 0.712 %
mes : 0.652 %
vy : 0.652 %
gans : 0.593 %
re : 0.593 %
hag : 0.534 %
Text: Osta karer Arloedh An Bysowyer ? Wel ,...
a : 4.783 %
an : 2.599 %
ev : 2.512 %
y : 2.445 %
yn : 2.070 %
ha : 1.862 %
n : 1.587 %
dhe : 1.226 %
hag : 1.038 %
mes : 1.011 %
o : 0.884 %
ow : 0.824 %
na : 0.676 %
yw : 0.663 %
ny : 0.636 %
frodo : 0.596 %
bysow : 0.589 %
esa : 0.509 %
vy : 0.502 %
yth : 0.495 %
Text: THE TREGEAR HOMILIES KK Version made from Christopher...
a : 4.411 %
ha : 3.016 %
an : 2.925 %
n : 2.148 %
dhe : 1.899 %
y : 1.821 %
yn : 1.288 %
yw : 1.157 %
ow : 1.073 %
ni : 0.858 %
ev : 0.839 %
ma : 0.749 %
na : 0.698 %
rag : 0.679 %
krist : 0.664 %
agan : 0.616 %
wrug : 0.603 %
s : 0.527 %
oll : 0.464 %
dre : 0.439 %