Frequency Analysis

Total Tokens: 2,308,541
Unique Types: 144,351
Type-Token Ratio: 6.25%
Corpus Entries: 3,070
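
The type-token ratio shown above is consistent with unique types divided by total tokens, expressed as a percentage. A minimal Python sketch under that assumption follows; the function name is illustrative and not part of the tool.

```python
def type_token_ratio(unique_types: int, total_tokens: int) -> float:
    """Return the type-token ratio as a percentage.

    Assumes the ratio is defined as unique types / total tokens * 100.
    """
    if total_tokens <= 0:
        raise ValueError("total_tokens must be positive")
    return 100.0 * unique_types / total_tokens


# Figures taken from the summary statistics above.
ttr = type_token_ratio(144_351, 2_308_541)
print(f"{ttr:.2f}%")
```

With the corpus figures above, this reproduces the reported 6.25%.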

Word Frequency List

144,351 words
Rank  Word          Freq   Distribution  % Tokens  Per 1M
121   com           1,576  0.07%         0.0683%   682.7
122   hadda         1,574  0.07%         0.0682%   681.8
123   kadib         1,567  0.07%         0.0679%   678.8
124   qaar          1,564  0.07%         0.0677%   677.5
125   tirsan        1,558  0.07%         0.0675%   674.9
126   maanta        1,553  0.07%         0.0673%   672.7
127   Soomaaliyeed  1,551  0.07%         0.0672%   671.9
128   Muqdisho      1,539  0.07%         0.0667%   666.7
129   qofka         1,508  0.07%         0.0653%   653.2
130   doono         1,494  0.06%         0.0647%   647.2
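
The "% Tokens" and "Per 1M" columns appear to be simple normalizations of the raw frequency by the total token count. A short Python sketch under that assumption follows; the helper names are hypothetical and not taken from the tool.

```python
TOTAL_TOKENS = 2_308_541  # total token count reported for this corpus


def percent_of_tokens(freq: int, total: int = TOTAL_TOKENS) -> float:
    """Share of all tokens accounted for by one word, as a percentage."""
    return 100.0 * freq / total


def per_million(freq: int, total: int = TOTAL_TOKENS) -> float:
    """Frequency normalized to occurrences per one million tokens."""
    return 1_000_000 * freq / total


# Example: the entry at rank 121 ("com", 1,576 occurrences).
print(f"{percent_of_tokens(1576):.4f}%  {per_million(1576):.1f}")
```

Normalizing to occurrences per million tokens makes the figures comparable across corpora of different sizes, which raw counts are not.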

Top 10 Words

Word  Freq
oo    62,045
ka    52,893
ku    46,423
ah    42,737
ay    42,498
ee    37,436
iyo   37,181
in    34,858
ayaa  29,666
uu    29,539