Frequency Analysis

Reset
1,251,873
Total Tokens
75,764
Unique Types
6.05%
Type-Token Ratio
1,992
Corpus Entries

Word Frequency List

75,764 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
711 xadka 215
0.02%
0.0172% 171.7
712 lix 215
0.02%
0.0172% 171.7
713 wakaaladda 214
0.02%
0.0171% 170.9
714 dhici 214
0.02%
0.0171% 170.9
715 xusay 213
0.02%
0.0170% 170.1
716 doonayo 213
0.02%
0.0170% 170.1
717 diinta 213
0.02%
0.0170% 170.1
718 maamul 211
0.02%
0.0169% 168.5
719 doorasho 210
0.02%
0.0168% 167.7
720 hub 210
0.02%
0.0168% 167.7

By Language

LanguageEntries%
Somali 1,989
99.8%
somali 3
0.2%

Top 10 Words

oo
34855
ka
30584
ay
28097
ku
25969
ah
24568
ee
22706
in
22097
ayaa
21364
iyo
17018
uu
16278