Frequency Analysis

Total Tokens: 1,251,873
Unique Types: 75,764
Type-Token Ratio: 6.05%
Corpus Entries: 1,992
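
The type-token ratio above is consistent with dividing unique types by total tokens. A minimal sketch of that arithmetic, assuming the ratio is reported as a percentage rounded to two decimals (the function name `type_token_ratio` is hypothetical, not part of the tool):

```python
def type_token_ratio(unique_types: int, total_tokens: int) -> float:
    """Return unique types as a percentage of total tokens."""
    if total_tokens <= 0:
        raise ValueError("total_tokens must be positive")
    return 100.0 * unique_types / total_tokens


# Figures taken from the summary above.
ttr = type_token_ratio(75_764, 1_251_873)
print(f"{ttr:.2f}%")  # prints 6.05%
```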

Word Frequency List

75,764 words
Rank  Word          Freq  Distribution  % Tokens  Per 1M
701   ayaana        218   0.02%         0.0174%   174.1
702   cabdiraxmaan  217   0.02%         0.0173%   173.3
703   meelaha       217   0.02%         0.0173%   173.3
704   nolosha       217   0.02%         0.0173%   173.3
705   qaba          216   0.02%         0.0173%   172.5
706   arrinta       216   0.02%         0.0173%   172.5
707   putin         216   0.02%         0.0173%   172.5
708   muuqaalka     216   0.02%         0.0173%   172.5
709   doortay       215   0.02%         0.0172%   171.7
710   xun           215   0.02%         0.0172%   171.7
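
The "% Tokens" and "Per 1M" columns appear to be the raw frequency normalized by the total token count. A small sketch of that normalization, assuming the corpus total of 1,251,873 tokens is the denominator (the helper names `percent_of_tokens` and `per_million` are illustrative only):

```python
TOTAL_TOKENS = 1_251_873  # total tokens reported for this corpus


def percent_of_tokens(freq: int, total: int = TOTAL_TOKENS) -> float:
    """Share of all tokens accounted for by one word, as a percentage."""
    return 100.0 * freq / total


def per_million(freq: int, total: int = TOTAL_TOKENS) -> float:
    """Frequency normalized to occurrences per one million tokens."""
    return freq * 1_000_000 / total


# Two rows from the table: rank 701 and rank 710.
for word, freq in [("ayaana", 218), ("xun", 215)]:
    print(f"{word}: {percent_of_tokens(freq):.4f}%  {per_million(freq):.1f}")
```

Normalizing per million tokens makes these frequencies comparable with counts from corpora of a different size.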

By Language

Language  Entries  %
Somali    1,989    99.8%
somali    3        0.2%

Top 10 Words

Word  Freq
oo    34,855
ka    30,584
ay    28,097
ku    25,969
ah    24,568
ee    22,706
in    22,097
ayaa  21,364
iyo   17,018
uu    16,278