Frequency Analysis

Reset
1,180,353
Total Tokens
72,703
Unique Types
6.16%
Type-Token Ratio
1,759
Corpus Entries

Word Frequency List

72,703 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
71801 seddaax 1
0.00%
0.0001% 0.8
71802 dalacsiinayo 1
0.00%
0.0001% 0.8
71803 file38159479 1
0.00%
0.0001% 0.8
71804 dadargeliyo 1
0.00%
0.0001% 0.8
71805 waddooyina 1
0.00%
0.0001% 0.8
71806 dhammaadan 1
0.00%
0.0001% 0.8
71807 boondheereoo 1
0.00%
0.0001% 0.8
71808 kaashanayaa 1
0.00%
0.0001% 0.8
71809 kuugaleynay 1
0.00%
0.0001% 0.8
71810 wareysaneyno 1
0.00%
0.0001% 0.8

By Language

LanguageEntries%
Somali 1,755
99.8%
somali 4
0.2%

Top 10 Words

oo
32798
ka
28967
ay
26682
ku
24670
ah
23430
ee
21367
in
21157
ayaa
20291
uu
15813
iyo
15700