Frequency Analysis

Reset
1,180,353
Total Tokens
72,703
Unique Types
6.16%
Type-Token Ratio
1,759
Corpus Entries

Word Frequency List

72,703 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
71811 dardargeliyo 1
0.00%
0.0001% 0.8
71812 adkeyneynaa 1
0.00%
0.0001% 0.8
71813 coffee 1
0.00%
0.0001% 0.8
71814 file38161522 1
0.00%
0.0001% 0.8
71815 dushana 1
0.00%
0.0001% 0.8
71816 xabadihii 1
0.00%
0.0001% 0.8
71817 file38161567 1
0.00%
0.0001% 0.8
71818 kufsan 1
0.00%
0.0001% 0.8
71819 wiilashiina 1
0.00%
0.0001% 0.8
71820 ehelkii 1
0.00%
0.0001% 0.8

By Language

LanguageEntries%
Somali 1,755
99.8%
somali 4
0.2%

Top 10 Words

oo
32798
ka
28967
ay
26682
ku
24670
ah
23430
ee
21367
in
21157
ayaa
20291
uu
15813
iyo
15700