Frequency Analysis

Reset
1,180,353
Total Tokens
72,703
Unique Types
6.16%
Type-Token Ratio
1,759
Corpus Entries

Word Frequency List

72,703 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
471 11 291
0.02%
0.0247% 246.5
472 heli 291
0.02%
0.0247% 246.5
473 xuseen 290
0.02%
0.0246% 245.7
474 saartay 290
0.02%
0.0246% 245.7
475 shaqo 289
0.02%
0.0245% 244.8
476 jireen 289
0.02%
0.0245% 244.8
477 xiran 288
0.02%
0.0244% 244.0
478 waqooyi 287
0.02%
0.0243% 243.1
479 ingiriiska 287
0.02%
0.0243% 243.1
480 heshiis 286
0.02%
0.0242% 242.3

By Language

LanguageEntries%
Somali 1,755
99.8%
somali 4
0.2%

Top 10 Words

oo
32798
ka
28967
ay
26682
ku
24670
ah
23430
ee
21367
in
21157
ayaa
20291
uu
15813
iyo
15700