Frequency Analysis

Total Tokens: 1,175,359
Unique Types: 72,305
Type-Token Ratio: 6.15%
Corpus Entries: 1,758
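
The type-token ratio shown here is consistent with unique types divided by total tokens, expressed as a percentage. A minimal sketch of that arithmetic follows; the function name `type_token_ratio` is hypothetical and not part of the tool that produced this page.

```python
def type_token_ratio(unique_types: int, total_tokens: int) -> float:
    """Return the type-token ratio as a percentage (hypothetical helper)."""
    if total_tokens <= 0:
        raise ValueError("total_tokens must be positive")
    return 100.0 * unique_types / total_tokens


# Figures taken from the summary on this page.
ttr = type_token_ratio(72_305, 1_175_359)
print(f"{ttr:.2f}%")  # 6.15%
```

A ratio this low is expected for a corpus of over a million tokens, since frequent function words repeat many times while the vocabulary grows slowly.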

Word Frequency List

72,305 words
Rank  Word       Freq  Distribution  % Tokens  Per 1M
311   sharci     438   0.04%         0.0373%   372.7
312   kaga       436   0.04%         0.0371%   371.0
313   puntland   433   0.04%         0.0368%   368.4
314   kusoo      433   0.04%         0.0368%   368.4
315   news       433   0.04%         0.0368%   368.4
316   dhulka     432   0.04%         0.0368%   367.5
317   noqon      432   0.04%         0.0368%   367.5
318   cumar      424   0.04%         0.0361%   360.7
319   madaxa     424   0.04%         0.0361%   360.7
320   dambeeyay  424   0.04%         0.0361%   360.7
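
The "% Tokens" and "Per 1M" columns appear to be the raw frequency normalized by the total token count (1,175,359). The sketch below reproduces a few rows under that assumption; `percent_of_tokens` and `per_million` are hypothetical helper names, not functions from the tool itself.

```python
TOTAL_TOKENS = 1_175_359  # total token count reported for this corpus


def percent_of_tokens(freq: int, total: int = TOTAL_TOKENS) -> float:
    """Share of all tokens accounted for by one word, in percent."""
    return 100.0 * freq / total


def per_million(freq: int, total: int = TOTAL_TOKENS) -> float:
    """Frequency normalized to occurrences per one million tokens."""
    return 1_000_000 * freq / total


# Three rows from the frequency list on this page.
for word, freq in [("sharci", 438), ("puntland", 433), ("cumar", 424)]:
    print(f"{word:10s} {freq:5d} {percent_of_tokens(freq):.4f}% {per_million(freq):.1f}")
```

Normalizing to a per-million rate makes frequencies comparable across corpora of different sizes, which the raw counts alone are not.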

By Language

Language  Entries  %
Somali    1,755    99.8%
somali    3        0.2%
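
The two rows differ only in capitalization, which suggests the language label was entered inconsistently. Assuming both labels refer to the same language (the page does not confirm this), a sketch of merging them by case-insensitive label could look like the following; `merge_language_counts` is a hypothetical helper.

```python
from collections import Counter


def merge_language_counts(counts: dict[str, int]) -> dict[str, int]:
    """Merge entry counts whose labels differ only by case or padding."""
    merged: Counter[str] = Counter()
    for label, n in counts.items():
        # casefold() gives a case-insensitive key; strip() drops stray spaces.
        merged[label.strip().casefold()] += n
    return dict(merged)


# Entry counts from the table on this page.
merged = merge_language_counts({"Somali": 1755, "somali": 3})
print(merged)  # {'somali': 1758}
```

The merged total matches the 1,758 corpus entries reported in the summary.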

Top 10 Words

Word  Freq
oo    32678
ka    28876
ay    26627
ku    24569
ah    23345
ee    21302
in    21075
ayaa  20270
uu    15763
iyo   15577