Frequency Analysis

Reset
1,175,359
Total Tokens
72,305
Unique Types
6.15%
Type-Token Ratio
1,758
Corpus Entries

Word Frequency List

72,305 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
451 goobta 301
0.03%
0.0256% 256.1
452 maxkamadda 299
0.03%
0.0254% 254.4
453 dhinac 299
0.03%
0.0254% 254.4
454 doona 298
0.03%
0.0254% 253.5
455 su 297
0.03%
0.0253% 252.7
456 shaqaalaha 297
0.03%
0.0253% 252.7
457 abiy 297
0.03%
0.0253% 252.7
458 rasmi 295
0.03%
0.0251% 251.0
459 dawladda 295
0.03%
0.0251% 251.0
460 maalin 294
0.03%
0.0250% 250.1

By Language

LanguageEntries%
Somali 1,755
99.8%
somali 3
0.2%

Top 10 Words

oo
32678
ka
28876
ay
26627
ku
24569
ah
23345
ee
21302
in
21075
ayaa
20270
uu
15763
iyo
15577