Frequency Analysis

Reset
1,175,359
Total Tokens
72,305
Unique Types
6.15%
Type-Token Ratio
1,758
Corpus Entries

Word Frequency List

72,305 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
371 meesha 378
0.03%
0.0322% 321.6
372 xitaa 378
0.03%
0.0322% 321.6
373 dadkii 375
0.03%
0.0319% 319.1
374 jirin 373
0.03%
0.0317% 317.3
375 taagan 373
0.03%
0.0317% 317.3
376 arrintan 372
0.03%
0.0316% 316.5
377 karto 371
0.03%
0.0316% 315.6
378 sharciga 370
0.03%
0.0315% 314.8
379 cad 369
0.03%
0.0314% 313.9
380 we 369
0.03%
0.0314% 313.9

By Language

LanguageEntries%
Somali 1,755
99.8%
somali 3
0.2%

Top 10 Words

oo
32678
ka
28876
ay
26627
ku
24569
ah
23345
ee
21302
in
21075
ayaa
20270
uu
15763
iyo
15577