Frequency Analysis

Reset
1,175,359
Total Tokens
72,305
Unique Types
6.15%
Type-Token Ratio
1,758
Corpus Entries

Word Frequency List

72,305 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
331 qaatay 411
0.03%
0.0350% 349.7
332 tilmaamay 411
0.03%
0.0350% 349.7
333 maadaama 409
0.03%
0.0348% 348.0
334 ururka 408
0.03%
0.0347% 347.1
335 jirtay 408
0.03%
0.0347% 347.1
336 leedahay 406
0.03%
0.0345% 345.4
337 lahayn 406
0.03%
0.0345% 345.4
338 ula 405
0.03%
0.0345% 344.6
339 dambe 404
0.03%
0.0344% 343.7
340 bariga 400
0.03%
0.0340% 340.3

By Language

LanguageEntries%
Somali 1,755
99.8%
somali 3
0.2%

Top 10 Words

oo
32678
ka
28876
ay
26627
ku
24569
ah
23345
ee
21302
in
21075
ayaa
20270
uu
15763
iyo
15577