Frequency Analysis

Reset
2,308,541
Total Tokens
144,351
Unique Types
6.25%
Type-Token Ratio
3,070
Corpus Entries

Word Frequency List

144,351 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
421 dhaqan 629
0.03%
0.0272% 272.5
422 tilmaamay 629
0.03%
0.0272% 272.5
423 lasoo 629
0.03%
0.0272% 272.5
424 shacabka 628
0.03%
0.0272% 272.0
425 qoray 628
0.03%
0.0272% 272.0
426 maadaama 627
0.03%
0.0272% 271.6
427 19 626
0.03%
0.0271% 271.2
428 dhintay 626
0.03%
0.0271% 271.2
429 warbaahinta 624
0.03%
0.0270% 270.3
430 aanu 623
0.03%
0.0270% 269.9

Top 10 Words

oo
62045
ka
52893
ku
46423
ah
42737
ay
42498
ee
37436
iyo
37181
in
34858
ayaa
29666
uu
29539