Frequency Analysis

Total Tokens: 2,308,541
Unique Types: 144,351
Type-Token Ratio: 6.25%
Corpus Entries: 3,070

Word Frequency List

144,351 words
Rank  Word       Freq  Distribution  % Tokens  Per 1M
491   Iiraan     545   0.02%         0.0236%   236.1
492   war        542   0.02%         0.0235%   234.8
493   Puntland   541   0.02%         0.0234%   234.3
494   caafimaad  541   0.02%         0.0234%   234.3
495   kalena     540   0.02%         0.0234%   233.9
496   Maxamuud   538   0.02%         0.0233%   233.0
497   Qof        537   0.02%         0.0233%   232.6
498   Soomaali   534   0.02%         0.0231%   231.3
499   keliya     534   0.02%         0.0231%   231.3
500   booliska   534   0.02%         0.0231%   231.3
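
The "% Tokens" and "Per 1M" columns appear to be the raw frequency normalized by the total token count (2,308,541). The following is a minimal sketch assuming that normalization; `relative_frequency` is a hypothetical helper, not the tool's own code.

```python
TOTAL_TOKENS = 2_308_541  # total tokens reported for this corpus


def relative_frequency(freq: int, total_tokens: int = TOTAL_TOKENS) -> tuple[float, float]:
    """Return (percent of tokens, occurrences per million tokens)."""
    share = freq / total_tokens
    return share * 100, share * 1_000_000


# Rank 491, "Iiraan", with a raw frequency of 545.
pct, per_million = relative_frequency(545)
print(f"{pct:.4f}% {per_million:.1f}")  # prints 0.0236% 236.1
```

Per-million values make counts comparable across corpora of different sizes, which the raw frequencies alone do not.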

Top 10 Words

oo    62045
ka    52893
ku    46423
ah    42737
ay    42498
ee    37436
iyo   37181
in    34858
ayaa  29666
uu    29539