Frequency Analysis

Reset
2,003,909
Total Tokens
117,128
Unique Types
5.84%
Type-Token Ratio
2,343
Corpus Entries

Word Frequency List

117,128 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
3221 tababbarada 64
0.00%
0.0032% 31.9
3222 bandhiggaasi 64
0.00%
0.0032% 31.9
3223 alshabaab 63
0.00%
0.0031% 31.4
3224 hargaysa 63
0.00%
0.0031% 31.4
3225 yaasiin 63
0.00%
0.0031% 31.4
3226 hufan 63
0.00%
0.0031% 31.4
3227 wakiilada 63
0.00%
0.0031% 31.4
3228 uuliyadda 63
0.00%
0.0031% 31.4
3229 ujeedka 63
0.00%
0.0031% 31.4
3230 tihiin 63
0.00%
0.0031% 31.4

Top 10 Words

oo
54394
ka
48093
ku
42087
ay
37920
ah
37194
iyo
33549
ee
32790
in
31029
uu
26440
ayaa
25605