Frequency Analysis

Total Tokens: 2,308,541
Unique Types: 144,351
Type-Token Ratio: 6.25%
Corpus Entries: 3,070
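
The type-token ratio follows from the two counts: unique types divided by total tokens. A minimal sketch of that arithmetic in Python; the function name `type_token_ratio` is illustrative and not part of the tool that produced these figures:

```python
def type_token_ratio(unique_types: int, total_tokens: int) -> float:
    """Return the type-token ratio as a fraction (types / tokens)."""
    if total_tokens <= 0:
        raise ValueError("total_tokens must be positive")
    return unique_types / total_tokens


# Counts taken from the summary figures for this corpus.
ttr = type_token_ratio(144_351, 2_308_541)
print(f"{ttr:.2%}")  # 6.25%
```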

Word Frequency List

144,351 words
Rank  Word        Freq   Distribution  % Tokens  Per 1M
31    wax         6,663  0.29%         0.2886%   2,886.2
32    sida        5,940  0.26%         0.2573%   2,573.1
33    tahay       5,789  0.25%         0.2508%   2,507.6
34    wuxuu       5,747  0.25%         0.2489%   2,489.5
35    kala        5,626  0.24%         0.2437%   2,437.0
36    yahay       5,497  0.24%         0.2381%   2,381.2
37    Soomaaliya  4,980  0.22%         0.2157%   2,157.2
38    inuu        4,967  0.22%         0.2152%   2,151.6
39    dadka       4,932  0.21%         0.2136%   2,136.4
40    markii      4,855  0.21%         0.2103%   2,103.1
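
The "% Tokens" and "Per 1M" columns appear to be the raw frequency normalized by the total token count (2,308,541). A minimal sketch of that calculation, assuming this is how the tool derives them; `relative_frequency` and `TOTAL_TOKENS` are hypothetical names introduced here for illustration:

```python
TOTAL_TOKENS = 2_308_541  # total token count reported for the corpus


def relative_frequency(freq: int, total: int = TOTAL_TOKENS) -> tuple[float, float]:
    """Return (percent of all tokens, occurrences per million tokens)."""
    if total <= 0:
        raise ValueError("total must be positive")
    share = freq / total
    return share * 100, share * 1_000_000


# Rank 31, "wax", with a raw frequency of 6,663.
pct, per_million = relative_frequency(6_663)
print(f"{pct:.4f}% {per_million:,.1f}")  # 0.2886% 2,886.2
```

Per-million values make counts comparable across corpora of different sizes, which raw frequencies are not.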

Top 10 Words

Word  Freq
oo    62,045
ka    52,893
ku    46,423
ah    42,737
ay    42,498
ee    37,436
iyo   37,181
in    34,858
ayaa  29,666
uu    29,539