Frequency Analysis

Reset
2,308,541
Total Tokens
144,351
Unique Types
6.25%
Type-Token Ratio
3,070
Corpus Entries

Word Frequency List

144,351 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
51 sii 3,478
0.15%
0.1507% 1,506.6
52 leh 3,471
0.15%
0.1504% 1,503.5
53 magaalada 3,395
0.15%
0.1471% 1,470.6
54 yihiin 3,334
0.14%
0.1444% 1,444.2
55 ma 3,213
0.14%
0.1392% 1,391.8
56 ahaan 3,136
0.14%
0.1358% 1,358.4
57 Qoraalka 3,131
0.14%
0.1356% 1,356.3
58 marka 3,123
0.14%
0.1353% 1,352.8
59 file 3,021
0.13%
0.1309% 1,308.6
60 imported 3,014
0.13%
0.1306% 1,305.6

Top 10 Words

oo
62045
ka
52893
ku
46423
ah
42737
ay
42498
ee
37436
iyo
37181
in
34858
ayaa
29666
uu
29539