Frequency Analysis

Reset
2,308,541
Total Tokens
144,351
Unique Types
6.25%
Type-Token Ratio
3,070
Corpus Entries

Word Frequency List

144,351 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
171 In 1,195
0.05%
0.0518% 517.6
172 kasoo 1,192
0.05%
0.0516% 516.3
173 Soomaaliga 1,192
0.05%
0.0516% 516.3
174 walba 1,189
0.05%
0.0515% 515.0
175 nin 1,176
0.05%
0.0509% 509.4
176 baxay 1,171
0.05%
0.0507% 507.2
177 Ruushka 1,165
0.05%
0.0505% 504.6
178 iyada 1,158
0.05%
0.0502% 501.6
179 ahayn 1,157
0.05%
0.0501% 501.2
180 Soomaalida 1,150
0.05%
0.0498% 498.2

Top 10 Words

oo
62045
ka
52893
ku
46423
ah
42737
ay
42498
ee
37436
iyo
37181
in
34858
ayaa
29666
uu
29539