Frequency Analysis

Reset
2,308,541
Total Tokens
144,351
Unique Types
6.25%
Type-Token Ratio
3,070
Corpus Entries

Word Frequency List

144,351 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
211 socda 1,027
0.04%
0.0445% 444.9
212 faa 1,026
0.04%
0.0444% 444.4
213 doorashada 1,025
0.04%
0.0444% 444.0
214 adag 1,023
0.04%
0.0443% 443.1
215 buugga 1,022
0.04%
0.0443% 442.7
216 Xasan 1,014
0.04%
0.0439% 439.2
217 000 1,012
0.04%
0.0438% 438.4
218 jirto 1,007
0.04%
0.0436% 436.2
219 Af 1,007
0.04%
0.0436% 436.2
220 jir 999
0.04%
0.0433% 432.7

Top 10 Words

oo
62045
ka
52893
ku
46423
ah
42737
ay
42498
ee
37436
iyo
37181
in
34858
ayaa
29666
uu
29539