Frequency Analysis

Reset
2,308,541
Total Tokens
144,351
Unique Types
6.25%
Type-Token Ratio
3,070
Corpus Entries

Word Frequency List

144,351 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
11 la 25,987
1.13%
1.1257% 11,256.9
12 soo 21,672
0.94%
0.9388% 9,387.7
13 nbsp 17,101
0.74%
0.7408% 7,407.7
14 lagu 14,121
0.61%
0.6117% 6,116.9
15 aan 11,477
0.50%
0.4972% 4,971.5
16 kale 10,940
0.47%
0.4739% 4,738.9
17 si 10,872
0.47%
0.4709% 4,709.5
18 ugu 10,746
0.47%
0.4655% 4,654.9
19 waxaa 10,608
0.46%
0.4595% 4,595.1
20 mid 10,489
0.45%
0.4544% 4,543.6

Top 10 Words

oo
62045
ka
52893
ku
46423
ah
42737
ay
42498
ee
37436
iyo
37181
in
34858
ayaa
29666
uu
29539