Frequency Analysis

Reset
2,308,541
Total Tokens
144,351
Unique Types
6.25%
Type-Token Ratio
3,070
Corpus Entries

Word Frequency List

144,351 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
201 kamid 1,085
0.05%
0.0470% 470.0
202 laakiin 1,082
0.05%
0.0469% 468.7
203 ayaan 1,075
0.05%
0.0466% 465.7
204 Haddii 1,069
0.05%
0.0463% 463.1
205 dagaalka 1,061
0.05%
0.0460% 459.6
206 wanaagsan 1,050
0.05%
0.0455% 454.8
207 Marka 1,049
0.05%
0.0454% 454.4
208 doona 1,048
0.05%
0.0454% 454.0
209 gudaha 1,045
0.05%
0.0453% 452.7
210 Sida 1,043
0.05%
0.0452% 451.8

Top 10 Words

oo
62045
ka
52893
ku
46423
ah
42737
ay
42498
ee
37436
iyo
37181
in
34858
ayaa
29666
uu
29539