Frequency Analysis

Reset
2,121,045
Total Tokens
139,714
Unique Types
6.59%
Type-Token Ratio
2,349
Corpus Entries

Word Frequency List

139,714 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
109411 yaridiisa 1
0.00%
0.0000% 0.5
109412 PuntlandSaciid 1
0.00%
0.0000% 0.5
109413 Isimadii 1
0.00%
0.0000% 0.5
109414 mudaneyaashii 1
0.00%
0.0000% 0.5
109415 geynteeda 1
0.00%
0.0000% 0.5
109416 Dhaashane 1
0.00%
0.0000% 0.5
109417 qaaqlaha 1
0.00%
0.0000% 0.5
109418 Geesdiir 1
0.00%
0.0000% 0.5
109419 gambasho 1
0.00%
0.0000% 0.5
109420 hortaagay 1
0.00%
0.0000% 0.5

Top 10 Words

oo
57059
ka
49478
ku
42888
ay
39431
ah
39391
iyo
35310
ee
34411
in
31363
uu
27833
ayaa
26240