Frequency Analysis

Reset
1,175,359
Total Tokens
72,305
Unique Types
6.15%
Type-Token Ratio
1,758
Corpus Entries

Word Frequency List

72,305 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
321 wanaagsan 422
0.04%
0.0359% 359.0
322 degmada 421
0.04%
0.0358% 358.2
323 biden 421
0.04%
0.0358% 358.2
324 10 420
0.04%
0.0357% 357.3
325 aqalka 419
0.04%
0.0356% 356.5
326 taariikhda 418
0.04%
0.0356% 355.6
327 iyaga 417
0.04%
0.0355% 354.8
328 xiray 416
0.04%
0.0354% 353.9
329 xisbiga 415
0.04%
0.0353% 353.1
330 diyaaradaha 414
0.04%
0.0352% 352.2

By Language

LanguageEntries%
Somali 1,755
99.8%
somali 3
0.2%

Top 10 Words

oo
32678
ka
28876
ay
26627
ku
24569
ah
23345
ee
21302
in
21075
ayaa
20270
uu
15763
iyo
15577