Frequency Analysis

Total Tokens: 1,180,353
Unique Types: 72,703
Type-Token Ratio: 6.16%
Corpus Entries: 1,759
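
The Type-Token Ratio above appears to be the number of unique types divided by the total number of tokens, expressed as a percentage. The following is a minimal sketch of that arithmetic; the function name `type_token_ratio` is illustrative and not taken from the tool that produced these figures.

```python
def type_token_ratio(unique_types: int, total_tokens: int) -> float:
    """Return the type-token ratio as a percentage (assumed definition)."""
    if total_tokens <= 0:
        raise ValueError("total_tokens must be positive")
    return 100.0 * unique_types / total_tokens


# Figures from the summary above.
ttr = type_token_ratio(72_703, 1_180_353)
print(f"{ttr:.2f}%")  # 6.16%
```

Under this definition the reported 6.16% is reproduced from the two counts shown.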

Word Frequency List

72,703 words
Rank  Word         Freq  Distribution  % Tokens  Per 1M
481   dhinteen     286   0.02%         0.0242%   242.3
482   taliyaha     285   0.02%         0.0241%   241.5
483   xubnaha      285   0.02%         0.0241%   241.5
484   cabdullaahi  284   0.02%         0.0241%   240.6
485   xigeenka     284   0.02%         0.0241%   240.6
486   isbitaalka   284   0.02%         0.0241%   240.6
487   febraayo     284   0.02%         0.0241%   240.6
488   xildhibaan   283   0.02%         0.0240%   239.8
489   kooxaha      283   0.02%         0.0240%   239.8
490   kalena       283   0.02%         0.0240%   239.8
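
The "% Tokens" and "Per 1M" columns are consistent with normalizing each raw frequency by the total token count of 1,180,353. The sketch below shows that normalization under this assumption; the helper names `percent_of_tokens` and `per_million` are hypothetical.

```python
TOTAL_TOKENS = 1_180_353  # total tokens reported for the corpus


def percent_of_tokens(freq: int, total: int = TOTAL_TOKENS) -> float:
    """Share of all tokens accounted for by one word, in percent."""
    return 100.0 * freq / total


def per_million(freq: int, total: int = TOTAL_TOKENS) -> float:
    """Frequency normalized to occurrences per one million tokens."""
    return 1_000_000 * freq / total


# Two rows from the word frequency list.
for word, freq in [("dhinteen", 286), ("kalena", 283)]:
    print(f"{word}: {percent_of_tokens(freq):.4f}%  {per_million(freq):.1f}")
```

Per-million values make frequencies comparable across corpora of different sizes, which raw counts alone do not.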

By Language

Language  Entries  %
Somali    1,755    99.8%
somali    4        0.2%

Top 10 Words

oo    32798
ka    28967
ay    26682
ku    24670
ah    23430
ee    21367
in    21157
ayaa  20291
uu    15813
iyo   15700