Frequency Analysis

Reset
1,175,359
Total Tokens
72,305
Unique Types
6.15%
Type-Token Ratio
1,758
Corpus Entries

Word Frequency List

72,305 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
441 shaqada 306
0.03%
0.0260% 260.3
442 jirka 306
0.03%
0.0260% 260.3
443 covid 304
0.03%
0.0259% 258.6
444 ammaanka 304
0.03%
0.0259% 258.6
445 maalmood 304
0.03%
0.0259% 258.6
446 ninka 303
0.03%
0.0258% 257.8
447 lacag 302
0.03%
0.0257% 256.9
448 wali 302
0.03%
0.0257% 256.9
449 arrintaas 302
0.03%
0.0257% 256.9
450 ahaayeen 301
0.03%
0.0256% 256.1

By Language

LanguageEntries%
Somali 1,755
99.8%
somali 3
0.2%

Top 10 Words

oo
32678
ka
28876
ay
26627
ku
24569
ah
23345
ee
21302
in
21075
ayaa
20270
uu
15763
iyo
15577