Frequency Analysis

Reset
1,175,359
Total Tokens
72,305
Unique Types
6.15%
Type-Token Ratio
1,758
Corpus Entries

Word Frequency List

72,305 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
171 iska 761
0.06%
0.0647% 647.5
172 kasoo 760
0.06%
0.0647% 646.6
173 axmed 757
0.06%
0.0644% 644.1
174 federaalka 747
0.06%
0.0636% 635.6
175 xasan 747
0.06%
0.0636% 635.6
176 jiraan 743
0.06%
0.0632% 632.1
177 dilay 742
0.06%
0.0631% 631.3
178 taas 737
0.06%
0.0627% 627.0
179 baxay 727
0.06%
0.0619% 618.5
180 duwan 709
0.06%
0.0603% 603.2

By Language

LanguageEntries%
Somali 1,755
99.8%
somali 3
0.2%

Top 10 Words

oo
32678
ka
28876
ay
26627
ku
24569
ah
23345
ee
21302
in
21075
ayaa
20270
uu
15763
iyo
15577