Filter Options

Reset
1,175,359
Total Tokens
72,305
Unique Types
6.15%
Type-Token Ratio
1,758
Corpus Entries

Word Frequency List

72,305 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
31 soomaaliya 3,635
0.31%
0.3093% 3,092.7
32 doc 3,512
0.30%
0.2988% 2,988.0
33 wuxuu 3,474
0.30%
0.2956% 2,955.7
34 dadka 3,460
0.29%
0.2944% 2,943.8
35 waa 3,355
0.29%
0.2854% 2,854.4
36 dalka 3,252
0.28%
0.2767% 2,766.8
37 ayuu 2,935
0.25%
0.2497% 2,497.1
38 aad 2,929
0.25%
0.2492% 2,492.0
39 qoraalka 2,826
0.24%
0.2404% 2,404.4
40 bbc 2,817
0.24%
0.2397% 2,396.7

By Language

LanguageEntries%
Somali 1,755
99.8%
somali 3
0.2%

Top 10 Words

oo
32678
ka
28876
ay
26627
ku
24569
ah
23345
ee
21302
in
21075
ayaa
20270
uu
15763
iyo
15577