Total Tokens: 1,175,359
Unique Types: 72,305
Type-Token Ratio: 6.15%
Corpus Entries: 1,758
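
The Type-Token Ratio reported here is the number of unique types divided by the total number of tokens, expressed as a percentage. A minimal sketch of that arithmetic, using the figures from this page (the function name `type_token_ratio` is illustrative and not taken from any particular tool):

```python
def type_token_ratio(unique_types: int, total_tokens: int) -> float:
    """Return the type-token ratio as a percentage."""
    if total_tokens <= 0:
        raise ValueError("total_tokens must be positive")
    return 100.0 * unique_types / total_tokens


# Figures reported in the corpus summary on this page.
ttr = type_token_ratio(72_305, 1_175_359)
print(f"{ttr:.2f}%")  # prints 6.15%
```

Because the ratio falls as a corpus grows, it is most useful for comparing corpora of similar size.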

Word Frequency List

72,305 words
Rank  Word     Freq    Distribution  % Tokens  Per 1M
11    la       14,119  1.20%         1.2013%   12,012.5
12    soo      11,020  0.94%         0.9376%   9,375.9
13    lagu     7,818   0.67%         0.6652%   6,651.6
14    waxaa    6,707   0.57%         0.5706%   5,706.3
15    sawirka  6,376   0.54%         0.5425%   5,424.7
16    ugu      5,943   0.51%         0.5056%   5,056.3
17    kale     5,864   0.50%         0.4989%   4,989.1
18    waxay    5,735   0.49%         0.4879%   4,879.4
19    sheegay  5,620   0.48%         0.4782%   4,781.5
20    si       5,367   0.46%         0.4566%   4,566.3
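
The "% Tokens" and "Per 1M" columns both derive from a word's raw frequency and the corpus size: the first is the word's share of all tokens as a percentage, the second scales that share to occurrences per million tokens. A small sketch of the calculation, assuming the total token count reported on this page (the name `relative_frequency` is hypothetical):

```python
TOTAL_TOKENS = 1_175_359  # corpus size reported on this page


def relative_frequency(freq: int, total_tokens: int = TOTAL_TOKENS) -> tuple:
    """Return (percent of all tokens, occurrences per million tokens)."""
    if total_tokens <= 0:
        raise ValueError("total_tokens must be positive")
    share = freq / total_tokens
    return 100.0 * share, 1_000_000 * share


# Raw frequency of "la" (rank 11) from the frequency list.
pct, per_million = relative_frequency(14_119)
print(f"{pct:.4f}%  {per_million:,.1f}")
```

Per-million values make frequencies comparable across corpora of different sizes, which raw counts do not.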

By Language

Language  Entries  %
Somali    1,755    99.8%
somali    3        0.2%

Top 10 Words

Word  Freq
oo    32,678
ka    28,876
ay    26,627
ku    24,569
ah    23,345
ee    21,302
in    21,075
ayaa  20,270
uu    15,763
iyo   15,577