Filter Options

Reset
1,175,359
Total Tokens
72,305
Unique Types
6.15%
Type-Token Ratio
1,758
Corpus Entries

Word Frequency List

72,305 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
71 https 1,755
0.15%
0.1493% 1,493.2
72 balse 1,753
0.15%
0.1491% 1,491.5
73 gobolka 1,686
0.14%
0.1434% 1,434.5
74 www 1,628
0.14%
0.1385% 1,385.1
75 waxaan 1,625
0.14%
0.1383% 1,382.6
76 inta 1,618
0.14%
0.1377% 1,376.6
77 ciidamada 1,592
0.14%
0.1354% 1,354.5
78 hor 1,571
0.13%
0.1337% 1,336.6
79 waxaana 1,553
0.13%
0.1321% 1,321.3
80 marka 1,552
0.13%
0.1320% 1,320.4

By Language

LanguageEntries%
Somali 1,755
99.8%
somali 3
0.2%

Top 10 Words

oo
32678
ka
28876
ay
26627
ku
24569
ah
23345
ee
21302
in
21075
ayaa
20270
uu
15763
iyo
15577