Filter Options

Reset
1,175,359
Total Tokens
72,305
Unique Types
6.15%
Type-Token Ratio
1,758
Corpus Entries

Word Frequency List

72,305 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
61 id 1,862
0.16%
0.1584% 1,584.2
62 isku 1,831
0.16%
0.1558% 1,557.8
63 sheegtay 1,787
0.15%
0.1520% 1,520.4
64 file 1,761
0.15%
0.1498% 1,498.3
65 al 1,757
0.15%
0.1495% 1,494.9
66 imported 1,755
0.15%
0.1493% 1,493.2
67 filename 1,755
0.15%
0.1493% 1,493.2
68 html 1,755
0.15%
0.1493% 1,493.2
69 parent_folder 1,755
0.15%
0.1493% 1,493.2
70 url 1,755
0.15%
0.1493% 1,493.2

By Language

LanguageEntries%
Somali 1,755
99.8%
somali 3
0.2%

Top 10 Words

oo
32678
ka
28876
ay
26627
ku
24569
ah
23345
ee
21302
in
21075
ayaa
20270
uu
15763
iyo
15577