Frequency Analysis

Reset
1,175,359
Total Tokens
72,305
Unique Types
6.15%
Type-Token Ratio
1,758
Corpus Entries

Word Frequency List

72,305 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
421 2022 323
0.03%
0.0275% 274.8
422 luulyo 323
0.03%
0.0275% 274.8
423 jirta 322
0.03%
0.0274% 274.0
424 siyaasadeed 322
0.03%
0.0274% 274.0
425 kan 321
0.03%
0.0273% 273.1
426 waana 317
0.03%
0.0270% 269.7
427 15 316
0.03%
0.0269% 268.9
428 abriil 316
0.03%
0.0269% 268.9
429 hadal 315
0.03%
0.0268% 268.0
430 oktoobar 314
0.03%
0.0267% 267.2

By Language

LanguageEntries%
Somali 1,755
99.8%
somali 3
0.2%

Top 10 Words

oo
32678
ka
28876
ay
26627
ku
24569
ah
23345
ee
21302
in
21075
ayaa
20270
uu
15763
iyo
15577