Frequency Analysis

Total Tokens: 1,468,127
Unique Types: 87,969
Type-Token Ratio: 5.99%
Corpus Entries: 2,304
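
The Type-Token Ratio shown above is consistent with dividing the number of unique types by the total number of tokens. The following is a minimal sketch under that assumption; the function name `type_token_ratio` is hypothetical and is not taken from the tool that produced these figures.

```python
def type_token_ratio(unique_types: int, total_tokens: int) -> float:
    """Return the type-token ratio as a percentage.

    Assumption: TTR = unique types / total tokens, with no
    length normalization or windowing applied.
    """
    if total_tokens <= 0:
        raise ValueError("total_tokens must be positive")
    return 100.0 * unique_types / total_tokens


# Figures taken from the summary above.
ttr = type_token_ratio(87_969, 1_468_127)
print(f"{ttr:.2f}%")  # prints "5.99%"
```

Because the raw ratio falls as a corpus grows, a value like this is mainly comparable against corpora of similar size.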

Word Frequency List

87,969 words
Rank   Word          Freq  Distribution  % Tokens  Per 1M
86691  cayntiisa     1     0.00%         0.0001%   0.7
86692  cayntiisu     1     0.00%         0.0001%   0.7
86693  dhaddigna     1     0.00%         0.0001%   0.7
86694  gaarigacanka  1     0.00%         0.0001%   0.7
86695  jaanqaadaa    1     0.00%         0.0001%   0.7
86696  nominal       1     0.00%         0.0001%   0.7
86697  gaarigacanku  1     0.00%         0.0001%   0.7
86698  gaarigacantu  1     0.00%         0.0001%   0.7
86699  jabine        1     0.00%         0.0001%   0.7
86700  qodobkuba     1     0.00%         0.0001%   0.7
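
The "% Tokens" and "Per 1M" values in the list above match a simple normalization of each word's raw frequency by the total token count (1,468,127). The sketch below assumes exactly that relationship and the rounding shown on the page; the function name `relative_frequency` is hypothetical.

```python
def relative_frequency(freq: int, total_tokens: int) -> tuple[float, float]:
    """Return (percent of tokens, occurrences per million tokens).

    Assumption: both columns are the raw frequency divided by the
    total token count, scaled by 100 and by 1,000,000 respectively.
    """
    if total_tokens <= 0:
        raise ValueError("total_tokens must be positive")
    share = freq / total_tokens
    return share * 100.0, share * 1_000_000


# A word that occurs once in the 1,468,127-token corpus.
pct, per_million = relative_frequency(1, 1_468_127)
print(f"{pct:.4f}% {per_million:.1f}")  # prints "0.0001% 0.7"
```

Per-million values are useful for comparing word frequencies across corpora of different sizes, since they do not depend on the total token count.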

By Language

Language  Entries  %
Somali    2,296    99.7%
somali    8        0.3%

Top 10 Words

Word  Freq
oo    40973
ka    36465
ay    31334
ku    30747
ah    28403
ee    26549
in    24639
iyo   23193
ayaa  23062
uu    19787