Frequency Analysis

Reset
1,499,586
Total Tokens
77,734
Unique Types
5.18%
Type-Token Ratio
1,804
Corpus Entries

Word Frequency List

77,734 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
72981 file38161500 1
0.00%
0.0001% 0.7
72982 salaysantahay 1
0.00%
0.0001% 0.7
72983 moosaa 1
0.00%
0.0001% 0.7
72984 galiyaa 1
0.00%
0.0001% 0.7
72985 leexin 1
0.00%
0.0001% 0.7
72986 mawjaddan 1
0.00%
0.0001% 0.7
72987 qalqaalinayaa 1
0.00%
0.0001% 0.7
72988 badalasho 1
0.00%
0.0001% 0.7
72989 boobsiiska 1
0.00%
0.0001% 0.7
72990 daahista 1
0.00%
0.0001% 0.7

By Language

LanguageEntries%
Somali 1,799
99.7%
somali 5
0.3%

Top 10 Words

oo
33789
ka
30109
font
27985
ay
27402
ku
25594
ah
24573
ee
22715
in
21768
ayaa
20556
iyo
16787