Frequency Analysis

Reset
1,499,586
Total Tokens
77,734
Unique Types
5.18%
Type-Token Ratio
1,804
Corpus Entries

Word Frequency List

77,734 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
75511 3104248046875pt 1
0.00%
0.0001% 0.7
75512 65567016601562pt 1
0.00%
0.0001% 0.7
75513 44598388671875pt 1
0.00%
0.0001% 0.7
75514 6240005493164062pt 1
0.00%
0.0001% 0.7
75515 matilaa 1
0.00%
0.0001% 0.7
75516 94586181640625pt 1
0.00%
0.0001% 0.7
75517 cutubka15aad 1
0.00%
0.0001% 0.7
75518 3939208984375pt 1
0.00%
0.0001% 0.7
75519 baaraandegidda 1
0.00%
0.0001% 0.7
75520 igmashada 1
0.00%
0.0001% 0.7

By Language

LanguageEntries%
Somali 1,799
99.7%
somali 5
0.3%

Top 10 Words

oo
33789
ka
30109
font
27985
ay
27402
ku
25594
ah
24573
ee
22715
in
21768
ayaa
20556
iyo
16787