Frequency Analysis

Reset
1,499,586
Total Tokens
77,734
Unique Types
5.18%
Type-Token Ratio
1,804
Corpus Entries

Word Frequency List

77,734 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
72431 advocate 1
0.00%
0.0001% 0.7
72432 recommending 1
0.00%
0.0001% 0.7
72433 disclose 1
0.00%
0.0001% 0.7
72434 argue 1
0.00%
0.0001% 0.7
72435 vacated 1
0.00%
0.0001% 0.7
72436 ranks 1
0.00%
0.0001% 0.7
72437 corrupt 1
0.00%
0.0001% 0.7
72438 successful 1
0.00%
0.0001% 0.7
72439 measures 1
0.00%
0.0001% 0.7
72440 file38160728 1
0.00%
0.0001% 0.7

By Language

LanguageEntries%
Somali 1,799
99.7%
somali 5
0.3%

Top 10 Words

oo
33789
ka
30109
font
27985
ay
27402
ku
25594
ah
24573
ee
22715
in
21768
ayaa
20556
iyo
16787