Frequency Analysis

Reset
2,308,541
Total Tokens
144,351
Unique Types
6.25%
Type-Token Ratio
3,070
Corpus Entries

Word Frequency List

144,351 words
Rank Word Freq Distribution % Tokens Per 1M KWIC
471 Reuters 561
0.02%
0.0243% 243.0
472 an 560
0.02%
0.0243% 242.6
473 cid 559
0.02%
0.0242% 242.1
474 dibadda 559
0.02%
0.0242% 242.1
475 sidaas 558
0.02%
0.0242% 241.7
476 2022 558
0.02%
0.0242% 241.7
477 bilood 556
0.02%
0.0241% 240.8
478 xisbiga 556
0.02%
0.0241% 240.8
479 xogta 555
0.02%
0.0240% 240.4
480 Balse 555
0.02%
0.0240% 240.4

Top 10 Words

oo
62045
ka
52893
ku
46423
ah
42737
ay
42498
ee
37436
iyo
37181
in
34858
ayaa
29666
uu
29539