This word list is based on the words found in the Esperanto Wikipedia dump of 2023-03-01. All words are reduced to their base form: the plural -j and the accusative -n are stripped, and the verb endings -as/-is/-os/-us/-u are replaced by the infinitive -i. Each word is listed in its most typical case form (lower-case, capitalized, or all-caps). Proper names that have not been Esperantized are mostly omitted, unless they are listed in common dictionaries. The corpus contains more than 43 million words in total.
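The reduction to base forms described above can be sketched roughly as follows. This is only an illustration of the stated rules, not the procedure actually used to build the list: the function name base_form and the INVARIANT exception set are hypothetical, and the exception set is far from complete. Pure suffix stripping mangles words such as "kaj" or "tamen", so a real implementation would most likely need a dictionary lookup.

```python
# Hypothetical, incomplete set of words whose final letters only look like
# grammatical endings and therefore must not be stripped.
INVARIANT = {
    "kaj", "tuj", "plej", "ajn", "en", "sen", "kun", "nun", "jen", "tamen",
    "kvin", "ĝis", "ĵus", "plus", "minus", "unu", "du", "plu", "ĉu", "nu",
    "ju", "kiu", "tiu", "iu", "ĉiu", "neniu",
}

# Finite verb endings that are mapped to the infinitive -i.
VERB_ENDINGS = ("as", "is", "os", "us", "u")


def base_form(word: str) -> str:
    """Naive reduction of an Esperanto word form to its base form."""
    if word in INVARIANT:
        return word
    stem = word
    # The accusative -n follows a vowel or the plural -j.
    if len(stem) > 2 and stem[-1] == "n" and stem[-2] in "aeiouj":
        stem = stem[:-1]
    # The plural -j follows -o, -a, or the -u of correlatives such as "kiuj".
    if len(stem) > 2 and stem[-1] == "j" and stem[-2] in "aou":
        stem = stem[:-1]
    # A word that carried a nominal ending, or that is now a known
    # invariant word, is not treated as a verb form.
    if stem != word or stem in INVARIANT:
        return stem
    for ending in VERB_ENDINGS:
        if stem.endswith(ending) and len(stem) > len(ending) + 1:
            return stem[: -len(ending)] + "i"
    return stem


print(base_form("hundojn"))  # hundo
print(base_form("estas"))    # esti
print(base_form("kiujn"))    # kiu
```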
The 100 most frequent words together cover 51.93% of the whole corpus.
The 200 most frequent words together cover 58.22% of the whole corpus.
The 300 most frequent words together cover 62.10% of the whole corpus.
The 400 most frequent words together cover 64.99% of the whole corpus.
The 500 most frequent words together cover 67.32% of the whole corpus.
The 1000 most frequent words together cover 74.73% of the whole corpus.
The 2000 most frequent words together cover 81.81% of the whole corpus.
The 3000 most frequent words together cover 85.60% of the whole corpus.
The 4000 most frequent words together cover 88.02% of the whole corpus.
The 5000 most frequent words together cover 89.73% of the whole corpus.
The 6000 most frequent words together cover 91.03% of the whole corpus.
The 7000 most frequent words together cover 92.06% of the whole corpus.
The 8000 most frequent words together cover 92.90% of the whole corpus.
The 9000 most frequent words together cover 93.59% of the whole corpus.
The 10000 most frequent words together cover 94.18% of the whole corpus.
The 11000 most frequent words together cover 94.69% of the whole corpus.
The 12000 most frequent words together cover 95.14% of the whole corpus.
The 13000 most frequent words together cover 95.53% of the whole corpus.
The 14000 most frequent words together cover 95.88% of the whole corpus.
The 15000 most frequent words together cover 96.19% of the whole corpus.
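The coverage percentages are cumulative: each figure is the share of all corpus tokens accounted for by the N most frequent base forms. A minimal sketch of that calculation, assuming the frequencies are available as a collections.Counter; the function name coverage and the toy counts below are made up for illustration and are not taken from the corpus:

```python
from collections import Counter


def coverage(counts: Counter, cutoffs: set[int]) -> dict[int, float]:
    """Return the cumulative corpus coverage (in percent) at each rank cutoff."""
    total = sum(counts.values())
    result = {}
    running = 0
    # most_common() yields (word, count) pairs from most to least frequent.
    for rank, (_, n) in enumerate(counts.most_common(), start=1):
        running += n
        if rank in cutoffs:
            result[rank] = 100.0 * running / total
    return result


# Toy frequencies, invented for this example only.
toy = Counter({"la": 50, "de": 25, "kaj": 15, "en": 6, "esti": 4})
print(coverage(toy, {1, 2, 5}))  # {1: 50.0, 2: 75.0, 5: 100.0}
```

Because word frequencies fall off steeply with rank, each additional block of words adds less coverage than the previous one, which matches the flattening of the figures above.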