User:Prosfilaes/Esperanto corpus

Introduction to the Corpus

This is a corpus of all the Esperanto material I found on Project Gutenberg on January 15th, 2011. I converted every text to lowercase, turned a trailing "as", "is", "os", "us", or "u" into "i", and removed a trailing "jn", "j", or "n". (This introduced a couple of errors in prepositions and the like: tamen, nun, kun, jen, en, etc. all lost their final n, and correlatives lost their trailing u.) Many of the texts are not independent by CFI standards: several books were translated by Edwin Grobe, several others are translations of Ibsen by Odd Tangerud, and a number are issues of the Esperantist. There is probably a little English or French left in, though I took care to strip the English parts of the Esperantist. Ultimately, this list must be used with a certain degree of care.
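The normalization described above could be sketched roughly as follows. This is not the script that was actually used; the function name `normalize` and the choice to apply only one rule per word (verb endings first, otherwise the plural/accusative endings) are assumptions made for illustration, picked because they reproduce the side effects mentioned above (nun becomes nu, kiu becomes kii).

```python
import re

# Hypothetical patterns; the rule order here is an assumption, not the original script.
VERB_ENDING = re.compile(r"(as|is|os|us|u)$")   # finite verb endings
NOMINAL_ENDING = re.compile(r"(jn|j|n)$")       # plural and accusative endings


def normalize(word):
    """Lowercase a word and reduce it to a rough dictionary form."""
    word = word.lower()
    # A trailing verb ending is replaced by the infinitive "i".
    if VERB_ENDING.search(word):
        return VERB_ENDING.sub("i", word)
    # Otherwise strip a trailing plural and/or accusative ending.
    return NOMINAL_ENDING.sub("", word)


for w in ["Parolas", "librojn", "tamen", "nun", "kiu"]:
    print(w, "->", normalize(w))
```

Because the rules look only at the last letters, they cannot tell an accusative n from the final n of a particle such as tamen, which is where the errors noted above come from.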

There were 49,249 "words" found at least once, and 18,476 found at least twice. The numbered links are labeled by PG numbers, and link back to the book on PG's servers.
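Figures like these can be obtained with a simple frequency count. The sketch below is only an illustration under assumptions: the function name `count_thresholds` and the sample token list are made up, and it counts individual occurrences, which may not be exactly how the numbers above were tallied.

```python
from collections import Counter


def count_thresholds(tokens):
    """Return (distinct words seen at least once, distinct words seen at least twice)."""
    counts = Counter(tokens)
    at_least_once = len(counts)
    at_least_twice = sum(1 for c in counts.values() if c >= 2)
    return at_least_once, at_least_twice


# Toy data for illustration only, not taken from the corpus.
sample = ["esti", "la", "esti", "terura", "la", "kaj"]
print(count_thresholds(sample))
```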

The first few red links were errors, but the first real missing word found is terura, with 60 occurrences in our small corpus.

These pages tend to get larger as you proceed to the bottom; for both technical and non-technical reasons, the first page has around 150 words and /4-5 has almost 3,000, which is why /4-5 is the cutoff point.

If you check pages, feel free to leave notes at the top of that page and here.

Corpus

/66-64 -- All blue, believed to all link to Esperanto definitions

/63-57 -- All blue, believed to all link to Esperanto definitions

/56-50 -- All blue, believed to all link to Esperanto definitions

/49-45 -- All blue, believed to all link to Esperanto definitions

/44-40 -- All blue, believed to all link to Esperanto definitions

/39-35 -- All blue, believed to all link to Esperanto definitions

/34-30 -- All blue, believed to all link to Esperanto definitions

/29-25 -- All blue, believed to all link to Esperanto definitions

/24-20 -- All blue, believed to all link to Esperanto definitions

/19-17 -- All blue, believed to all link to Esperanto definitions

/16-14 -- All blue, believed to all link to Esperanto definitions

/13-11

/8-10

/6-7

/4-5