Section outline

  • Here you'll find a selection of the most common English corpora. The corpora are grouped into four categories. Each subpage features short descriptions, links and other useful information. If you're looking for a specific corpus or need help, please contact Fabian Vetter.

    • The corpora in this group

      • usually sample texts from many different registers in order to represent the language as it is used in one national variety.
      • are synchronic, i.e. contain only material from one specific point in time
      • contain present day English
    • Old, Middle and Early Modern English corpora.
    • Specialised spoken corpora.

    • Specialised written corpora. Highlights: ICLE (International Corpus of Learner English), Oxford Text Archive (OTA), TIME Magazine Corpus of American English.