The Humanities Institute has purchased four specialized linguistic corpora, exclusively available for faculty members with approved access:

  • Corpus del Español
  • Corpus of Contemporary American Literature
  • Corpus of Historical American English
  • News on the Web

These corpora serve as invaluable resources for studying language patterns, usage, and evolution. They can be used to analyze syntax, semantics, phonetics, and other linguistic features; aid in the development of language models and educational tools; and more.

These corpora were originally created by Mark Davies (https://corpusdata.org). Please use the original citation when referring to the data; this has been provided to you in the "Recommended Citation" field.

The Humanities Institute gratefully acknowledges the support of the College of Arts and Sciences in purchasing these resources.

Please note that these corpora are licensed for faculty research uses only. They should not be used for teaching or by undergraduate students. Full license and restriction information can be viewed on the corpora homepage: https://www.corpusdata.org/restrictions.asp.

If you are a faculty member seeking access, please contact the Repository staff for more information.

Follow

Browse the Corpora Collections:

Corpus del Español

Corpus of Contemporary American English

Corpus of Historical American English

Corpus of News on the Web