open  

Corpus of Contemporary American English

The Corpus of Contemporary American English (COCA) is the only large, genre-balanced corpus of American English.

Authors:  Mark Davies
Updated:  2020-03-18
Source:  https://www.english-corpora.org/coca/
Keywords:  English, corpora, linguistics, frequency-data, word-form

open  

The Hansard Corpus

A corpus of the speeches given in the British Parliament from 1803 to 2005.

Authors:  Marc Alexander, Fraser Dallachy, Stephen Wattam, Paul Rayson, Mark Davies
Updated:  2016-02-18
Source:  https://www.english-corpora.org/hansard/
Keywords:  English, semantics, language, linguistics, corpora, collocates, word-frequency

open  

Strathy Corpus of Canadian English

The Strathy Corpus of Canadian English is a product of the Strathy Language Unit at Queen's University. The corpus contains 50 million words from more than 1,100 texts spanning spoken, fiction, magazine, newspaper, and academic genres.

Authors:  W.C. Lougheed
Updated:  2010-02-18
Source:  https://www.english-corpora.org/can/
Keywords:  English, linguistics, corpora, language

open   documented  

Corpus of Historical American English

The Corpus of Historical American English (COHA) is the largest structured corpus of historical English.

Authors:  Mark Davies
Updated:  2010-02-18
Source:  https://www.english-corpora.org/coha/
Keywords:  historical-English, genre, corpora, linguistics, syntax

open  

British National Corpus

The British National Corpus (BNC) was originally created by Oxford University Press in the 1980s and early 1990s, and it contains 100 million words of text from a wide range of genres (e.g. spoken, fiction, magazines, newspapers, and academic).

Authors:  Oxford University Press
Updated:  2007-02-18
Source:  https://www.english-corpora.org/bnc/
Keywords:  English, linguistics, corpora, database, written-text, speech