The English Consistent Confusion Corpus

A corpus of misperceptions in English under varying noise conditions

Authors: Ricard Marxer, Jon Barker, Martin Cooke
Updated: 2016-04-30
Source: https://spandh.dcs.shef.ac.uk//confusion_corpus/
Keywords: English, Noise, Misperception, Corpus

Data sharing: fMRI timecourses / story-listening

FMRI data from hearing readings of Alice in Wonderland

Authors: Jonathan R.Brennan, Edward P.Stabler, Sarah E.Van Wagenen, Wen-Ming Luh, John T.Haled
Updated: 2016-04-30
Source: https://sites.lsa.umich.edu/cnllab/2016/06/11/data-sharing-fmri-timecourses-story-listening/
Keywords: neuroscience, FMRI, story telling, naturalistic

HelexKids

A child language corpus of Greek

Authors: Aris Terzopoulos, Dr. Lynne Duncan, Dr. Georgia Niolaki, Prof. Jackie Masterson, Mark Wilson
Updated: 2016-04-30
Source: https://www.helexkids.org/home
Keywords: Greek, Child Language, Corpus

LSE-Sign

The LSE-Sign database is a free online tool for selecting Spanish Sign Language stimulus materials to be used in experiments.

Authors: Eva Gutierrez-Sigut, Brendan Costello, Cristina Baus, Manuel Carreiras
Updated: 2016-04-30
Source: http://lse-sign.bcbl.eu/web-busqueda/
Keywords: psycholinguistics, phonetics, database, Spanish-Sign-Language, Spanish

Maninka Reference Corpus

A corpus of Maninka, a Mandinka language spoken in Guinea.

Authors: Valentin Vydrin, Andrij Rovenchak & Kirill Maslinsky
Updated: 2016-04-30
Source: http://cormand.huma-num.fr/cormani/index.html
Keywords: Maninka, Corpus, Corpora Mandeica

Penn Parsed Corpora of Historical English

Authors: Anthony Kroch
Updated: 2016-04-30
Source: https://www.ling.upenn.edu/hist-corpora/
Keywords: English, Middle English, Corpus, Corpora

The Sino-Tibetan Etymological Dictionary and Thesaurus

Authors: Dr. John B. Lowe, Dr. Liberty Lidz, Dr. Kenneth VanBik, Dr. David Mortensen, Dr. Dominic Yu, Dr. Daniel Bruhn, Dr. Chundra Cathcart, Dr. David Solnit
Updated: 2016-04-30
Source: https://stedt.berkeley.edu/
Keywords: Etymology, Dictionary, Etymological Dictionary, Sino-Tibetean, Southeast Asia, Chinese, Tibetan, Burmese

The Hansard Corpus

A corpus of the speeches given in the British Parliament from 1803-2005.

Authors: Marc Alexander, Fraser Dallachy, Stephen Wattam, Paul Rayson, Mark Davies
Updated: 2016-04-30
Source: https://www.english-corpora.org/hansard/
Keywords: English, semantics, language, linguistics, corpora, collocates, word-frequency

HomeBank

A study through automatic speech recognition of untranscribed daylong recordings in the home and elsewhere.

Authors: Brian MacWhinney, Anne Warlaumont, Mark VanDam
Updated: 2015-10-30
Source: https://homebank.talkbank.org/
Keywords: speech-recognition, recordings, conversational-interaction

CogSci2016

Self-paced reading data on Dutch sentences (Dutch native speakers) and English sentences (Dutch and German native speakers).

Authors: Stefan L. Frank, Thijs Trompenaars, Shravan Vasishth
Updated: 2015-05-06
Source: https://onlinelibrary.wiley.com/action/downloadSupplement?doi=10.1111%2Fcogs.12247&file=cogs12247-sup-0001-DataS1.zip
Keywords: language, reading, cross-linguistic, Dutch, English, German