CMUSphinx4

CMU Sphinx is a set of speech recognition development libraries and tools that can be linked in to speech-enable applications.

Authors: Evandro Gouvea, Peter Gorniak, Philip Kwok, Paul Lamere, Beth Logan, Pedro Moreno, Bhiksha Raj, Mosur Ravishankar, Bent Schmidt-Nielsen, Rita Singh, JM Van Thong, Willie Walker, Manfred Warmuth, Joe Woelfel, Peter Wolf
Updated: 2019-10-23
Source: https://cmusphinx.github.io/
Keywords: speech, programming, Java, experiment, English, French, Mandarin, German, Dutch, Russian

PredPsych

PredPsych is a user-friendly toolbox based on machine learning predictive algorithms. It comprises of multiple functionalities for multivariate analyses of quantitative behavioral data based on machine learning models.

Authors: Atesh Koul
Updated: 2019-07-23
Source: https://CRAN.R-project.org/package=PredPsych ; https://github.com/ateshkoul/PredPsych
Keywords: psychology, experiment, programming, neuroscience, English

Normative Data on Dutch Idiomatic Expressions

Normative data of 374 Dutch idiomatic expressions by 390 native speakers.

Authors: Ferdy Hubers, Catia Cucchiarini, Helmer Strik, Ton Dijkstra
Updated: 2019-05-14
Source: https://www.ru.nl/cls/publications/corpora/
Keywords: language, speech, English, Dutch

Auditory English Lexicon Project

The Auditory English Lexicon Project (AELP) is a multi-talker, multi-region psycholinguistic database of 10,170 spoken words and 10,170 spoken nonwords.

Authors: Winston D. Goh, Melvin J. Yap, Qian Wen Chee
Updated: 2019-04-30
Source: https://inetapps.nus.edu.sg/aelp/
Keywords: psycholinguistics, database, lexicon, audition, semantics, English

Large database of English compounds

A large database of English two-constituent compounds.

Authors: Christina Gagne, Thomas Spalding, Daniel Schmidtke
Updated: 2019-04-30
Source: https://era.library.ualberta.ca/items/dc3b9033-14d0-48d7-b6fa-6398a30e61e4
Keywords: psycholinguistics, compounds, pragmatics, English

Macroscope

Authors: Li Ying, Tomas Engelthalar, Cynthia Siew, Thomas Hills
Updated: 2019-04-30
Source: http://macroscope.intelligence-media.com/index.html
Keywords: Corpus, Psycholinguistics, English, Computational Linguistics, Distributional Semantics

subs2vec

Python 3.7 resources to evaluate bigram and trigram frequencies in corpora.

Authors: Jeroen van Paridon, Bill Thompson
Updated: 2019-04-30
Source: https://github.com/jvparidon/subs2vec
Keywords: language, bigram, trigram, lexical norms, psycholinguistics, Afrikaans, Arabic, Bulgarian, Bengali, Breton, Bosnian, Catalan, Czech, Danish, German, Greek, English, Esperanto, Spanish, Estonian, Basque, Farsi, Finnish, French, Galician, Hebrew, Hindi, Croatian, Hungarian, Armenian, Indonesian, Icelandic, Italian, Georgian, Kazakh, Korean, Lithuanian, Latvian, Macedonian, Malayalam, Malay, Dutch, Norwegian, Polish, Portuguese, Romanian, Russian, Sinhala, Slovak, Slovenian, Albanian, Serbian, Swedish, Tamil, Telugu, Tagalog, Turkish, Ukranian, Urdu, Vietnamese

Model-Matched Sounds

Cochleograms and sound files are shown for example stimuli from the model-matching experiment.

Authors: Sam V. Norman-Haignere & Josh H. McDermott
Updated: 2018-12-03
Source: http://mcdermottlab.mit.edu/svnh/model-matching/Stimuli_from_Model-Matching_Experiment.html
Keywords: audition, sensory, auditory-cortex, neuroscience, English

EEG Datasets for Naturalistic Listening to "Alice in Wonderland"

EEG datasets of listeners of the story "Alice in Wonderland"

Authors: Jonathan R. Brennan, John T. Hale
Updated: 2018-11-30
Source: https://deepblue.lib.umich.edu/data/concern/data_sets/bg257f92t
Keywords: Naturalistic, Listening, Predictions, EEG, English

Arizona Child Acoustic Database

The Arizona Child Acoustic Database is a longitudinal collection of audio samples from children between the ages of 2-7 years.

Authors: Kate Bunton and Brad Story
Updated: 2018-06-20
Source: https://repository.arizona.edu/handle/10150/316065
Keywords: language, speech-development, acoustics, linguistics, English