Speech and Language Resource Bank
CMU Sphinx is a set of speech recognition development libraries and tools that can be linked in to speech-enable applications.
Authors: Evandro Gouvea,
Peter Gorniak,
Philip Kwok,
Paul Lamere,
Beth Logan,
Pedro Moreno,
Bhiksha Raj,
Mosur Ravishankar,
Bent Schmidt-Nielsen,
Rita Singh,
JM Van Thong,
Willie Walker,
Manfred Warmuth,
Joe Woelfel,
Peter Wolf
Updated: 2019-10-23
Source: https://cmusphinx.github.io/
Keywords: speech,
programming,
Java,
experiment,
English,
French,
Mandarin,
German,
Dutch,
Russian
The Auditory English Lexicon Project (AELP) is a multi-talker, multi-region psycholinguistic database of 10,170 spoken words and 10,170 spoken nonwords.
Authors: Winston D. Goh,
Melvin J. Yap,
Qian Wen Chee
Updated: 2019-04-30
Source: https://inetapps.nus.edu.sg/aelp/
Keywords: psycholinguistics,
database,
lexicon,
audition,
semantics,
English
Python 3.7 resources to evaluate bigram and trigram frequencies in corpora.
Authors: Jeroen van Paridon,
Bill Thompson
Updated: 2019-04-30
Source: https://github.com/jvparidon/subs2vec
Keywords: language,
bigram,
trigram,
lexical norms,
psycholinguistics,
Afrikaans,
Arabic,
Bulgarian,
Bengali,
Breton,
Bosnian,
Catalan,
Czech,
Danish,
German,
Greek,
English,
Esperanto,
Spanish,
Estonian,
Basque,
Farsi,
Finnish,
French,
Galician,
Hebrew,
Hindi,
Croatian,
Hungarian,
Armenian,
Indonesian,
Icelandic,
Italian,
Georgian,
Kazakh,
Korean,
Lithuanian,
Latvian,
Macedonian,
Malayalam,
Malay,
Dutch,
Norwegian,
Polish,
Portuguese,
Romanian,
Russian,
Sinhala,
Slovak,
Slovenian,
Albanian,
Serbian,
Swedish,
Tamil,
Telugu,
Tagalog,
Turkish,
Ukranian,
Urdu,
Vietnamese