Speech and Language Resource Bank
A corpus of European Parlimentary speech and tools for machine learning models.
Authors: Chanhan Wang,
Morgane Riviere,
Ann Lee,
Anne Wu,
Chaitanya Talnikar,
Daniel Haziza,
Mary WIlliamson,
Juan Pino,
Emmanuel Dupoux
Updated: 2021-04-30
Source: https://aclanthology.org/2021.acl-long.80/
Keywords: English,
German,
French,
Spanish,
Polish,
Italian,
Romanian,
Hungarian,
Czech,
Dutch,
Finnish,
Slovak,
Slovenian,
Estonian,
Lithuanian,
Portuguese,
Bulgarian,
Greek,
Latvian,
Maltese,
Swedish,
Danish,
speech synthesis,
machine learning,
Accented Speech
The ERT is a computerized task to assess the perception of facial expressions. The task presents morphed facial expressions that gradually increase in intensity.
Authors: Barbara Montagne,
Roy Kessels,
David Perrett,
Edward de Haan
Updated: 2020-04-30
Source: https://www.emotionrecognitiontask.com/
Keywords: emotion,
psychology,
neuropsychology,
cognition,
Dutch,
English,
German,
French,
Spanish,
Finnish,
Italian,
Russian,
Lithuanian,
Greek,
Portuguese,
Turkish
CMU Sphinx is a set of speech recognition development libraries and tools that can be linked in to speech-enable applications.
Authors: Evandro Gouvea,
Peter Gorniak,
Philip Kwok,
Paul Lamere,
Beth Logan,
Pedro Moreno,
Bhiksha Raj,
Mosur Ravishankar,
Bent Schmidt-Nielsen,
Rita Singh,
JM Van Thong,
Willie Walker,
Manfred Warmuth,
Joe Woelfel,
Peter Wolf
Updated: 2019-10-23
Source: https://cmusphinx.github.io/
Keywords: speech,
programming,
Java,
experiment,
English,
French,
Mandarin,
German,
Dutch,
Russian
Python 3.7 resources to evaluate bigram and trigram frequencies in corpora.
Authors: Jeroen van Paridon,
Bill Thompson
Updated: 2019-04-30
Source: https://github.com/jvparidon/subs2vec
Keywords: language,
bigram,
trigram,
lexical norms,
psycholinguistics,
Afrikaans,
Arabic,
Bulgarian,
Bengali,
Breton,
Bosnian,
Catalan,
Czech,
Danish,
German,
Greek,
English,
Esperanto,
Spanish,
Estonian,
Basque,
Farsi,
Finnish,
French,
Galician,
Hebrew,
Hindi,
Croatian,
Hungarian,
Armenian,
Indonesian,
Icelandic,
Italian,
Georgian,
Kazakh,
Korean,
Lithuanian,
Latvian,
Macedonian,
Malayalam,
Malay,
Dutch,
Norwegian,
Polish,
Portuguese,
Romanian,
Russian,
Sinhala,
Slovak,
Slovenian,
Albanian,
Serbian,
Swedish,
Tamil,
Telugu,
Tagalog,
Turkish,
Ukranian,
Urdu,
Vietnamese