Speech and Language Resource Bank
This is a corpus of four European sign languages. It contains linguistically annotated video files of Sign Language of the Netherlands (Nederlandse Gebarentaal), British Sign Language, and Swedish Sign Language; data include narratives, dialogues, small lexicons, and poetry.
Authors: Stephen Levinson and Louis Boves
Updated: 2010-12-24
Source: http://sign-lang.ruhosting.nl/echo/
Keywords: language,
dialogue,
lexicon,
Dutch Sign Language,
British Sign Language,
Swedish Sign Language
The Dutch Bilingualism Database contains over 1,500 sessions from a number of projects and research programs that were directed at investigating multilingualism.
Authors: Pieter Muysken
Updated: 2008-12-24
Source: http://portal.clarin.nl/node/4175
Keywords: bilingual,
multilingualism,
language,
Arabic,
Berber,
Dutch,
English,
Papiamento,
Sarnami,
Sranan,
Turkish
An acted, multimodal and multispeaker database containing approximately 12 hours of audiovisual data, including video, speech, motion capture of face, text transcriptions.
Authors: Carlos Busso,
Murtaza Bulut,
Chi-Chun Lee,
Abe Kazemzadeh,
Emily Mower,
Samuel Kim,
Jeannette N. Chang,
Sungbok Lee,
Shrikanth Narayanan
Updated: 2008-11-09
Source: https://sail.usc.edu/iemocap/
Keywords: emotions,
behavior,
speech,
gesture,
motion-capture,
english
The British National Corpus (BNC) was originally created by Oxford University press in the 1980s - early 1990s, and it contains 100 million words of text from a wide range of genres (e.g. spoken, fiction, magazines, newspapers, and academic).
Authors: Oxford University Press
Updated: 2007-12-24
Source: https://www.english-corpora.org/bnc/
Keywords: English,
linguistics,
corpora,
database,
written-text,
speech