open  

Nijmegen Corpus of Casual French

The Nijmegen Corpus of Casual French contains 35 hours of high-quality recordings featuring 46 French speakers conversing among friends. The speech has been orthographically annotated by professional transcribers.

Authors:  Mirjam ErnestusFrancisco TorreiraMartine Adda-Decker
Updated:  2010-02-18
Source:  https://www.ru.nl/cls/publications/corpora/
Keywords:  languagecommunicationcasual speechFrench

open   documented  

European Cultural Heritage Online

This is a corpus of four European sign languages. It contains linguistically annotated video files of Sign Language of the Netherlands (Nederlandse Gebarentaal), British Sign Language, and Swedish Sign Language; data include narratives, dialogues, small lexicons, and poetry.

Authors:  Stephen Levinson and Louis Boves
Updated:  2010-02-18
Source:  http://sign-lang.ruhosting.nl/echo/
Keywords:  languagedialoguelexiconDutch Sign LanguageBritish Sign LanguageSwedish Sign Language

open   documented  

IEMOCAP: The Interactive Emotional Dyadic Motion Capture Database

An acted, multimodal and multispeaker database containing approximately 12 hours of audiovisual data, including video, speech, motion capture of face, text transcriptions.

Authors:  Carlos BussoMurtaza BulutChi-Chun LeeAbe KazemzadehEmily MowerSamuel KimJeannette N. ChangSungbok LeeShrikanth Narayanan
Updated:  2008-11-09
Source:  https://sail.usc.edu/iemocap/
Keywords:  emotionsbehaviorspeechgesturemotion-captureenglish

open  

Dutch Bilingualism Database

The Dutch Bilingualism Database contains over 1,500 sessions from a number of projects and research programs that were directed at investigating multilingualism.

Authors:  Pieter Muysken
Updated:  2008-02-18
Source:  http://portal.clarin.nl/node/4175
Keywords:  bilingualmultilingualismlanguageArabicBerberDutchEnglishPapiamentoSarnamiSrananTurkish

open  

British National Corpus

The British National Corpus (BNC) was originally created by Oxford University press in the 1980s - early 1990s, and it contains 100 million words of text from a wide range of genres (e.g. spoken, fiction, magazines, newspapers, and academic).

Authors:  Oxford University Press
Updated:  2007-02-18
Source:  https://www.english-corpora.org/bnc/
Keywords:  Englishlinguisticscorporadatabasewritten-textspeech

open  

CELEX

Web-based version of the Center for Lexical Information (CELEX) database, containing Dutch word frequencies.

Authors:  R H. BaayenR PiepenbrockL Gulikers
Updated:  2001-02-18
Source:  http://celex.mpi.nl/
Keywords:  languageword-formpsycholinguisticsDutchEnglishGermanCyrillic

open  

The Dutch Polyphone Corpus

The Dutch Polyphone Corpus contains speech from 5050 speakers from all regions of The Netherlands. Speakers had to read digits and sentences and answer questions over telephone.

Authors:  Els Den OsT.I. BoogaartLou BovesEsther Klabbers
Updated:  1995-01-18
Source:  https://www.ru.nl/cls/publications/corpora/
Keywords:  languagespeechDutch

« Page 7 / 7