Nijmegen Corpus of Casual Czech

Authors: Mirjam Ernestus, Lucie Kočková-Amortová, Petr Pollák
Updated: Sat 24 May 2014
Source: https://mirjamernestus.nl/Ernestus/NCCCz/index.php
Type: multimedia database
Languages: Czech
Keywords: language, communication, phonetics, Czech
Open Access: no
License:
Publications: Kočková-Amortová, L., Pollák, P., Rajnoha, J., Ernestus, M. (2014).
Citation: Kočková-Amortová, L., Pollák, P., Rajnoha, J., Ernestus, M. (2014). The Nijmegen corpus of casual Czech. In Proceedings of LREC 2014: 9th International Conference on Language Resources and Evaluation, pages 365-370. https://aclanthology.org/L14-1162/
Summary:

The Nijmegen Corpus of Casual Czech (NCCCz) contains more than 30 hours of high-quality recordings of casual conversations in Common Czech among ten groups of three male friends and ten groups of three female friends. All speakers were native speakers of Czech, raised in Prague or in the region of Central Bohemia, and were between 19 and 26 years old. This corpus can form the basis for all types of research on casual conversations in Czech, including phonetic research and research on how to improve automatic speech recognition.