Corpus Gesproken Nederlands (The Spoken Dutch Corpus)
| Authors: | W.J.M. Levelt, S.G. Nooteboom, J. Bil, G.E. Booij, P. Dengis, E. DeWallef, A. Hulk, B. Krekels, C. Lucas, D. Van Compernolle, W. Vonk |
|---|---|
| Updated: | Wed 30 July 2014 |
| Source: | https://ivdnt.org/images/stories/producten/documentatie/cgn_website/doc_English/topics/index.htm |
| Type: | database |
| Languages: | Dutch |
| Keywords: | language, phonology, syntax, word-frequency, Dutch |
| Open Access: | yes |
| License: | |
| Documentation: | https://ivdnt.org/images/stories/producten/documentatie/cgn_website/doc_English/topics/index.htm |
| Citation: | Corpus Spoken Dutch - CGN (Version 2.0.3) (2014) [Data set]. Available at the Dutch Language Institute: http://hdl.handle.net/10032/tm-a2-k6 |
| Summary: | A collection of 900 hours (almost 9 million words) of contemporary spoken Dutch from native speakers in Flanders and the Netherlands. The speech recordings are aligned with several transcriptions (e.g. orthographic, phonetic) and annotations (syntax, POS-tags). Metadata, lexica, frequency lists and the tool Corex which can be used to explore the data are included. |