|<   <   Page 1 / 2   >   >|

open   documented  

The Gigaword Corpus

Authors:  Sigrún HelgadóttirEiríkur RögnvaldssonJörgen PindStarkaður BarkarsonTomaž ErjavecMaciej OgrodniczukPetya OsenovaNikola LjubešićKiril SimovAndrej PančurMichał RudolfMatyáš KoppSteinþór SteingrímssonÇağrı ÇöltekinJesse de DoesKatrien DepuydtTommaso AgnoloniGiulia VenturiMaría Calzada PérezLuciana D. de MacedoCostanza NavarrettaGiancarlo LuxardoMatthew CoolePaul RaysonVaidas MorkevičiusTomas KrilavičiusRoberts DarǵisOrsolya RingRuben van HeusdenMaarten MarxDarja Fišer
Updated:  2022-04-30
Source:  https://malheildir.arnastofnun.is/?mode=rmh2022#?stats_reduce=word&isCaseInsensitive&searchBy=word&cqp=%5B%5D&lang=en&display=about
Keywords:  IcelandicCorpusMonolingual

open   documented  

SpiCE: Speech in Cantonese and English

An open-access corpus of conversational bilingual Speech in Cantonese and English.

Authors:  Khia A. JohnsonMolly BabelIvan FongNancy Yiu
Updated:  2021-05-20
Source:  https://doi.org/10.5683/SP2/MJOXP3
Keywords:  bilingualconversationcorpuscantoneseenglish

open  

Latin American and Iberian Languages Open Corpora Forum

Authors:  Livy Real & Ivan Meza
Updated:  2021-04-30
Source:  https://opencor.gitlab.io/corpora/mager18wixarika/
Keywords:  Latin AmericanSpanishCorpusCorporaCollection

open  

Corpus-derived Chinese Lexical Association Database

Authors:  Shu-Yen LinHsueh-Chih ChenTao-Hsing ChangWei-En Lee & Yao-Ting Sung
Updated:  2019-04-30
Source:  http://www.chinesereadability.net/LexicalAssociation/CLAD/
Keywords:  chineseLexical AssociationCorpus

open  

Macroscope

Authors:  Li YingTomas EngelthalarCynthia SiewThomas Hills
Updated:  2019-04-30
Source:  http://macroscope.intelligence-media.com/index.html
Keywords:  CorpusPsycholinguisticsEnglishComputational LinguisticsDistributional Semantics

open  

Bambara Reference Corpus

A corpus of Bambara, a Mandinka language spoken in Mali.

Authors:  Valentin VydrinEkaterina Aplonova
Updated:  2017-04-30
Source:  http://cormand.huma-num.fr/
Keywords:  BambaraCorpusCorpora Mandeica

open  

Ghent Eye-Tracking Corpus

A mono and bilingual corpus of eye-tracking data

Authors:  Uschi CopNicolas DirixDenis DriegheWouter Duyck
Updated:  2017-04-30
Source:  https://expsy.ugent.be/downloads/geco/
Keywords:  DutchEnglishEye TrackingCorpus

open  

The English Consistent Confusion Corpus

A corpus of misperceptions in English under varying noise conditions

Authors:  Ricard MarxerJon BarkerMartin Cooke
Updated:  2016-04-30
Source:  https://spandh.dcs.shef.ac.uk//confusion_corpus/
Keywords:  EnglishNoiseMisperceptionCorpus

open  

HelexKids

A child language corpus of Greek

Authors:  Aris TerzopoulosDr. Lynne DuncanDr. Georgia NiolakiProf. Jackie MastersonMark Wilson
Updated:  2016-04-30
Source:  https://www.helexkids.org/home
Keywords:  GreekChild LanguageCorpus

open  

Maninka Reference Corpus

A corpus of Maninka, a Mandinka language spoken in Guinea.

Authors:  Valentin VydrinAndrij Rovenchak & Kirill Maslinsky
Updated:  2016-04-30
Source:  http://cormand.huma-num.fr/cormani/index.html
Keywords:  ManinkaCorpusCorpora Mandeica

|<   <   Page 1 / 2   >   >|