open  

LinguaPix

LinguaPix is a database of picture naming norms. It contains 1,620 colour photographs of items spanning 42 semantic categories, which were named and rated by a group of German speakers (and are currently being evaluated by groups of Dutch, English, Polish, and Cantonese speakers).

Authors:  Agnieszka Ewa Krautz, Emmanuel Keuleers, Gabriella Rundblad, Susanna Yeung
Updated:  2021-04-30
Source:  https://linguapix.uni-mannheim.de/frontend/web/
Keywords:  psychology, linguistics, semantics, audio, picture-naming, English, German, Polish

open   documented  

Vox Populi

A corpus of European Parliamentary speech and tools for machine learning models.

Authors:  Changhan Wang, Morgane Riviere, Ann Lee, Anne Wu, Chaitanya Talnikar, Daniel Haziza, Mary Williamson, Juan Pino, Emmanuel Dupoux
Updated:  2021-04-30
Source:  https://aclanthology.org/2021.acl-long.80/
Keywords:  English, German, French, Spanish, Polish, Italian, Romanian, Hungarian, Czech, Dutch, Finnish, Slovak, Slovenian, Estonian, Lithuanian, Portuguese, Bulgarian, Greek, Latvian, Maltese, Swedish, Danish, speech synthesis, machine learning, Accented Speech

subs2vec

Python 3.7 resources to evaluate bigram and trigram frequencies in corpora.

Authors:  Jeroen van ParidonBill Thompson
Updated:  2019-04-30
Source:  https://github.com/jvparidon/subs2vec
Keywords:  language, bigram, trigram, lexical norms, psycholinguistics, Afrikaans, Arabic, Bulgarian, Bengali, Breton, Bosnian, Catalan, Czech, Danish, German, Greek, English, Esperanto, Spanish, Estonian, Basque, Farsi, Finnish, French, Galician, Hebrew, Hindi, Croatian, Hungarian, Armenian, Indonesian, Icelandic, Italian, Georgian, Kazakh, Korean, Lithuanian, Latvian, Macedonian, Malayalam, Malay, Dutch, Norwegian, Polish, Portuguese, Romanian, Russian, Sinhala, Slovak, Slovenian, Albanian, Serbian, Swedish, Tamil, Telugu, Tagalog, Turkish, Ukrainian, Urdu, Vietnamese
