|<   <   Page 1 / 1   >   >|

open   documented  

Vox Populi

A corpus of European Parlimentary speech and tools for machine learning models.

Authors:  Chanhan WangMorgane RiviereAnn LeeAnne WuChaitanya TalnikarDaniel HazizaMary WIlliamsonJuan PinoEmmanuel Dupoux
Updated:  2021-04-30
Source:  https://aclanthology.org/2021.acl-long.80/
Keywords:  EnglishGermanFrenchSpanishPolishItalianRomanianHungarianCzechDutchFinnishSlovakSlovenianEstonianLithuanianPortugueseBulgarianGreekLatvianMalteseSwedishDanishspeech synthesismachine learningAccented Speech

subs2vec

Python 3.7 resources to evaluate bigram and trigram frequencies in corpora.

Authors:  Jeroen van ParidonBill Thompson
Updated:  2019-04-30
Source:  https://github.com/jvparidon/subs2vec
Keywords:  languagebigramtrigramlexical normspsycholinguisticsAfrikaansArabicBulgarianBengaliBretonBosnianCatalanCzechDanishGermanGreekEnglishEsperantoSpanishEstonianBasqueFarsiFinnishFrenchGalicianHebrewHindiCroatianHungarianArmenianIndonesianIcelandicItalianGeorgianKazakhKoreanLithuanianLatvianMacedonianMalayalamMalayDutchNorwegianPolishPortugueseRomanianRussianSinhalaSlovakSlovenianAlbanianSerbianSwedishTamilTeluguTagalogTurkishUkranianUrduVietnamese

|<   <   Page 1 / 1   >   >|