|<   <   Page 1 / 1   >   >|

subs2vec

Python 3.7 resources to evaluate bigram and trigram frequencies in corpora.

Authors:  Jeroen van ParidonBill Thompson
Updated:  2019-12-24
Source:  https://github.com/jvparidon/subs2vec
Keywords:  languagebigramtrigramlexical normspsycholinguisticsAfrikaansArabicBulgarianBengaliBretonBosnianCatalanCzechDanishGermanGreekEnglishEsperantoSpanishEstonianBasqueFarsiFinnishFrenchGalicianHebrewHindiCroatianHungarianArmenianIndonesianIcelandicItalianGeorgianKazakhKoreanLithuanianLatvianMacedonianMalayalamMalayDutchNorwegianPolishPortugueseRomanianRussianSinhalaSlovakSlovenianAlbanianSerbianSwedishTamilTeluguTagalogTurkishUkranianUrduVietnamese

open   documented  

PBCM: Code-mixed Hindi-English corpus

A multispeaker code-mixed Hindi read-speech corpus.

Authors:  Ayushi PandeyBrij Mohan Lal SrivastavaRohit KumarBT NelloreKS TejaSV Gangashetty
Updated:  2018-12-24
Source:  https://brijmohan.github.io/publication/pbcm-lrec18/
Keywords:  code-mixinghindi-englishhindienglishmulti-speaker

open  

Natural Sounds Stimulus Set

The sound set includes 165 natural sounds, each 2-seconds in duration. The sounds were intended to include many of the sounds people commonly hear in their daily life.

Authors:  Sam Norman-HaignereNancy G. KanwisherJosh H. McDermott
Updated:  2015-11-24
Source:  http://mcdermottlab.mit.edu/svnh/Natural-Sound/Stimuli.html
Keywords:  speechmusicstimulifrequencyGermanFrenchItalianRussianHindiChinese

|<   <   Page 1 / 1   >   >|