subs2vec

Authors: Jeroen van ParidonBill Thompson
Updated: Tue 24 December 2019
Source: https://github.com/jvparidon/subs2vec
Type: Github Repository
Languages: cross-linguistic
Keywords: languagebigramtrigramlexical normspsycholinguisticsAfrikaansArabicBulgarianBengaliBretonBosnianCatalanCzechDanishGermanGreekEnglishEsperantoSpanishEstonianBasqueFarsiFinnishFrenchGalicianHebrewHindiCroatianHungarianArmenianIndonesianIcelandicItalianGeorgianKazakhKoreanLithuanianLatvianMacedonianMalayalamMalayDutchNorwegianPolishPortugueseRomanianRussianSinhalaSlovakSlovenianAlbanianSerbianSwedishTamilTeluguTagalogTurkishUkranianUrduVietnamese
Open Access:
License:
Citation: van Paridon, J., & Thompson, B. (2019, October 13). subs2vec: Word embeddings from subtitles in 55 languages. https://doi.org/10.31234/osf.io/fcrmy
Summary:

Python 3.7 scripts and command line tools to evaluate a set of word vectors on semantic similarity, semantic and syntactic analogy, and lexical norm prediction tasks.