Speech and Language Resource Bank
Python 3.7 resources to evaluate bigram and trigram frequencies in corpora.
Authors: Jeroen van Paridon,
Bill Thompson
Updated: 2019-12-24
Source: https://github.com/jvparidon/subs2vec
Keywords: language,
bigram,
trigram,
lexical norms,
psycholinguistics,
Afrikaans,
Arabic,
Bulgarian,
Bengali,
Breton,
Bosnian,
Catalan,
Czech,
Danish,
German,
Greek,
English,
Esperanto,
Spanish,
Estonian,
Basque,
Farsi,
Finnish,
French,
Galician,
Hebrew,
Hindi,
Croatian,
Hungarian,
Armenian,
Indonesian,
Icelandic,
Italian,
Georgian,
Kazakh,
Korean,
Lithuanian,
Latvian,
Macedonian,
Malayalam,
Malay,
Dutch,
Norwegian,
Polish,
Portuguese,
Romanian,
Russian,
Sinhala,
Slovak,
Slovenian,
Albanian,
Serbian,
Swedish,
Tamil,
Telugu,
Tagalog,
Turkish,
Ukranian,
Urdu,
Vietnamese
The Dutch Bilingualism Database contains over 1,500 sessions from a number of projects and research programs that were directed at investigating multilingualism.
Authors: Pieter Muysken
Updated: 2008-12-24
Source: http://portal.clarin.nl/node/4175
Keywords: bilingual,
multilingualism,
language,
Arabic,
Berber,
Dutch,
English,
Papiamento,
Sarnami,
Sranan,
Turkish