Speech and Language Resource Bank
Python 3.7 resources to evaluate bigram and trigram frequencies in corpora.
Authors: Jeroen van Paridon,
Bill Thompson
Updated: 2019-12-24
Source: https://github.com/jvparidon/subs2vec
Keywords: language,
bigram,
trigram,
lexical norms,
psycholinguistics,
Afrikaans,
Arabic,
Bulgarian,
Bengali,
Breton,
Bosnian,
Catalan,
Czech,
Danish,
German,
Greek,
English,
Esperanto,
Spanish,
Estonian,
Basque,
Farsi,
Finnish,
French,
Galician,
Hebrew,
Hindi,
Croatian,
Hungarian,
Armenian,
Indonesian,
Icelandic,
Italian,
Georgian,
Kazakh,
Korean,
Lithuanian,
Latvian,
Macedonian,
Malayalam,
Malay,
Dutch,
Norwegian,
Polish,
Portuguese,
Romanian,
Russian,
Sinhala,
Slovak,
Slovenian,
Albanian,
Serbian,
Swedish,
Tamil,
Telugu,
Tagalog,
Turkish,
Ukranian,
Urdu,
Vietnamese
SLABank is a component of TalkBank dedicated to providing corpora for the study of second language acquisition.
Authors: Brian MacWhinney
Updated: 2018-05-04
Source: https://slabank.talkbank.org/
Keywords: language-acquisition,
second-language,
Czech,
English,
French,
German,
Hungarian,
Icelandic,
Italian,
Mandarin,
Spanish
An integrated repository for the study of language in aphasia.
Authors: Brian MacWhinney
Updated: 2017-08-30
Source: https://aphasia.talkbank.org/
Keywords: aphasia,
conversation,
language,
English,
Cantonese,
Croatian,
French,
German,
Greek,
Hungarian,
Italian,
Japanese,
Madarin,
Romanian,
Spanish