The Gigaword Corpus

Authors: Sigrún Helgadóttir, Eiríkur Rögnvaldsson, Jörgen Pind, Starkaður Barkarson, Tomaž Erjavec, Maciej Ogrodniczuk, Petya Osenova, Nikola Ljubešić, Kiril Simov, Andrej Pančur, Michał Rudolf, Matyáš Kopp, Steinþór Steingrímsson, Çağrı Çöltekin, Jesse de Does, Katrien Depuydt, Tommaso Agnoloni, Giulia Venturi, María Calzada Pérez, Luciana D. de Macedo, Costanza Navarretta, Giancarlo Luxardo, Matthew Coole, Paul Rayson, Vaidas Morkevičius, Tomas Krilavičius, Roberts Darǵis, Orsolya Ring, Ruben van Heusden, Maarten Marx, Darja Fišer
Updated: Sat 30 April 2022
Source: https://malheildir.arnastofnun.is/?mode=rmh2022#?stats_reduce=word&isCaseInsensitive&searchBy=word&cqp=%5B%5D&lang=en&display=about
Type: Corpus
Languages: Icelandic
Keywords: Icelandic, Corpus, Monolingual
Open Access: Yes
License:
Documentation: https://malheildir.arnastofnun.is/?mode=rmh2022#?stats_reduce=word&isCaseInsensitive&searchBy=word&cqp=%5B%5D&lang=en&display=about
Publications: Steinthór Steingrímsson, Sigrún Helgadóttir, Eiríkur Rögnvaldsson, Starkaður Barkarson and Jón Guðnason. 2018. Risamálheild: A Very Large Icelandic Text Corpus. Proceedings of LREC 2018, pp. 4361-4366. Miyazaki, Japan.
Summary:

Sigrún Helgadóttir, Ásta Svavarsdóttir, Eiríkur Rögnvaldsson, Kristín Bjarnadóttir and Hrafn Loftsson. 2012. The Tagged Icelandic Corpus (MÍM). Proceedings of the Workshop on Language Technology for Normalization of Less-Resourced Languages - SaLTMiL 8 – AfLaT2012, pp. 67-72. Istanbul, Turkey.

Jörgen Pind (ed.), Friðrik Magnússon and Stefán Briem. 1991. Icelandic word frequency book. The …