The Gigaword Corpus
Authors: | Sigrún Helgadóttir, Eiríkur Rögnvaldsson, Jörgen Pind, Starkaður Barkarson, Tomaž Erjavec, Maciej Ogrodniczuk, Petya Osenova, Nikola Ljubešić, Kiril Simov, Andrej Pančur, Michał Rudolf, Matyáš Kopp, Steinþór Steingrímsson, Çağrı Çöltekin, Jesse de Does, Katrien Depuydt, Tommaso Agnoloni, Giulia Venturi, María Calzada Pérez, Luciana D. de Macedo, Costanza Navarretta, Giancarlo Luxardo, Matthew Coole, Paul Rayson, Vaidas Morkevičius, Tomas Krilavičius, Roberts Darǵis, Orsolya Ring, Ruben van Heusden, Maarten Marx, Darja Fišer |
---|---|
Updated: | Sat 30 April 2022 |
Source: | https://malheildir.arnastofnun.is/?mode=rmh2022#?stats_reduce=word&isCaseInsensitive&searchBy=word&cqp=%5B%5D&lang=en&display=about |
Type: | Corpus |
Languages: | Icelandic |
Keywords: | Icelandic, Corpus, Monolingual |
Open Access: | Yes |
License: | |
Documentation: | https://malheildir.arnastofnun.is/?mode=rmh2022#?stats_reduce=word&isCaseInsensitive&searchBy=word&cqp=%5B%5D&lang=en&display=about |
Publications: | Steinthór Steingrímsson, Sigrún Helgadóttir, Eiríkur Rögnvaldsson, Starkaður Barkarson and Jón Guðnason. 2018. Risamállheild: A Very Large Icelandic Text Corpus. Proceedings of LREC 2018, p. 4361-4366. Miyazaki, Japan. |
Summary: | Sigrún Helgadóttir, Ásta Svavarsdóttir, Eiríkur Rögnvaldsson, Kristín Bjarnadóttir and Hrafn Loftsson. 2012. The Tagged Icelandic Corpus (MÍM). Proceedings of the Workshop on Language Technology for Normalization of Less-Resourced Languages -SaLTMiL 8 – AfLaT2012,s. 67-72. Istanbul, Turkey. Jörgen Pind (ed.), Friðrik Magnússon and Stefán Briem. 1991. Icelandic word frequency book. The … |