Congress of Balticists - Baltic-language digital resources in grammar research

Baltic-language digital resources in grammar research

Organizers:

Andra Kalnača
Latvijas Universitāte 
 
Ilze Lokmane
Latvijas Universitāte
 
Daiki Horiguči
Kioto Universitāte

Program:

Open program

Abstracts:

Open abstracts

Description:

The last few years have seen a growing need for the analysis of linguistic data using various types of corpora, databases, dictionaries, and other digital resources, which provide a number of possibilities for data processing and annotation based in various statistical and corpus linguistics methods. Because corpora and other digital sources can be used to analyze language both sinchronically and diachronically, they provide the chance to look beyond the traditional methods and apply new theoretical and practical approaches in a variety of languages when conducting grammatical research.

This workshop is conceived as a forum for the exchange of new and creative ideas between researchers interested in various areas of grammar and interdisciplinary approaches. We encourage the participation of those who base their research on various Baltic corpora, databases, and other electronic material and who apply corpus linguistics methodology in their research.

It is important to note that material in both Lithuanian and Latvian is already compiled in various synchronic and diachronic digital resources (ex. Aleksaitė et al. 2011; Rimkutė et al. 2013; Dadurkevičius 2020a, 2020b; Andronova et al. 2022; Levāne-Petrova et al. 2023; see also NKK https://korpuss.lv/; Spector et al. https://tezaurs.lv/) and that in addition new corpora, databases, and other resources are in the process of creation (ex. Latvian WordNet, see https://wordnet.ailab.lv/; the database of Latvian morphemes and word-formation models, see https://www.dlmdm.lu.lv/). At the same time, a large-scale study of grammatical and semantic systems in the Baltic languages is underway which makes use of these resources and allows us to improve them. This encourages discussion about research in these areas with the international community of Baltic language researchers.

We invite presentations which widen the theoretical and practical perception not only of individual Baltic languages and their respective variations but also universal regularities of the language system, supported by digital resources of various breadth and type as well as applied research methods.

The main topics of discussion are these:

The use of corpora, databases, dictionaries, and other electronic resources in the analysis of grammatical phenomena, both in individual Baltic languages and from a typological or cognitive perspective
Various types of corpora and their combinations in the analysis of various aspects of Baltic grammar
Baltic-language digital resources and the analysis of the grammar system‘s evolution
The interconnections between Baltic-language digital resources and grammar (semantics, pragmatics, etc.)

References

Aleksaitė, A. et al. 2011. Lietuvių kalbos naujažodžių duomenynas. Tęstinis internetinis žinynas nuo 2011 m. Vilnius: Lietuvių kalbos institutas. https://doi.org/10.35321/neo 
Andronova, E. et al., 2022. The Corpus of Early Written Latvian (2022). CLARIN-LV digital library at IMCS, University of Latvia. Žr. http://hdl.handle.net/20.500.12574/90 
Dadurkevičius, V. 2020a. Wordlist of lemmas from the Joint Corpus of Lithuanian. CLARIN-LT digital library in the Republic of Lithuania. Žr. https://clarin.vdu.lt/xmlui/handle/20.500.11821/41 
Dadurkevičius, V. 2020b. Assessment data of the Dictionary of Modern Lithuanian versus Joint Corpora. CLARIN-LT digital library in the Republic of Lithuania. Žr. https://clarin.vdu.lt/xmlui/handle/20.500.11821/36 
LatvianWordNet. Žr. https://wordnet.ailab.lv/project1; https://wordnet.ailab.lv/project2 
Latviešu valodas morfēmu un vārddarināšanas modeļu datubāze (DLMDM). Žr. https://www.dlmdm.lu.lv/ 
Levāne-Petrova, K. et al. 2023. Balanced Corpus of Modern Latvian (LVK2022). CLARIN-LV digital library at IMCS, University of Latvia. Žr. http://hdl.handle.net/20.500.12574/84 
Nacionālā korpusu kolekcija (NKK). Žr. www.korpuss.lv 
Rimkutė, E. et al. (eds.). 2013. Lietuvių kalbos morfemikos duomenų bazė [elektroninis išteklius]: duomenų bazė. Kaunas: Vytauto Didžiojo universitetas. Žr. https://klc.vdu.lt/morfema/ 
Spektors, A. et al. 2024, Tēzaurs.lv 2024 (Autumn Edition). CLARIN-LV digital library at IMCS, University of Latvia. Žr. http://hdl.handle.net/20.500.12574/110

Accepted papers:

Vanesa Balmane
Determinatīvie salikteņi lietvārdsGEN + lietvārds “Latviešu valodas morfēmu un vārddarināšanas modeļu datubāzes” materiālā
Ineta Balode, Dzintra Lele-Rozentāle
Leksēmas gramatizācija un gramatiskā raksturojuma atspoguļojums leksikogrāfijā
Agnė Bielinskienė
Lietuvių kalbos automatinės sintaksinės analizės plėtojimas ir problematika
Diana Burbienė
Naujų lietuvių kalbos žodžių eksperimentas žmogiškajam ir dirbtiniam intelektui
Anita Butāne
Terminelementi morfēmu un terminu datubāzēs
Anna Frīdenberga
Salikteņu un vārdu savienojumu šķīrums – viens no problemātiskiem jautājumiem, veidojot “Latviešu valodas vēsturisko vārdnīcu”
Daiki Horiguchi
Saliktie divdabji ar pirmajiem internacionālajiem elementiem: korpusu datu analīze
Dalia Jakulytė, Asta Balčiūnienė
Senųjų raštų leksikos ir morfologijos sistemos prototipo – „Knygos nobažnystės“ duomenų bazės – plėtra morfemikos ir žodžių darybos tyrimams
Erika Jasionytė-Mikučionienė
Lietuvių kalbos klausiamųjų dalelyčių raidos aspektai
Giedrė Junčytė
The emotive, perception and cognition middles in Lithuanian: a corpus-based study
Andra Kalnača, Ilze Lokmane
Latviešu valodas konstrukcija kas tur ko + nenoteiksme un miratīvs: korpusa datu analīze
Kristīne Levāne-Petrova, Mikus Grasmanis, Baiba Saulīte
“Nacionālās korpusu kolekcijas” izmantošana latviešu valodas biežuma saraksta izveidē
Gunta Ločmele
Pirmās latviešu reklāmas kā valodu kontaktu un tulkošanas attīstības pētījumu avots
Paula Miķelsone
Vokāļu kontrakcija priedēkļa un saknes sadurā runas korpusu datos
Gunta Nešpore-Bērzkalne, Madara Stāde
The variety of semantic links in the electronic dictionary “Tēzaurs”
Jurgis Pakerys, Agnė Navickaitė-Klišauskienė, Virginijus Dadurkevičius
Mišriuoju būdu sudaryti daiktavardžiai Jungtinio lietuvių kalbos tekstyno duomenimis
Erika Rimkutė
Nauji lietuvių kalbos gramatiškai anotuoti tekstynai: morfologiškai anotuoto tekstyno rengimas
Baiba Saulīte, Ilze Auziņa
Spontānas runas marķējuma līmeņi un gramatiskā analīze
Inta Urbanoviča
Čehu valodas elektroniskās lietojumprogrammas “Morfio” izmantošana latviešu valodas paronīmu izpētē
Evelīna Zilgalve
Latviešu valodas elektronisko resursu izmantojums valodas konsultācijās