Organizers:
- Andra Kalnača
Latvijas Universitāte
- Ilze Lokmane
Latvijas Universitāte
- Daiki Horiguči
Kioto Universitāte
Description:
The last few years have seen a growing need for the analysis of linguistic data using various types of corpora, databases, dictionaries, and other digital resources, which provide a number of possibilities for data processing and annotation based in various statistical and corpus linguistics methods. Because corpora and other digital sources can be used to analyze language both sinchronically and diachronically, they provide the chance to look beyond the traditional methods and apply new theoretical and practical approaches in a variety of languages when conducting grammatical research.
This workshop is conceived as a forum for the exchange of new and creative ideas between researchers interested in various areas of grammar and interdisciplinary approaches. We encourage the participation of those who base their research on various Baltic corpora, databases, and other electronic material and who apply corpus linguistics methodology in their research.
It is important to note that material in both Lithuanian and Latvian is already compiled in various synchronic and diachronic digital resources (ex. Aleksaitė et al. 2011; Rimkutė et al. 2013; Dadurkevičius 2020a, 2020b; Andronova et al. 2022; Levāne-Petrova et al. 2023; see also NKK https://korpuss.lv/; Spector et al. https://tezaurs.lv/) and that in addition new corpora, databases, and other resources are in the process of creation (ex. Latvian WordNet, see https://wordnet.ailab.lv/; the database of Latvian morphemes and word-formation models, see https://www.dlmdm.lu.lv/). At the same time, a large-scale study of grammatical and semantic systems in the Baltic languages is underway which makes use of these resources and allows us to improve them. This encourages discussion about research in these areas with the international community of Baltic language researchers.
We invite presentations which widen the theoretical and practical perception not only of individual Baltic languages and their respective variations but also universal regularities of the language system, supported by digital resources of various breadth and type as well as applied research methods.
The main topics of discussion are these:
- The use of corpora, databases, dictionaries, and other electronic resources in the analysis of grammatical phenomena, both in individual Baltic languages and from a typological or cognitive perspective
- Various types of corpora and their combinations in the analysis of various aspects of Baltic grammar
- Baltic-language digital resources and the analysis of the grammar system‘s evolution
- The interconnections between Baltic-language digital resources and grammar (semantics, pragmatics, etc.)
References
- Aleksaitė, A. et al. 2011. Lietuvių kalbos naujažodžių duomenynas. Tęstinis internetinis žinynas nuo 2011 m. Vilnius: Lietuvių kalbos institutas. https://doi.org/10.35321/neo
- Andronova, E. et al., 2022. The Corpus of Early Written Latvian (2022). CLARIN-LV digital library at IMCS, University of Latvia. Žr. http://hdl.handle.net/20.500.12574/90
- Dadurkevičius, V. 2020a. Wordlist of lemmas from the Joint Corpus of Lithuanian. CLARIN-LT digital library in the Republic of Lithuania. Žr. https://clarin.vdu.lt/xmlui/handle/20.500.11821/41
- Dadurkevičius, V. 2020b. Assessment data of the Dictionary of Modern Lithuanian versus Joint Corpora. CLARIN-LT digital library in the Republic of Lithuania. Žr. https://clarin.vdu.lt/xmlui/handle/20.500.11821/36
- LatvianWordNet. Žr. https://wordnet.ailab.lv/project1; https://wordnet.ailab.lv/project2
- Latviešu valodas morfēmu un vārddarināšanas modeļu datubāze (DLMDM). Žr. https://www.dlmdm.lu.lv/
- Levāne-Petrova, K. et al. 2023. Balanced Corpus of Modern Latvian (LVK2022). CLARIN-LV digital library at IMCS, University of Latvia. Žr. http://hdl.handle.net/20.500.12574/84
- Nacionālā korpusu kolekcija (NKK). Žr. www.korpuss.lv
- Rimkutė, E. et al. (eds.). 2013. Lietuvių kalbos morfemikos duomenų bazė [elektroninis išteklius]: duomenų bazė. Kaunas: Vytauto Didžiojo universitetas. Žr. https://klc.vdu.lt/morfema/
- Spektors, A. et al. 2024, Tēzaurs.lv 2024 (Autumn Edition). CLARIN-LV digital library at IMCS, University of Latvia. Žr. http://hdl.handle.net/20.500.12574/110
Accepted papers:
- Vanesa Balmane
Determinatīvie salikteņi lietvārdsGEN + lietvārds “Latviešu valodas morfēmu un vārddarināšanas modeļu datubāzes” materiālā - Ineta Balode, Dzintra Lele-Rozentāle
Leksēmas gramatizācija un gramatiskā raksturojuma atspoguļojums leksikogrāfijā - Agnė Bielinskienė
Lietuvių kalbos automatinės sintaksinės analizės plėtojimas ir problematika - Diana Burbienė
Naujų lietuvių kalbos žodžių eksperimentas žmogiškajam ir dirbtiniam intelektui - Anita Butāne
Terminelementi morfēmu un terminu datubāzēs - Anna Frīdenberga
Salikteņu un vārdu savienojumu šķīrums – viens no problemātiskiem jautājumiem, veidojot “Latviešu valodas vēsturisko vārdnīcu” - Daiki Horiguchi
Saliktie divdabji ar pirmajiem internacionālajiem elementiem: korpusu datu analīze - Dalia Jakulytė, Asta Balčiūnienė
Senųjų raštų leksikos ir morfologijos sistemos prototipo – „Knygos nobažnystės“ duomenų bazės – plėtra morfemikos ir žodžių darybos tyrimams - Erika Jasionytė-Mikučionienė
Lietuvių kalbos klausiamųjų dalelyčių raidos aspektai - Giedrė Junčytė
The emotive, perception and cognition middles in Lithuanian: a corpus-based study - Andra Kalnača, Ilze Lokmane
Latviešu valodas konstrukcija kas tur ko + nenoteiksme un miratīvs: korpusa datu analīze - Kristīne Levāne-Petrova, Mikus Grasmanis, Baiba Saulīte
“Nacionālās korpusu kolekcijas” izmantošana latviešu valodas biežuma saraksta izveidē - Gunta Ločmele
Pirmās latviešu reklāmas kā valodu kontaktu un tulkošanas attīstības pētījumu avots - Paula Miķelsone
Vokāļu kontrakcija priedēkļa un saknes sadurā runas korpusu datos - Gunta Nešpore-Bērzkalne, Madara Stāde
The variety of semantic links in the electronic dictionary “Tēzaurs” - Jurgis Pakerys, Agnė Navickaitė-Klišauskienė, Virginijus Dadurkevičius
Mišriuoju būdu sudaryti daiktavardžiai Jungtinio lietuvių kalbos tekstyno duomenimis - Erika Rimkutė
Nauji lietuvių kalbos gramatiškai anotuoti tekstynai: morfologiškai anotuoto tekstyno rengimas - Baiba Saulīte, Ilze Auziņa
Spontānas runas marķējuma līmeņi un gramatiskā analīze - Inta Urbanoviča
Čehu valodas elektroniskās lietojumprogrammas “Morfio” izmantošana latviešu valodas paronīmu izpētē - Evelīna Zilgalve
Latviešu valodas elektronisko resursu izmantojums valodas konsultācijās
Abstract submission:
If you would like to submit a paper for this workshop, please fill out the abstract submission form.