Baltic-language digital resources in grammar research

 

Organizers:


 

Description:


The last few years have seen a growing need to analyze linguistic data using corpora, databases, dictionaries, and other digital resources, which offer many possibilities for data processing and annotation based on statistical and corpus-linguistic methods. Because corpora and other digital sources can be used to analyze language both synchronically and diachronically, they make it possible to look beyond traditional methods and to apply new theoretical and practical approaches to grammatical research across a variety of languages.

This workshop is conceived as a forum for the exchange of new and creative ideas between researchers interested in various areas of grammar and interdisciplinary approaches. We encourage the participation of those who base their research on various Baltic corpora, databases, and other electronic material and who apply corpus linguistics methodology in their research.

It is important to note that material in both Lithuanian and Latvian has already been compiled in various synchronic and diachronic digital resources (e.g. Aleksaitė et al. 2011; Rimkutė et al. 2013; Dadurkevičius 2020a, 2020b; Andronova et al. 2022; Levāne-Petrova et al. 2023; see also NKK https://korpuss.lv/; Spector et al. https://tezaurs.lv/) and that new corpora, databases, and other resources are being created (e.g. Latvian WordNet, see https://wordnet.ailab.lv/; the database of Latvian morphemes and word-formation models, see https://www.dlmdm.lu.lv/). At the same time, a large-scale study of grammatical and semantic systems in the Baltic languages is underway; it makes use of these resources and helps to improve them. This invites discussion of research in these areas with the international community of Baltic language researchers.

We invite presentations that broaden the theoretical and practical understanding not only of individual Baltic languages and their varieties but also of universal regularities of the language system, supported by digital resources of various sizes and types and by applied research methods.

The main topics of discussion are:

  • The use of corpora, databases, dictionaries, and other electronic resources in the analysis of grammatical phenomena, both in individual Baltic languages and from a typological or cognitive perspective
  • Various types of corpora and their combinations in the analysis of various aspects of Baltic grammar
  • Baltic-language digital resources and the analysis of the evolution of the grammatical system
  • The interconnections between Baltic-language digital resources and grammar (semantics, pragmatics, etc.)

 

References

 

Accepted papers:


  • Vanesa Balmane
    Determinative compounds of the type nounGEN + noun in the material of the “Database of Latvian Morphemes and Word-Formation Models”

  • Ineta Balode, Dzintra Lele-Rozentāle
    Grammaticalization of a lexeme and the representation of its grammatical description in lexicography

  • Agnė Bielinskienė
    Development and problems of automatic syntactic analysis of Lithuanian

  • Diana Burbienė
    An experiment with new Lithuanian words for human and artificial intelligence

  • Anita Butāne
    Term elements in morpheme and term databases

  • Anna Frīdenberga
    Distinguishing compounds from word combinations: one of the problematic issues in compiling the “Historical Dictionary of Latvian”

  • Daiki Horiguchi
    Compound participles with international first elements: an analysis of corpus data

  • Dalia Jakulytė, Asta Balčiūnienė
    Development of the “Knyga nobažnystės” database, a prototype of a system for the lexis and morphology of old writings, for research on morphemics and word formation

  • Erika Jasionytė-Mikučionienė
    Aspects of the development of Lithuanian interrogative particles

  • Giedrė Junčytė
    The emotive, perception and cognition middles in Lithuanian: a corpus-based study

  • Andra Kalnača, Ilze Lokmane
    The Latvian construction kas tur ko + infinitive and the mirative: an analysis of corpus data

  • Kristīne Levāne-Petrova, Mikus Grasmanis, Baiba Saulīte
    Using the “National Corpus Collection” to create a frequency list of Latvian

  • Gunta Ločmele
    The first Latvian advertisements as a source for research on language contact and the development of translation

  • Paula Miķelsone
    Vowel contraction at the prefix-root boundary in speech corpus data

  • Gunta Nešpore-Bērzkalne, Madara Stāde
    The variety of semantic links in the electronic dictionary “Tēzaurs”

  • Jurgis Pakerys, Agnė Navickaitė-Klišauskienė, Virginijus Dadurkevičius
    Nouns formed by the mixed word-formation type, based on data from the Joint Corpus of Lithuanian

  • Erika Rimkutė
    New grammatically annotated corpora of Lithuanian: preparing a morphologically annotated corpus

  • Baiba Saulīte, Ilze Auziņa
    Annotation levels of spontaneous speech and grammatical analysis

  • Inta Urbanoviča
    Using the Czech electronic application “Morfio” in the study of Latvian paronyms

  • Evelīna Zilgalve
    The use of Latvian electronic resources in language consultations

 

Abstract submission: 


If you would like to submit a paper for this workshop, please fill out the abstract submission form.