Quantitative analysis of syllables in Slavic languages(Russian, Slovak, Serbian)
Kvantitatívna analýza slabík v slovanských jazykoch (ruština, slovenčina, srbčina)
Кванититативна анализа слогова у словенским језицима (руски, словачки, српски)
The project is focused on quantitative analysis of syllables in Slavic languages, namely, in Russian, Slovak, and Serbian. These three languages represent three geographical groups of Slavic languages (East, West, and South). Syllables, as opposed to other language units, have not been mathematically modelled systematically, the main reason being problems with their definition (i.e., with word syllabification). The aim of the project is to fill this gap. The syllabification can be performed algorithmically, the approach makes use, among others, of statistical tests. In particular, we will focus on models for syllable frequency and syllable length in the three abovementioned languages. We will work with orthographic input of texts. It is expected that the new models will be related to the already known models for grapheme frequencies and word length.
This project is co-funded Slovak - Serbian joint research project in the period 2017-2018.
Ministry of Education, Science and Technological Development (Serbia)
Project Team
Slovak team members
Slovak Principal Investigator / Professor Ján Mačutek, PhD.
Michaela Koščová, PhD student
Lívia Leššová, PhD student
Serbian team members
Serbian Principal Investigator (2017) / Professor Ivan Obradović, PhD.
Serbian Principal Investigator (2017 - 2018)/ Professor Ranka Stanković, PhD.
Marija Radojičić, PhD student
Biljana Lazić, PhD student
Základné informácie o projekte / Project basic information
Team - Serbian Slovakian bilateral project proposal for period 20172018.
Тим - Предлог билатералног пројекта Србија Словачка за циклус 20172018.
Project - Description Serbian Slovakian bilateral project proposal for period 20172018.
Опис пројекта - Предлог билатералног пројекта Србија Словачка за циклус 20172018.
No | Who? | When? | Where? | Short desription |
---|---|---|---|---|
1 | Ján Mačutek | 7-10.02.2018 | Belgrade | ... |
2 | Marija Radojičić | 14-18.02.2018 | Bratislava | ... |
3 | Ján Mačutek | 6-10.03.2018 | Belgrade | ... |
4 | Lívia Leššová | 6-10.03.2018 | Belgrade | ... |
5 | Biljana Lazić | 23-26.03.2018 | Bratislava | ... |
6 | Ján Mačutek | 6-9.11.2018 | Belgrade | revision of the outputs of the analysis of Serbian texts; mathematical modelling of syllable frequencies and syllable length in Serbian; preparing the first draft of the paper with results of the abovementioned analysis |
7 | Marija Radojičić | 22-25.11.2018 | Bratislava | |
8 | Biljana Lazić | 23-26.11.2018 | Bratislava | |
Tools | Data sets |
---|---|
Web application Analysis of linguistic data — Syllabification | How the Steel Was Tempered [Serbian] |
How the Steel Was Tempered [Croatian] | |
How the Steel Was Tempered [Ukrainian] |
Syllable frequencies |
---|
Croatian |
Russian |
Serbian |
Ukrainian |