Loading...
Projects / Programmes source: ARIS

FORMANT COMBINATORICS IN SLOVENIAN

Research activity

Code Science Field Subfield
6.05.00  Humanities  Linguistics   

Code Science Field
6.02  Humanities  Languages and Literature 
Keywords
Word Formation, Word Families, Semantic Functions of Morphemes, Morphotactics, Digital Humanities, Deep Learning, Natural Language Learning, Slovenian Language
Evaluation (rules)
source: COBISS
Points
9,659.55
A''
2,082.69
A'
4,010.52
A1/2
5,010.82
CI10
1,478
CImax
75
h10
21
A1
30.23
A3
4.04
Data for the last 5 years (citations for the last 10 years) on April 24, 2024; A3 for period 2018-2022
Data for ARIS tenders ( 04.04.2019 – Programme tender, archive )
Database Linked records Citations Pure citations Average pure citations
WoS  95  667  585  6.16 
Scopus  238  2,127  1,778  7.47 
Researchers (11)
no. Code Name and surname Research area Role Period No. of publicationsNo. of publications
1.  05023  PhD Tomaž Erjavec  Linguistics  Researcher  2021 - 2024  636 
2.  32204  PhD Boris Kern  Linguistics  Head  2021 - 2024  279 
3.  29395  PhD Nina Ledinek  Linguistics  Researcher  2021 - 2024  132 
4.  50070  PhD Matej Martinc  Linguistics  Researcher  2021 - 2022  84 
5.  53800  Andraž Pelicon  Linguistics  Researcher  2021 - 2024  47 
6.  29539  PhD Vid Podpečan  Computer science and informatics  Researcher  2023 - 2024  103 
7.  31844  PhD Senja Pollak  Linguistics  Researcher  2021 - 2024  288 
8.  56524  Marko Pranjić  Linguistics  Researcher  2023 - 2024  24 
9.  00299  PhD Irena Stramljič Breznik  Humanities  Researcher  2021 - 2024  646 
10.  33264  PhD Ines Voršič  Linguistics  Researcher  2021 - 2024  95 
11.  12845  PhD Andreja Žele  Linguistics  Researcher  2021 - 2024  859 
Organisations (3)
no. Code Research organisation City Registration number No. of publicationsNo. of publications
1.  0618  Research Centre of the Slovenian Academy of Sciences and Arts  Ljubljana  5105498000  62,985 
2.  0106  Jožef Stefan Institute  Ljubljana  5051606000  90,724 
3.  2565  University of Maribor Faculty of Arts  Maribor  5089638050  33,023 
Abstract
Slovenian, like other Slavic languages, is characterized by an extremely rich morphemic structure of words, which is a result of multistage word formation; for example, in the first stage, the adjective mlad ‘young’ yields the noun mladost ‘youth’, which in turn yields the adjective mladosten ‘youthful’ in the second stage, which in turn yields the noun mladostnik ‘adolescent’ in the third stage, which in turn yields the possessive adjective mladostnikov ‘adolescent’s’ in the fourth stage. This example shows the compatibility of four suffixal formants: -ost + -en + -ik + -ov. The compatibility of formants is considered to be the ability of different word-formation morphemes to coexist within multistage formation, taking into account the semantic-extension aspect. The proposed project focuses on the analysis and description of the compatibility of wordformation morphemes (formants) within multistage formation (mlad › mladost › mladosten › mladostnik › mladostnikov), which will generate a new research field in Slovenian linguistics: morphotactics. This is an innovation because this field does not yet exist in Slovenian linguistics. Such an analysis – by determining (a) the systemic predictability of formation in terms of the compatibility of suffixal formants and (b) its limitations – will make it possible to present the characteristics of word-formation and semantic-extension mechanisms of Slovenian, on contemporary language material, including all contemporary Slovenian dictionaries and corpora, by integrating the most state-of-the-art research methods in linguistics and language technologies (including deep learning). In so doing, we will perform the first comprehensive analysis of Slovenian word formation in 70 years. The language technology objective of the project is a pioneering compilation of the first training set and the first language technology application allowing automatic morpheme segmentation of Slovenian words. This is also of key importance for the development of semantic language resources and language technologies for Slovenian, and it is of course undoubtedly important for linguistics as well.
Views history
Favourite