Loading...
Projects / Programmes source: ARIS

Contemporary slovene lexicon (online language resources)

Research activity

Code Science Field Subfield
6.05.01  Humanities  Linguistics  Languages - research 

Code Science Field
H353  Humanities  Lexicology 
Keywords
lexicology, corpus linguistics, language resources, web dictionaries
Evaluation (rules)
source: COBISS
Researchers (5)
no. Code Name and surname Research area Role Period No. of publicationsNo. of publications
1.  07488  MSc Alenka Gložančev  Humanities  Researcher  2007 - 2009  126 
2.  05494  PhD Primož Jakopin  Linguistics  Researcher  2007 - 2009  221 
3.  28437  PhD Mija Michelizza  Linguistics  Junior researcher  2007 - 2009  122 
4.  17538  Rožamarija Uršič    Technical associate  2007 - 2009 
5.  12845  PhD Andreja Žele  Linguistics  Head  2007 - 2009  861 
Organisations (1)
no. Code Research organisation City Registration number No. of publicationsNo. of publications
1.  0618  Research Centre of the Slovenian Academy of Sciences and Arts  Ljubljana  5105498000  63,142 
Abstract
Contemporary slovene lexicon in Dictionary of Contemporary lexicon (DCL) involves about 6200 lexems – lemmas/entries and near 1500 collocations as lemmas/entries. This fund of words is base for further lexicological work and at the same time base for new Dictionary of standard Slovenian, in the first place as addition for Dictionary of Standard Slovenian. Collected and selected one- or multi-words lemmas/entries to tend to development of Slovenian from the word-formational, semantic and syntactic aspect. We may find out more and more intensitive comparative-contrastive relations between Slovenian and English. In the framework of the Target research project Online List of Slovenian Words (2004-2006) 355.000 slovenian headwords have been gathered, including the Dictionary of Standard Slovenian, Dictionary of Lesser Used Slovenian Words, Nova beseda text corpus and index of the slovenian web search engine NAJDI.SI. Headword entries include basic part-of-speech description and links to relevant web pages, where the entries can be observed in context or where they are explained in more detail. There are however 81.000 entries in the list, all from the Dictionary of Lesser Used Slovenian Words (178.000 headwords, with part-of-speech, but without explanation or usage examples), where no internet link could be found. The proposed project therefore includes research, directed to enhancement of the 355.000 slovenian entries with links to the relevant entries of the english version of Wikipedia, the free web encyclopaedia (1.400.000 entries – Slovenian Wikipedia only contains 32.000 entries). This effort would anchor Slovenian words to the world’s largest word knowledge resource and foster current and future cross language (Slovenian-English-Slovenian) research. Part of the project, especially targeted to no-link entries, is the research of a robust algorithm that would split compound Slovenian words, into prefixes and common endings, which could be correctly machine-translated to English counterparts.
Significance for science
The aim of these new recources od lexicon is The Glossary for the new Dictionary of Contemporary lexicon (DCL), which involves about 6200 lexems – lemmas/entries and near 1500 collocations as lemmas/entries and which will bi published in 2010, is the same time the fund of words or the base for further lexicological work and at the same time is the lexicon base for new Dictionary of standard Slovenian, in the first place as addition for Dictionary of Standard Slovenian. The concrect result of this project had been published with the same title in may 2009.
Significance for the country
Applications of results of the proposed research project (as dictionaries of Standard Language, text corpus) are possible on the Institute's web pages. This kind of Institute's basic research and materials (basic Slovenian language resources, glossaries, corpus, data base) into the Slovenian language, which are unique in, are important for the development of linguistics here and at an international level. Research results are also useful in various professions, particularly for development of new terminology and monolingual dictionaries of Standard Slovenian and for terminological dictionaries. This kind of compiling linguistic materials and basic research into Slovenian language, which are unique in, are important for development of society and research results are useful for various professions, particularly for development of new terminology.
Most important scientific results Annual report 2008, final report, complete report on dLib.si
Most important socioeconomically and culturally relevant results Annual report 2008, final report, complete report on dLib.si
Views history
Favourite