Analysis of large text datasets

Code

J2-1313 (C) - included in ARIS records

Head

PhD Dunja Mladenić

Period

1/1/1999 - 6/30/2001

Science

Engineering sciences and technologies (1)

Reseacher status

Researcher (1)
Junior expert or technical associate (0)

Education

Doctoral degree (1)

Sex

Woman (1)

Status

Employed at RO and RRD (1)

No. of publications

100–999 (1)

Projects / Programmes source: ARIS

Analysis of large text datasets

Research activity

Code	Science	Field	Subfield
2.07.07	Engineering sciences and technologies	Computer science and informatics	Intelligent systems - software

Code	Science	Field
T171	Technological sciences	Microelectronics

Keywords

machine learning, text learning, learning on the Web, information retreival

Evaluation (metodology)

Evaluation of bibliographic research performance indicators according to ARIS methodology

Citations Citations for bibliographic records in COBIB.SI that are linked to records in citation databases

Organisations (1) , Researchers (1)

0106 Jožef Stefan Institute

no.	Code	Name and surname	Research area	Role	Period	No. of publicationsNo. of publications
1.	12570	PhD Dunja Mladenić	Computer science and informatics	Head	1999 - 2001	719

Abstract

The research will be focused at the development of new and improvement of the existing computer methods for the analysis of large text datasets. Special emphasis will be put on the analysis of Slovenian text. The developed methods will enable automatic document categorization of Slovenian text, adaptation of the existing methods for text-learning to Slovenian texts, analysis of text datasets based on the new, extended document representation and better Web browsing by using a personal browsing assistant based on the new text analysis methods. The development of different applications will be enabled, including automatic updating of some existing document categorizations that are currently updated manualy, like for example, the categorization of Slovene Web documents named žMat Kurja'' or the specialized categorization of Slovenian text documents žBiomedicina Slovenica’, a national bibliography for biomedicine.

Analysis of large text datasets

Views history

Favourite

Analysis of large text datasets

FRASCATI classification

CERIF classification

Confirmation required

Views history

Favourite