Digital humanities: resources, tools and methods

Code

P6-0436 (A) - included in ARIS records

Head

PhD Darja Fišer

Science

Engineering sciences and technologies (7)
Social sciences (1)
Humanities (16)
Other (1)

Reseacher status

Researcher (24)
Junior expert or technical associate (1)

Education

Doctoral degree (14)
Doctorate/foreign document (2)
Other (9)

Sex

Woman (10)
Man (15)

Status

Employed at RO and RRD (23)
No data on employment in RO (1)
Retired (1)

No. of publications

0 (1)
1–9 (1)
10–99 (13)
100–999 (10)

Projects / Programmes source: ARIS

Digital humanities: resources, tools and methods

Periods

January 1, 2022 - December 31, 2027

Research activity

Code	Science	Field	Subfield
6.05.00	Humanities	Linguistics
2.07.00	Engineering sciences and technologies	Computer science and informatics

Code	Science	Field
6.02	Humanities	Languages and Literature
1.02	Natural Sciences	Computer and information sciences

Keywords

Digital Humanities, Digital Editions, Historical Collections, Oral History, Corpus Linguistics, Machine Learning, Mixed Methods, Language Technologies, Speech Technologies, Computer Vision

Evaluation (metodology)

Evaluation of bibliographic research performance indicators according to ARIS methodology

Points

13,730.61

A''

3,682.72

7,591.23

A1/2

8,734.67

CI10

1,573

CImax

h10

48.3

7.64

Data for the last 5 years (citations for the last 10 years) on June 24, 2026; Data for score A3 calculation refer to period 2020-2024

Data for ARIS tenders ( 04.04.2019 – Programme tender, archive )

Citations Citations for bibliographic records in COBIB.SI that are linked to records in citation databases

Database	Linked records	Citations	Pure citations	Average pure citations
WoS	148	1,045	923	6.24
Scopus	289	2,242	1,956	6.77

Organisations (3) , Researchers (27)

0501 Institute for Contemporary History

no.	Code	Name and surname	Research area	Role	Period	No. of publicationsNo. of publications
1.	53681	PhD Ksenija Bogetić Pejović	Linguistics	Researcher	2023 - 2026	93
2.	61674	Agustin Cosovschi, Ph.D.	Historiography	Researcher	2025 - 2026	61
3.	57130	Filip Dobranić	Linguistics	Researcher	2022 - 2026	42
4.	57652	Bojan Evkoski	Linguistics	Researcher	2023	0
5.	26294	PhD Darja Fišer	Linguistics	Head	2022 - 2026	436
6.	60185	Adela Hincu, Ph.D.	Historiography	Researcher	2026	22
7.	56952	PhD Ganna Kryvenko	Linguistics	Researcher	2022 - 2026	77
8.	50983	PhD Jakob Lenardič	Linguistics	Researcher	2022 - 2026	75
9.	16350	PhD Andrej Pančur	Historiography	Researcher	2022 - 2026	290
10.	08411	PhD Jurij Perovšek	Historiography	Retired researcher	2022	870
11.	38461	PhD Ajda Pretnar Žagar	Computer science and informatics	Researcher	2022 - 2026	63
12.	56554	Jure Skubic	Sociology	Researcher	2023 - 2026	37
13.	17106	PhD Mojca Šorn	Historiography	Researcher	2022 - 2026	208

0581 University of Ljubljana, Faculty of Arts

no.	Code	Name and surname	Research area	Role	Period	No. of publicationsNo. of publications
1.	55593	David Bordon	Linguistics	Young researcher	2022 - 2026	21
2.	14681	PhD Vojko Gorjanc	Linguistics	Researcher	2022 - 2026	514
3.	50983	PhD Jakob Lenardič	Linguistics	Researcher	2022	75
4.	61704	Tina Munda	Linguistics	Researcher	2026	36
5.	53255	Kristina Pahor de Maiti Tekavčič	Linguistics	Researcher	2022 - 2026	53
6.	33783	PhD Damjan Popič	Linguistics	Researcher	2024 - 2026	138
7.	56554	Jure Skubic	Sociology	Researcher	2022 - 2023	37

1539 University of Ljubljana, Faculty of Computer and Information Science

no.	Code	Name and surname	Research area	Role	Period	No. of publicationsNo. of publications
1.	30062	PhD Ciril Bohak	Computer science and informatics	Researcher	2024 - 2026	243
2.	59036	Eva Boneš	Computer science and informatics	Researcher	2025 - 2026	12
3.	56644	Andraž Juvan		Researcher	2024 - 2026	7
4.	16131	PhD Alenka Kavčič	Computer science and informatics	Researcher	2022 - 2026	158
5.	15677	PhD Matija Marolt	Computer science and informatics	Researcher	2022 - 2026	471
6.	35071	PhD Matevž Pesek	Computer science and informatics	Researcher	2022 - 2026	240
7.	59790	Uroš Šmajdek	Computer science and informatics	Young researcher	2024 - 2026	18

Abstract

Digitisation of cultural heritage in Slovenia falls below the European average. With the exception of corpus linguistics and literary history where richly annotated corpora and critical editions are plentiful and internationally renowned, datalization work on DH collections has been highly fragmented and largely incompatible due to different encoding standards used. This clearly shows that the requirements of supporting state-of-the art data-driven DH research has not yet been met in Slovenia which significantly hinders research in the national context but also its international visibility. The research programme addresses three broad research problems: 1) the development and integration of advanced workflows for creating and publishing reliable, interpretable, interoperable and richly annotated complex digital editions; 2) the development and promotion of novel interdisciplinary quantitative and qualitative methods for Slovenian DH; and 3) the development and evaluation of advanced technologies for the processing, enrichment and visualization of historical heterogeneous multilingual and multimodal data. The scientific framework of the proposed research programme is highly interdisciplinary but rooted in corpus and computational linguistics and machine learning of heterogeneous structured and unstructured collections from Slovenian contemporary history, periodical studies, political studies and anthropology that are both mono- and multilingual, and contain textual, speech and image data and metadata. Developing and integrating these methods into Slovene DH is crucial because of their importance and potential to contribute to a comprehensive understanding of past and present cultural phenomena in the European context. The programme will also contribute novel technologies to support digitization of DH research data and cultural heritage. The programme will perform the complete cycle of DH research activities: capture, organization and storage; enrichment; analysis, visualization, interpretation, and dissemination of the results. While the methods and technologies developed within the programme are not limited to specific data collections, the programme will contribute 6 major new open-source datasets containing text, speech and images. It will improve methods for digitizing complex historical documents, develop novel methods for transcribing oral history recordings, and engage in development of internationally novel methods for processing images in historical documents. The programme will adapt language and text enrichment technologies to historical and dialectal language as well as develop support for exploring multilingual documents important for investigations of Slovenian history. The programme will develop advanced mixed methods for network, geospatial and temporal analysis of DH data.

Significance for science

The impact of the proposed digital scholarship research programme will be manifold. First and foremost, it will engage in traditional humanities research in a significant new way but will also generate novel research questions, methodological approaches, findings and theoretical paradigms at the interface between language-driven analysis of historical multimodal data and data science. Beyond ensuring preservation and accessibility of cultural heritage data, it will enable the enrichment of DH data and metadata using state-of-the-art data science methods. The programme will also address the exploitation of the contents of the created digital resources and the adaptation and development of appropriate language technologies to search and retrieve information from Big Data of the Past. Considering the fact that the members of the research group are already involved in international DH research, especially in DH text and speech processing, we expect that the results will achieve international impact and recognition and be relevant for other languages. The proposed programme will solve open and topical problems with long-term goals which have a high scientific and technological potential. It will promote critical, transparent and reproducible research in humanities, covering data, code, workflows, methods and documentation. Successful implementation of the presented challenges requires strong cooperation between humanities and data science experts. By engaging in cross-domain knowledge-transfer and promoting an interdisciplinary approach focusing on new digital methods and tools for digital humanities research and teaching, it will have a lasting disciplinary impact.

Significance for the country

The proposed programme will importantly contribute to the infrastructure for cultural heritage, which is also addressed by the following objectives of the Resolution of the national programme for language policy 2021-2025: digitization, description, preservation and open access of cultural heritage Slovene language materials. The resources, tools and methods developed in the programme will be woven into humanities and data science curricula, which will develop interdisciplinary profiles of human data analysts that are in great demand in the public sector but also in the industry. The results of the project will also enable cultural innovation and algorithmic creativity in the field of representation and mediation of the arts and humanities.  With the open-source historical digital collections, language models, training sets and toolchains that will enable automated handling of complex documents, audio archives and images containing non-canonical Slovene, the proposed programme will close the gap on processing of non-canonical (historical, dialectal, multilingual) language data that is important in today's information society and economy. The urgent need for digitisation of all aspects of the society and economy has been recognized by the newly established national Strategic council for digitization which shares many objectives of the proposed research programme.

Digital humanities: resources, tools and methods

Views history

Favourite

Digital humanities: resources, tools and methods

FRASCATI classification

FORD classification

Confirmation required

Views history

Favourite