Loading...
Projects / Programmes source: ARIS

Computational Toolbox for Discovery of Prognostic Biomarkers for Survival Analysis

Research activity

Code Science Field Subfield
2.07.00  Engineering sciences and technologies  Computer science and informatics   

Code Science Field
1.02  Natural Sciences  Computer and information sciences 
Keywords
biomarkers, survival analysis, gene expression, interactive visualisations, explorative data analysis
Evaluation (rules)
source: COBISS
Points
1,765.37
A''
289.16
A'
820.96
A1/2
925.19
CI10
15,698
CImax
8,314
h10
39
A1
6.07
A3
5.34
Data for the last 5 years (citations for the last 10 years) on April 25, 2024; A3 for period 2018-2022
Data for ARIS tenders ( 04.04.2019 – Programme tender, archive )
Database Linked records Citations Pure citations Average pure citations
WoS  141  16,135  15,742  111.65 
Scopus  181  19,500  18,991  104.92 
Researchers (8)
no. Code Name and surname Research area Role Period No. of publicationsNo. of publications
1.  16324  PhD Janez Demšar  Computer science and informatics  Researcher  2021 - 2024  340 
2.  32930  Aleš Erjavec    Technical associate  2021 - 2024  12 
3.  56629  Pavlin Gregor Poličar  Computer science and informatics  Researcher  2022 - 2024  10 
4.  57109  Ela Praznik  Computer science and informatics  Researcher  2022 
5.  38461  PhD Ajda Pretnar Žagar  Computer science and informatics  Researcher  2021 - 2024  46 
6.  30142  PhD Marko Toplak  Computer science and informatics  Researcher  2021 - 2024  27 
7.  12536  PhD Blaž Zupan  Computer science and informatics  Head  2021 - 2024  531 
8.  30921  PhD Lan Žagar  Computer science and informatics  Researcher  2021 - 2024  17 
Organisations (1)
no. Code Research organisation City Registration number No. of publicationsNo. of publications
1.  1539  University of Ljubljana, Faculty of Computer and Information Science  Ljubljana  1627023  16,242 
Abstract
The proposed project will build a powerful yet intuitive toolbox to support and automate the discovery of complex prognostic biomarkers from transcriptomic data. Multi-gene biomarkers of the disease can capture patients' physiological state at the point of diagnosis or treatment decisions and are a cornerstone of precision medicine. To advance this modern paradigm of medicine in our aging society, we propose to employ methods from data science, machine learning, and data visualizations to design and implement a system for computer-aided discovery (CAD) of biomarkers. Current solutions for the discovery of prognostic biomarkers from clinical survival data consist of disparate code fragments in R or Python that require a substantial effort to integrate with knowledge bases and more work still to then be able to interactively and visually explore the results. There is a great need for comprehensive tools that would help the domain experts uncover hidden patterns and communicate the results to other stakeholders in the process, e.g., clinicians and health regulators. We propose to develop a set of computational methods and interactive model exploration techniques to democratize the science of biomarker discovery. We will focus on complex multi-gene expression (transcriptomic) biomarkers that can predict cancer patients' clinical outcomes, including overall survival and progression free survival. Our methods will be included in a tool to find and rank potential biomarker genesets and visualize the relationships between genes in a geneset using ontologies, controlled vocabulary, and other forms of curated public knowledge. This knowledge-supported discovery of biomarkers will open up the black box of a data-driven approach to biomarker discovery and make the results interpretable in the broader context of biomedical science. The project's deliverables will empower newcomers and experts from academia and the industry for faster biomarker discovery. We will achieve that by focusing on three aspects: Robust data science. Developing and open-sourcing a coherent Python library of computational approaches, including survival-based biomarker interaction analysis, biomarker maps, and heuristic search for groups of biomarkers through the integration of data and knowledge-bases.Ease of use. Implementing the tools within Orange, our established framework for visual programming and data science, with interactive visualizations that seamlessly integrate with public repositories of data and knowledge.Communication. The guiding principle for software design will be improved communication between stakeholders in the R&D. But we also aim to empower newcomers to the field with a range of training materials. The project is ambitious and challenging but builds on our previous work. The project leader has published approaches in feature interaction discovery, feature construction, data projection and mapping techniques, knowledge-based search, and intelligent data visualizations. We have been developing Orange (http://orangedatamining.com), which has a vast user base in the industry and education, to which we will add a new add-on for biomarker discovery and survival analysis. We are partnering with Genialis, an SME specializing in data science research for precision medicine. They are leaders in the area of complex transcriptomic biomarkers and are currently registering the first-ever transcriptomic biomarker for clinical use. The tools developed in this project will primarily speed-up their discovery process and simplify the communication with their partners. We also envision a joint patent as a direct result of this work. Ultimately, the tools developed in the project will be made available to a broad scientific community and will have a real potential to advance the field of precision medicine globally.
Views history
Favourite