Methodology for data analysis in medical sciences

3.08.00  Medical sciences  Public health (occupational safety)   

B110  Biomedical sciences  Bioinformatics, medical informatics, biomathematics, biometrics

3.05  Medical and Health Sciences  Other medical sciences 
biostatistics, survival analysis, Cox model, explained variation, frailties, relative survival, logistic regression, goodness of fit, scientometrics, OLAP, data mining, information retrieval, generating scientific hypotheses, search algorithms, scientific indicators
1.  30722  PhD Rok Blagus  Systems and cybernetics  Researcher  2009 - 2014  184 
2.  20134  PhD Mojca Čižek Sajko  Human reproduction  Researcher  2012 - 2014  106 
3.  04041  PhD Jurček Dimec  Computer science and informatics  Researcher  2009 - 2014  99 
4.  36370  PhD Matej Filip  Mathematics  Junior researcher  2013 - 2014  11 
5.  30505  PhD Michael Galsworthy  Computer science and informatics  Researcher  2009 - 2011  14 
6.  24769  PhD Gregor Gorjanc  Animal production  Researcher  2012 - 2013  473 
7.  11373  PhD Dimitar Hristovski  Computer science and informatics  Researcher  2009 - 2014  139 
8.  07705  PhD Emil Hudomalj  Public health (occupational safety)  Researcher  2009 - 2014  39 
9.  24344  PhD Nataša Kejžar  Systems and cybernetics  Researcher  2010 - 2014  145 
10.  29100  Maja Kunstelj    Technician  2012 - 2014 
11.  15355  PhD Branimir Leskošek  Public health (occupational safety)  Researcher  2009 - 2014  167 
12.  29917  PhD Lara Lusa  Public health (occupational safety)  Researcher  2009 - 2014  240 
13.  14641  PhD Mirna Macur  Sociology  Researcher  2011 - 2013  203 
14.  37491  PhD Klemen Pavlič  Public health (occupational safety)  Junior researcher  2014  24 
15.  30409  PhD Stojan Pečlin  Information science and librarianship  Researcher  2012 - 2013  18 
16.  23437  PhD Maja Pohar Perme  Public health (occupational safety)  Researcher  2009 - 2014  279 
17.  32048  PhD Anamarija Rebolj Kodre  Systems and cybernetics  Junior researcher  2009 - 2014  21 
18.  03410  PhD Drago Rudel  Medical sciences  Researcher  2009 - 2013  193 
19.  08992  PhD Janez Stare  Public health (occupational safety)  Principal Researcher  2009 - 2014  275 
20.  17837  PhD Gaj Vidmar  Systems and cybernetics  Researcher  2009 - 2014  534 
1.  0381  University of Ljubljana, Faculty of Medicine  Ljubljana  1627066  44,777 
The topic of our research program is methodology for discovering actual or possible patterns, trends and associations in medical data. These are methods for discovering new knowledge or generating new hypotheses that may lead to knowledge. The data we analyse arise mainly from research, but also from routine medical practice. In recent years, we have paid special attention to methods for generating hypotheses from bibliographic data. We are also cautiously expanding the scope of our research, presently into the field of electrical stimulation of smooth muscles and the associated electromyography. In brief, our research can be divided into three sub-fields:
1. Biostatistics
2. Scientometrics
3. Data mining in bibliographic databases
The focus of our research in biostatistics is on regression models for survival analysis, especially the Cox model. In addition to explained variation, prognostic value and frailties, which have so far been our main topics related to the Cox model, we will concentrate during the forthcoming five-year period on time-varying coefficients and on testing specific alternative hypotheses, such as crossing hazards. The presently available tests do not distinguish the latter situations from the null hypothesis. Besides the Cox model, we will continue to study intensively the field of relative survival, where we have recently developed an entirely new method. We will also extend our research beyond survival analysis by investigating methods for assessing the goodness of fit of the logistic regression model. The present approaches are based on grouping of units, which has several disadvantages. Our approach will be based on applying results from the theory of stochastic processes, especially Brownian motion.
Research in scientometrics is relatively new in general, and virtually nonexistent in Slovenia.
It is almost impossible to conduct such research without an adequate bibliographic database, so the Biomedicina Slovenica is of fundamental importance for us. Another indispensable tool is a system for automated citation analysis, which we have also developed. The third key factor is the selection of appropriate indicators, which has also been a field of our expertise for a number of years. Only the combination of these three components makes it possible to work on research evaluation, and even then the methodological problems do not end. In particular, bibliographic databases are organised in a way that prevents the use of standard data-analytic approaches. Hence, we have developed a system based on OLAP (On Line Analytical Processing) methodology that transforms data from bibliographic databases into a multidimensional orthogonal structure, which can then be analysed by means of the usual (statistical) methods. Two of our staff recently published an article on this in Scientometrics, the leading journal in the field. During the next five years, we will be mainly interested in research trends in Slovene medicine, as well as the influence of the number of authors, inter-institutional co-operation, authors' citation history and other factors on the impact of publications.
Data mining in bibliographic databases is a novel approach to browsing such databases. So far, we have developed a system for supporting biomedical discovery. The system aids researchers in creating new hypotheses, which can then be tested using established research methods. Our approach treats hypotheses as relations between biomedical concepts that have not yet been published in the scientific literature. The core of the system is the Medline bibliographic database, which in the present version is joined with the LocusLink, HUGO, OMIM and UniGene genetic databases.
This makes the system particularly useful for discovering new relations in the field of genetics, such as predicting candidate genes for a new disease.
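The idea of treating a hypothesis as a not-yet-published relation between concepts can be illustrated with a minimal sketch. It assumes the generic co-occurrence scheme often used in literature-based discovery (a starting concept X is linked to a candidate Z through intermediate concepts Y that appear with both); it is not a description of the program's actual system. The function names and the toy concept labels are hypothetical and stand in for concepts that would be extracted from Medline records.

```python
from collections import defaultdict


def cooccurrence_index(documents):
    """Map each concept to the set of concepts it shares a document with."""
    related = defaultdict(set)
    for concepts in documents:
        unique = set(concepts)
        for concept in unique:
            related[concept] |= unique - {concept}
    return related


def candidate_hypotheses(documents, start):
    """Return concepts never co-published with `start`, ranked by the
    number of intermediate concepts that link them to `start`."""
    related = cooccurrence_index(documents)
    known = related.get(start, set())      # concepts already published with `start`
    links = defaultdict(set)               # candidate -> linking concepts
    for intermediate in known:
        for candidate in related[intermediate]:
            if candidate != start and candidate not in known:
                links[candidate].add(intermediate)
    ranked = [(candidate, sorted(via)) for candidate, via in links.items()]
    # Most linking concepts first; ties broken alphabetically.
    return sorted(ranked, key=lambda item: (-len(item[1]), item[0]))


# Hypothetical toy corpus: each set holds the concepts indexed for one paper.
documents = [
    {"disease_D", "symptom_S1"},
    {"disease_D", "symptom_S2"},
    {"symptom_S1", "gene_G1"},
    {"symptom_S2", "gene_G1"},
    {"symptom_S2", "gene_G2"},
]

candidates = candidate_hypotheses(documents, "disease_D")
for candidate, via in candidates:
    print(candidate, "via", ", ".join(via))
```

In this toy corpus neither gene has been published together with the disease, so both are returned as candidate relations, with the gene supported by more linking concepts listed first. A practical system would also need concept normalisation and a better ranking measure than a raw count.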
Significance for science
Besides the general facts mentioned in the program (our work on explained variation, relative survival, multidimensional data, and the program for generating hypotheses), the importance of our research for the development of biostatistical science is best illustrated by the numerous invited lectures that we have given on different, always international, occasions. Since 1 January 2010 these were:
Janez Stare, invited talks at
- workshop on individual prediction 2010 (all lectures were invited), Oberwolfach, Germany http://www.mfo.de/admin/workshops/1662/www_view
- International Biometric Conference, Florianopolis, Brazil
Janez Stare, invited talks at universities
- Lyon 2010, two invited talks
- London 2010, two invited talks
- Milano 2010, two invited talks
Lara Lusa, invited talks at universities
- Torino 2010
- Naples 2010
This year (2011) we have already given three invited lectures, with at least two more to come at important international conferences (EMR-IBS on Crete and ISCB in Ottawa), and we are already receiving invitations for 2012.
Significance for the country
All our research is driven by our goal of providing adequate support to medical research in Slovenia. Without our own research, that support would not be what it is. Through our support for medical research, we participate in its technological development. Our collaboration with researchers in medicine has resulted in hundreds of papers over the years in journals with impact factors. Any activity with international impact strengthens national identity, and our research certainly reaches audiences that no one in Slovenia has addressed before.
