The paper presents the efforts of the LUKS research group from the University of Ljubljana in the Interspeech 2011 Speaker State Challenge. Both systems, for the Intoxication and the Sleepiness Sub-Challenge, are based on a Universal Background Model (UBM) in the form of a Hidden Markov Model (HMM) and on Maximum A Posteriori (MAP) adaptation. By combining our HMM-UBM-MAP derived supervectors with selected statistical functionals from the baseline feature set, we were able to surpass the baseline system in both sub-challenges. Majority voting fusion of the best systems improved the performance further. Our best result on the test set is 67.46% in the Intoxication Sub-Challenge and 71.28% in the Sleepiness Sub-Challenge. The recognition results of both systems placed us second among the Challenge participants.
B.03 Paper at an international scientific conference COBISS.SI-ID: 8580692
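The majority voting fusion mentioned in the abstract above can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `majority_vote`, the label strings, and the tie-breaking rule (the earliest-listed system wins a tie) are assumptions made only for illustration.

```python
from collections import Counter


def majority_vote(predictions):
    """Fuse per-system label sequences by majority vote.

    predictions: list of label lists, one list per system, all the same
    length. Ties are broken in favour of the earliest-listed system
    (an assumption; the abstract does not state a tie-breaking rule).
    """
    if not predictions:
        raise ValueError("at least one system is required")
    n_items = len(predictions[0])
    if any(len(p) != n_items for p in predictions):
        raise ValueError("all systems must label the same items")

    fused = []
    for labels in zip(*predictions):  # labels given to one item by all systems
        counts = Counter(labels)
        top = max(counts.values())
        # Take the first label (in system order) that reaches the top count.
        fused.append(next(label for label in labels if counts[label] == top))
    return fused


if __name__ == "__main__":
    # Hypothetical outputs of three systems for three utterances.
    systems = [
        ["intoxicated", "sober", "sober"],
        ["intoxicated", "intoxicated", "sober"],
        ["sober", "intoxicated", "sober"],
    ]
    print(majority_vote(systems))
```

With an odd number of systems and two classes, as in this example, ties cannot occur, so the tie-breaking assumption only matters for other configurations.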
The paper investigates time- and acoustic-mediated alignment algorithms that can be used for better speech recognition evaluation. The edit-cost function, which weights the cost of speech unit matches, substitutions, deletions and insertions, is defined as a function of timed symbols or even of speech signal segments. The algorithms are compared using several classical statistical measures of different types, which are derived from speech recognition confusion matrices and are normally used to measure the agreement between different classifications of the same set of objects. These measures provide a reasonable indication that the investigated algorithms yield more relevant speech recognition error statistics than the algorithms commonly used for this purpose.
B.03 Paper at an international scientific conference COBISS.SI-ID: 8580948
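The idea of an edit-cost function defined over timed symbols, as described in the abstract above, can be sketched with a small dynamic-programming alignment. The abstract does not give the actual cost function, so everything specific here is an assumption: the `(label, start, end)` tuple layout, the overlap-based `pair_cost`, the constant `INS_DEL_COST`, and the function names are hypothetical.

```python
INS_DEL_COST = 1.0  # assumed constant cost for an insertion or a deletion


def overlap_fraction(a, b):
    """Temporal overlap of two timed symbols (label, start, end), in [0, 1]."""
    inter = max(0.0, min(a[2], b[2]) - max(a[1], b[1]))
    span = max(a[2], b[2]) - min(a[1], b[1])
    return inter / span if span > 0 else 0.0


def pair_cost(ref, hyp):
    """Hypothetical time-mediated cost of pairing a reference and a hypothesis unit."""
    time_penalty = 1.0 - overlap_fraction(ref, hyp)
    label_penalty = 0.0 if ref[0] == hyp[0] else 1.0
    return time_penalty + label_penalty


def align(ref, hyp):
    """Return (total cost, operations) of the cheapest alignment of ref and hyp."""
    n, m = len(ref), len(hyp)
    cost = [[0.0] * (m + 1) for _ in range(n + 1)]
    back = [[None] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        cost[i][0], back[i][0] = i * INS_DEL_COST, "del"
    for j in range(1, m + 1):
        cost[0][j], back[0][j] = j * INS_DEL_COST, "ins"
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            candidates = [
                (cost[i - 1][j - 1] + pair_cost(ref[i - 1], hyp[j - 1]), "pair"),
                (cost[i - 1][j] + INS_DEL_COST, "del"),
                (cost[i][j - 1] + INS_DEL_COST, "ins"),
            ]
            cost[i][j], back[i][j] = min(candidates, key=lambda c: c[0])

    # Trace back from the end to recover the sequence of edit operations.
    ops, i, j = [], n, m
    while i > 0 or j > 0:
        move = back[i][j]
        if move == "pair":
            kind = "match" if ref[i - 1][0] == hyp[j - 1][0] else "sub"
            ops.append((kind, ref[i - 1], hyp[j - 1]))
            i, j = i - 1, j - 1
        elif move == "del":
            ops.append(("del", ref[i - 1], None))
            i -= 1
        else:
            ops.append(("ins", None, hyp[j - 1]))
            j -= 1
    ops.reverse()
    return cost[n][m], ops


if __name__ == "__main__":
    reference = [("a", 0.0, 1.0), ("b", 1.0, 2.0)]
    hypothesis = [("a", 0.0, 1.0), ("c", 1.0, 2.0)]
    total, operations = align(reference, hypothesis)
    print(total, [op[0] for op in operations])
```

The match, substitution, deletion and insertion counts produced by such an alignment are what a confusion matrix would be built from. Because the pairing cost grows as temporal overlap shrinks, the alignment prefers to pair units that occur at the same time rather than units that merely share a position in the sequence.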
A novel 3D triangulation-based imaging method and device are proposed, which have the following advantages over known state-of-the-art methods: they consume little energy, allow uninterrupted use of several devices in the same room, and are robust to illumination changes and disturbances.
F.32 International patent COBISS.SI-ID: 8985172