Marko Kos

Marko Kos
University of Maribor | UM · Faculty of Electrical Engineering and Computer Science (FERI)

PhD

About

26
Publications
4,924
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
208
Citations
Introduction
Marko Kos currently works at the Faculty of Electrical Engineering and Computer Science (FERI), University of Maribor. Marko does research in Digital signal processing, Acoustics segmentation, and Pattern recognition. Their most recent publication is 'A speech-based distributed architecture platform for an intelligent ambience.'
Additional affiliations
November 2006 - present
University of Maribor
Position
  • Asistant

Publications

Publications (26)
Article
Full-text available
The purpose of this article is to present the use of a previously validated wearable sensor device, Armbeep, in a real-life application, to enhance a tennis player’s training by monitoring and analysis of the time, physiological, movement, and tennis-specific workload and recovery indicators, based on fused sensor data acquired by the wearable sens...
Article
Full-text available
The presented paper describes a hardware-accelerated field programmable gate array (FPGA)–based solution capable of real-time stereo matching for temporal statistical pattern projector systems. Modern 3D measurement systems have seen an increased use of temporal statistical pattern projectors as their active illumination source. The use of temporal...
Chapter
For monitoring the progress of athletes in various sports and disciplines, several different approaches are nowadays available. Recently, miniature wearables have gained popularity for this task due to being lightweight and typically cheaper than other approaches. They can be positioned on the athlete’s body, or in some cases, the devices are incor...
Article
In this paper, a novel balancing circuit is proposed, which is well suited to aerospace electronics, as it allows for the use of off-the-shelf radiation tolerant components. The new design was examined analytically with and without a zero- voltage switching topology. Its performance was compared to other popular battery balancing circuits. Further,...
Article
In the paper, a speech-based platform for intelligent ambience and/or supportive environment applications is presented. The platform has a distributed architecture, which enables extended connectivity and support for multiple intelligent ambience services. The mobile unit Genesis is an integral part of the distributed platform, enabling interaction...
Article
Full-text available
This paper presents a miniature wearable device and a system for detecting and recording the movement and biometric information of a user during sport activities. The wearable device is designed to be worn on a wrist and can monitor skin temperature and pulse rate. Furthermore it can monitor arm movement and detect gestures using inertial measureme...
Conference Paper
This paper presents work related to tennis stroke detection and classification. For arm movement acquisition a miniature wearable IMU device, positioned on the player's forearm (right above the wrist) is proposed and presented. The device uses a MEMS-based accelerometer and gyroscope with 6-DOF. For reliable and accurate tennis stroke detection the...
Article
This paper presents novel features and an architecture for an automatic on-line acoustic classification and segmentation system. The system includes speech/non-speech segmentation (with the emphasis on accurate speech/music segmentation), gender segmentation, and speech bandwidth segmentation. This automatic segmentation system can be easily integr...
Article
This paper introduces a nonlinear function into the frequency spectrum that improves the detection of vowels, diphthongs, and semivowels within the speech signal. The lower efficiency of consonant detection was solved by implementing the hangover and hangbefore criteria. This paper presents a procedure for faster definition of those optimal constan...
Article
Full-text available
This study presents a new online method for speaker segmentation and clustering in real-world environments. It analyses and discusses the difficulties of online speaker diarisation and proposes a new segmentation and clustering method, in which the Bayesian information criterion (BIC) and the normalised cross-likelihood ratio (NCLR) are combined in...
Article
In this paper the segmentation of the Aurora 2 database with three different types of models is presented. The segmentation is based on speech recognition results obtained by tests on the Aurora 2 database. Three types of tests are performed. In the first test the speech units are words (16 state HMMs) and in the second test the speech units are mo...
Article
This paper presents speaker gender classification and segmentation. Such classification is frequently used in broadcast news domain. Because pitch is a feature that is difficult to calculate reliably in noisy environment, and because telephone speech is present in broadcast material, we focused on using general acoustic features for gender discrimi...
Article
Full-text available
This paper presents a novel feature for online speech/music segmentation based on the variance mean of filter bank energy (VMFBE). The idea that encouraged the feature’s construction is energy variation in a narrow frequency sub-band. The energy varies more rapidly, and to a greater extent for speech than for music. Therefore, an energy variance in...
Article
In this paper the influence of hangover and hangbefore criteria on automatic speech recognition is presented. Voice activity detection (VAD) algorithm is nowadays almost always part of automatic speech recognition systems. Hangover and hangbefore criteria can be integrated into VAD algorithm after basic VAD decision. Hangover and hangbefore criteri...
Article
The paper analyses the influence of speech/non-speech segmentation on on-line and off-line speaker segmentation accuracy. On-line and off-line speaker segmentation approaches together with speaker diarization are shortly reviewed and popular "state of the art" test systems are presented. Both systems are tested on a given test set with and without...
Article
This paper presents novel feature-group for on-line speech/music segmentation for broadcast news domain. The features are based on mel-frequency cepstral coefficients variance (MFCCV). The idea behind the feature-group construction is the energy variation in a narrow frequency sub-band. The variation is bigger for speech than for music. For feature...
Conference Paper
This paper addresses the topic of online unsupervised speaker segmentation in a complex audio environment as it is present in the Broadcast News databases. A new two stage speaker change detection algorithm is proposed, which combines the Bayesian Information Criterion with an ABLS-SCD statistical framework where adapted Gaussian mixture models are...
Conference Paper
This paper addresses the topic of unsupervised speaker segmentation for automatic speech recognition in a complex real life environment like broadcast news domain. A statistical approach where a universal background model (UBM) is applied for online speaker segmentation was compared with the widely used Bayesian information criterion (BIC) approach...
Conference Paper
In this paper we present research work that was carried out on Slovenian BNSI Broadcast News database regarding the speech bandwidth classification. Speech recorded in studio environment has frequency bandwidth of 8 kHz, while speech recorded over telephone channel has the bandwidth of 3.1 kHz. Speech bandwidth classification enables us to use sepa...
Conference Paper
In this paper we present basic ideas of noise reduction for robust speech recognition using minimum statistic algorithm and VAD based on neural networks. Noise estimation is based on minimum statistic procedure and noise subtraction in spectral space is performed based on neural network VAD output. For noise subtraction two different subtraction fa...

Network

Cited By