Conference Proceeding

Computing Mel-frequency cepstral coefficients on the power spectrum

Lehrstuhl fur Inf. VI, Rheinisch-Westfalische Tech. Hochschule Aachen
Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on (impact factor: 4.63). 02/2001; DOI:10.1109/ICASSP.2001.940770 ISBN: 0-7803-7041-4 pp.73 - 76 vol.1 In proceeding of: Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on, Volume: 1
Source: IEEE Xplore

ABSTRACT We present a method to derive Mel-frequency cepstral coefficients
directly from the power spectrum of a speech signal. We show that
omitting the filterbank in signal analysis does not affect the word
error rate. The presented approach simplifies the speech recognizers
front end by merging subsequent signal analysis steps into a single one.
It avoids possible interpolation and discretization problems and results
in a compact implementation. We show that frequency warping schemes like
vocal tract normalization can be integrated easily in our concept
without additional computational efforts. Recognition test results
obtained with the RWTH large vocabulary speech recognition system are
presented for two different corpora: The German VerbMobil II dev99
corpus, and the English North American Business News 94 20k development
corpus

0 0
 · 
0 Bookmarks
 · 
22 Views
  • Source
    Article: Different techniques and algorithms for biomedical signal processing
    [show abstract] [hide abstract]
    ABSTRACT: This paper is intended to give a broad overview of the complex area of biomedical and their use in signal processing. It contains sufficient theoretical materials to provide some understanding of the techniques involved for the researcher in the field. This paper consists of two parts: feature extraction and pattern recognition. The first part provides a basic understanding as to how the time domain signal of patient are converted to the frequency domain for analysis. The second part provides basic for understanding the theoretical and practical approaches to the development of neural network models and their implementation in modeling biological system.
  • Source
    Article: FPGA Implementation of a Pipelined Gaussian Calculation for HMM-Based Large Vocabulary Speech Recognition
    International Journal of Reconfigurable Computing. 01/2011;
  • Source
    Article: SPEECH ENHANCEMENT USING PITCH DETECTION APPROACH FOR NOISY ENVIRONMENT
    [show abstract] [hide abstract]
    ABSTRACT: Acoustical mismatch among training and testing phases degrades outstandingly speech recognition results. This problem has limited the development of real-world nonspecific applications, as testing conditions are highly variant or even unpredictable during the training process. Therefore the background noise has to be removedfrom the noisy speech signal to increase the signal intelligibility and to reduce the listener fatigue. Enhancement techniques applied, as pre-processing stages; to the systems remarkably improve recognition results. In this paper, a novel approach is used to enhance the perceived quality of the speech signal when the additive noise cannot be directly controlled. Instead of controlling the background noise, we propose to reinforce the speech signal so that it can be heard more clearly in noisy environments.The subjective evaluation shows that the proposed method improves perceptual quality of speech in various noisy environments. As in some cases speaking may be more convenient than typing, even for rapid typists: many mathematical symbols are missing from the keyboard but can be easily spoken and recognized. Therefore, the proposed system can be used in an application designed for mathematical symbol recognition (especially symbols not available on the keyboard) in schools.
    International Journal of Engineering Science and Technology. 01/2011;

Full-text

View
0 Downloads
Available from

Keywords

additional computational efforts
 
derive Mel-frequency cepstral coefficients
 
discretization problems
 
frequency warping schemes
 
power spectrum
 
presented approach simplifies
 
Recognition test results
 
RWTH large vocabulary speech recognition system
 
speech recognizers
 
vocal tract normalization
 

S. Molau