Conference Paper

Dynamic segmental vector quantization in isolated-word speech recognition

Dept. of Comput. Eng., Kyung Hee Univ., Yongin-Si, South Korea
DOI: 10.1109/ISSPIT.2004.1433722 Conference: Signal Processing and Information Technology, 2004. Proceedings of the Fourth IEEE International Symposium on
Source: IEEE Xplore


The standard vector quantization (VQ) approach that uses a single vector quantizer for each entire duration of the utterance of each class suffers from the following two limitations: 1) high computational cost for large codebook sizes and 2) lack of explicit characterization of the sequential behavior. Both of two these disadvantages can be remedied by treating each utterance class as a concatenation of several information subsources, each of which is represented by a VQ codebook. With this approach, segmentation schemes obviously need to be investigated. And we call this VQ approach dynamic segmental vector quantization (DSVQ). This paper shows how to design DSVQ with some effective segmentation schemes. Better performances could be seen when applying this approach itself or mixed with hidden Markov model (HMM) in isolated-word speech recognition.

25 Reads


25 Reads
Available from