Conference Proceeding

Evaluation of Bangla word recognition performance using acoustic features

01/2011; DOI:10.1109/ICCAIE.2010.5735130 pp.490 - 494 In proceeding of: Computer Applications and Industrial Electronics (ICCAIE), 2010 International Conference on
Source: IEEE Xplore

ABSTRACT In this paper, we have prepared a medium size Bangla speech corpus and compare performances of different acoustic features for Bangla word recognition. Most of the Bangla automatic speech recognition (ASR) system uses a small number of speakers, but 40 speakers selected from a wide area of Bangladesh, where Bangla is used as a native language, are involved here. In the experiments, mel-frequency cepstral coefficients (MFCCs) and local features (LFs) are inputted to the hidden Markov model (HMM) based classifiers for obtaining word recognition performance. From the experiments, it is shown that MFCC-based method of 39 dimensions provides a higher word correct rate (WCR) than the other methods investigated. Moreover, a higher WCR is obtained by the MFCC39-based method with fewer mixture components in the HMM.

0 0
 · 
1 Bookmark
 · 
61 Views

Full-text (2 Sources)

View
4 Downloads
Available from
4 Jan 2013

Keywords

39 dimensions
 
ASR
 
Bangla automatic speech recognition
 
Bangla word recognition
 
classifiers
 
different acoustic features
 
hidden Markov model
 
higher WCR
 
higher word correct rate
 
medium size Bangla speech corpus
 
mel-frequency cepstral coefficients
 
MFCC39-based method
 
mixture components
 
native language
 
performances
 
wide area
 
word recognition performance