Conference Proceeding

A logarithmic based pole-zero vocal tract model estimation for speaker verification.

Acoust. Res. Inst., Austrian Acad. of Sci., Vienna, Austria
Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on (impact factor: 4.63). 01/2011; DOI:10.1109/ICASSP.2011.5947434 pp.4820-4823 In proceeding of: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011, May 22-27, 2011, Prague Congress Center, Prague, Czech Republic
Source: DBLP

ABSTRACT In this paper we investigate the use of formant and anti formant measurements of nasal consonants for speaker verification. The features are obtained using a pole-zero vocal tract model estimate optimized by minimizing a logarithmic criterion which is motivated by the perception of amplitude by the human auditory system. A GMM-UBM approach is used for performing speaker comparisons within the likelihood-ratio framework. Results are compared with systems based on Mel Frequency Cepstral Coefficients (MFCCs) as well as formant center frequencies and bandwidths obtained using the Snack Toolkit. The formant and anti-formant based system attains comparable results to the MFCC system and outperforms the formant-based approach while offering a more straight for ward interpretation in terms of a physical speech production model.

0 0
 · 
0 Bookmarks
 · 
35 Views

Full-text (2 Sources)

View
3 Downloads
Available from
29 Jan 2013

Ewald Enzinger