Conference Proceeding

Joint map adaptation of feature transformation and Gaussian Mixture Model for speaker recognition

Inst. for Infocomm Res., A*Star, Singapore
Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on (impact factor: 4.63). 05/2009; DOI:10.1109/ICASSP.2009.4960516 pp.4045 - 4048 In proceeding of: Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Source: DBLP

ABSTRACT This paper extends our previous work on feature transformation-based support vector machines for speaker recognition by proposing a joint MAP adaptation of feature transformation (FT) and Gaussian Mixture Models (GMM) parameters. In the new approach, the prior probability density functions (PDFs) of FT and GMM parameters are jointly estimated using the background data under the maximum likelihood criteria. In this way, we derive a generic prior GMM that is more compact than the Universal Background Model due to the reduction of speaker variations. With the prior PDFs, we construct a supervector to characterize a speaker using FT and GMM parameters. We conducted experiments on NIST 2006 Speaker Recognition Evaluation (SRE06) data set. The results validated the effectiveness of the joint MAP adaptation approach.

0 0
 · 
0 Bookmarks
 · 
30 Views
  • Source
    Article: An overview of text-independent speaker recognition: From features to supervectors
    [show abstract] [hide abstract]
    ABSTRACT: This paper gives an overview of automatic speaker recognition technology, with an emphasis on text-independent recognition. Speaker recognition has been studied actively for several decades. We give an overview of both the classical and the state-of-the-art methods. We start with the fundamentals of automatic speaker recognition, concerning feature extraction and speaker modeling. We elaborate advanced computational techniques to address robustness and session variability. The recent progress from vectors towards supervectors opens up a new area of exploration and represents a technology trend. We also provide an overview of this recent development and discuss the evaluation methodology of speaker recognition systems. We conclude the paper with discussion on future directions.
    Speech Communication.

Full-text (2 Sources)

View
0 Downloads
Available from

Keywords

feature transformation
 
feature transformation-based support vector machines
 
Gaussian Mixture Models
 
joint MAP adaptation
 
joint MAP adaptation approach
 
maximum likelihood criteria
 
NIST 2006 Speaker Recognition Evaluation
 
prior probability density functions
 
results validated
 
speaker recognition
 
speaker variations
 
Universal Background Model