Conference Proceeding
Joint map adaptation of feature transformation and Gaussian Mixture Model for speaker recognition
Inst. for Infocomm Res., A*Star, Singapore
Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on (impact factor:
4.63).
05/2009;
DOI:10.1109/ICASSP.2009.4960516
pp.4045 - 4048 In proceeding of: Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Source: DBLP
-
Citations (0)
- Cited In (1)
-
Article: An overview of text-independent speaker recognition: From features to supervectors
[show abstract] [hide abstract]
ABSTRACT: This paper gives an overview of automatic speaker recognition technology, with an emphasis on text-independent recognition. Speaker recognition has been studied actively for several decades. We give an overview of both the classical and the state-of-the-art methods. We start with the fundamentals of automatic speaker recognition, concerning feature extraction and speaker modeling. We elaborate advanced computational techniques to address robustness and session variability. The recent progress from vectors towards supervectors opens up a new area of exploration and represents a technology trend. We also provide an overview of this recent development and discuss the evaluation methodology of speaker recognition systems. We conclude the paper with discussion on future directions.Speech Communication.
Data provided are for informational purposes only. Although carefully collected, accuracy cannot be guaranteed.
The impact factor represents a rough estimation of the journal's impact factor and does not reflect the actual
current impact factor.
Publisher conditions are provided by RoMEO. Differing provisions from the publisher's actual policy or licence
agreement may be applicable.
Keywords
feature transformation
feature transformation-based support vector machines
Gaussian Mixture Models
joint MAP adaptation
joint MAP adaptation approach
maximum likelihood criteria
NIST 2006 Speaker Recognition Evaluation
prior probability density functions
results validated
speaker recognition
speaker variations
Universal Background Model