Conference Proceeding

Text-Independent Speaker Identification Using Vocal Tract Length Normalization for Building Universal Background Model

01/2009; In proceeding of: Interspeech, At U.K, Brighton
Source: DBLP

ABSTRACT In this paper, we propose to use Vocal Tract Length Normalization (VTLN) to build the Universal
Background Model (UBM) for a closed set speaker identification system.
Vocal Tract Length (VTL) differences among speakers is a major source of variability in the speech signal.
Since the UBM model is trained using data from many speakers,
it statistically captures this inherent variation in the speech signal,
which results in a ``coarse'' model in the acoustic space.
This may cause the adapted speaker models obtained from the UBM model to have significantly
high overlap in the acoustic space.
We hypothesize that the use of VTLN will help in compacting the UBM model
and thus the speaker adapted models obtained from this compact model will have
better speaker-separability in the acoustic space.
We perform experiments on MIT, TIMIT and NIST 2004 SRE databases and show that using VTLN we can
achieve lesser Identification Error Rates as compared to the conventional GMM-UBM based method.

0 0
 · 
0 Bookmarks
 · 
41 Views

Full-text

View
39 Downloads
Available from
21 Jun 2012