Conference Proceeding

A maximum entropy language model integrating N-grams and topicdependencies for conversational speech recognition

Center for Language & Speech Process., Johns Hopkins Univ., Baltimore, MD;
04/1999; DOI:10.1109/ICASSP.1999.758185 ISBN: 0-7803-5041-3 pp.553-556 vol.1 In proceeding of: Acoustics, Speech, and Signal Processing, 1999. ICASSP '99. Proceedings., 1999 IEEE International Conference on, Volume: 1
Source: IEEE Xplore

ABSTRACT A compact language model which incorporates local dependencies in the form of N-grams and long distance dependencies through dynamic topic conditional constraints is presented. These constraints are integrated using the maximum entropy principle. Issues in assigning a topic to a test utterance are investigated. Recognition results on the Switchboard corpus are presented showing that with a very small increase in the number of model parameters, reduction in word error rate and language model perplexity are achieved over trigram models. Some analysis follows, demonstrating that the gains are even larger on content-bearing words. The results are compared with those obtained by interpolating topic-independent and topic-specific N-gram models. The framework presented here extends easily to incorporate other forms of statistical dependencies such as syntactic word-pair relationships or hierarchical topic constraints

0 0
 · 
0 Bookmarks
 · 
33 Views

Keywords

compact language model
 
content-bearing words
 
distance dependencies
 
forms
 
gains
 
hierarchical topic constraints
 
incorporates local dependencies
 
interpolating topic-independent
 
language model perplexity
 
maximum entropy principle
 
statistical dependencies
 
syntactic word-pair relationships
 
test utterance
 
trigram models
 

S. Khudanpur