Article

Low Rank Language Models for Small Training Sets

Dept. of Electr. Eng., Univ. of Washington, Seattle, WA, USA
IEEE Signal Processing Letters (impact factor: 1.39). 10/2011; DOI:10.1109/LSP.2011.2160850 pp.489 - 492
Source: IEEE Xplore

ABSTRACT Several language model smoothing techniques are available that are effective for a variety of tasks; however, training with small data sets is still difficult. This letter introduces the low rank language model, which uses a low rank tensor representation of joint probability distributions for parameter-tying and optimizes likelihood under a rank constraint. It obtains lower perplexity than standard smoothing techniques when the training set is small and also leads to perplexity reduction when used in domain adaptation via interpolation with a general, out-of-domain model.

0 0
 · 
0 Bookmarks
 · 
39 Views

Full-text

View
0 Downloads
Available from

Keywords

joint probability distributions
 
language model smoothing techniques
 
low rank language model
 
low rank tensor representation
 
optimizes likelihood
 
out-of-domain model
 
perplexity reduction
 
rank constraint
 
standard smoothing techniques
 
tasks
 

B. Hutchinson