Backoff Inspired Features for Maximum Entropy Language Models

Fadi Biadsy; Keith Hall; Pedro Moreno; Brian Roark

Backoff Inspired Features for Maximum Entropy Language Models

Fadi Biadsy

Keith Hall

Pedro Moreno

Brian Roark

Proceedings of Interspeech, ISCA (2014)

Google Scholar

Abstract

Maximum Entropy (MaxEnt) language models are linear models that are typically regularized via well-known L1 or L2 terms in the likelihood objective, hence avoiding the need for the kinds of backoff or mixture weights used in smoothed n-gram language models using Katz backoff and similar techniques. Even though backoff cost is not required to regularize the model, we investigate the use of backoff features in MaxEnt models, as well as some backoff-inspired variants. These features are shown to improve model quality substantially, as shown in perplexity and word-error rate reductions, even in very large scale training scenarios of tens or hundreds of billions of words and hundreds of millions of features.

Research Areas

Natural language processing

Explore our many areas of focus

Building a collaborative ecosystem

Shaping the future together

Translating discovery into real-world impact

Backoff Inspired Features for Maximum Entropy Language Models

Abstract

Research Areas

Meet the teams driving innovation

Google AI

Google Cloud

Google DeepMind

Google Labs