Contextual Recovery of Out-of-Lattice Named Entities in Automatic Speech Recognition

Jack Serrino; Leonid Velikovich; Petar Aleksic; Cyril Allauzen

ISCA Interspeech 2019, ISCA, Graz, Austria (2019), pp. 3830-3834

Abstract

As voice-driven intelligent assistants become commonplace, adaptation to user context becomes critical for Automatic Speech Recognition (ASR) systems. For example, ASR systems may be expected to recognize a user’s contact names containing improbable or out-of-vocabulary (OOV) words.

We introduce a method to identify contextual cues in a first-pass ASR system’s output and to recover out-of-lattice hypotheses that are contextually relevant. Our proposed module is agnostic to the architecture of the underlying recognizer, provided it generates a word lattice of hypotheses; it is sufficiently compact for use on device. The module identifies subgraphs in the lattice likely to contain named entities (NEs), recovers phoneme hypotheses over corresponding time spans, and inserts NEs that are phonetically close to those hypotheses. We measure a decrease in the mean word error rate (WER) of word lattices from 11.5% to 4.9% on a test set of NEs.
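The last step of the module, inserting NEs that are phonetically close to a recovered phoneme hypothesis, can be illustrated with a minimal Python sketch. This is not the authors' implementation: it assumes a plain unit-cost edit distance over phoneme symbols as the closeness measure, and the function names, the 0.34 threshold, the contact names, and the ARPAbet-style pronunciations are all hypothetical choices made for illustration.

```python
from typing import Dict, List, Sequence, Tuple


def phoneme_edit_distance(a: Sequence[str], b: Sequence[str]) -> int:
    """Levenshtein distance between two phoneme sequences (unit costs).

    Assumption: the abstract does not specify the distance measure;
    unit-cost edit distance is used here only as a simple stand-in.
    """
    prev = list(range(len(b) + 1))
    for i, pa in enumerate(a, start=1):
        curr = [i]
        for j, pb in enumerate(b, start=1):
            cost = 0 if pa == pb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def recover_entities(
    hypothesis: Sequence[str],
    entity_pronunciations: Dict[str, List[str]],
    max_normalized_distance: float = 0.34,  # hypothetical threshold
) -> List[Tuple[str, float]]:
    """Return entities whose pronunciation is close to the phoneme hypothesis.

    Distances are normalized by the longer sequence length so that the
    threshold behaves similarly for short and long names.
    """
    matches = []
    for name, pron in entity_pronunciations.items():
        dist = phoneme_edit_distance(hypothesis, pron)
        norm = dist / max(len(hypothesis), len(pron), 1)
        if norm <= max_normalized_distance:
            matches.append((name, norm))
    # Closest candidates first; ties broken alphabetically.
    return sorted(matches, key=lambda m: (m[1], m[0]))


# Hypothetical contact list with illustrative pronunciations.
contacts = {
    "Anya": ["AA", "N", "Y", "AH"],
    "Mateo": ["M", "AH", "T", "EY", "OW"],
}

# Hypothetical phoneme hypothesis recovered over a lattice time span.
hypothesis = ["AA", "N", "IY", "AH"]

candidates = recover_entities(hypothesis, contacts)
print(candidates)
```

In a full system the surviving candidates would be inserted as new paths over the corresponding lattice time span; here they are only returned as a ranked list.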

Research Areas

Natural language processing
