On Weight Interpolation of the Hybrid Autoregressive Transducer Model

Bhuvana Ramabhadran

Cyril Allauzen

David Rybach

Ehsan Variani

Michael D. Riley

Tongzhou Chen

Interspeech 2022 (to appear)


Abstract

This paper explores ways to improve a two-pass speech recognition system in which the first pass is a hybrid autoregressive transducer (HAT) model and the second pass is a neural language model. The main focus is on the scores provided by each of these models: their quantitative analysis, how to improve them, and how best to integrate them to achieve better recognition accuracy. Several analyses are presented to show the importance of the choice of integration weights for combining the first-pass and second-pass scores. A sequence-level weight estimation model, along with four training criteria, is proposed to allow adaptive integration of the scores per acoustic sequence. The effectiveness of this algorithm is demonstrated by constructing and analyzing models on the LibriSpeech data set.
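To illustrate why the choice of integration weight matters, here is a minimal sketch of second-pass rescoring over an n-best list. It assumes a simple log-linear combination, first-pass log-score plus a weight times the language model log-score; the abstract does not give the paper's exact formulation, so this is an illustration only. The `Hypothesis` class, the `rescore` function, and all scores are hypothetical and made up for the example.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Hypothesis:
    text: str
    first_pass_logp: float  # log-score from the first-pass model (made-up value)
    lm_logp: float          # log-score from the second-pass neural LM (made-up value)


def rescore(nbest: List[Hypothesis], lm_weight: float) -> Hypothesis:
    """Pick the hypothesis with the highest combined score.

    Assumed combination (not taken from the paper):
        first_pass_logp + lm_weight * lm_logp
    """
    return max(nbest, key=lambda h: h.first_pass_logp + lm_weight * h.lm_logp)


# Toy n-best list: the first pass slightly prefers the second hypothesis,
# while the language model strongly prefers the first.
nbest = [
    Hypothesis("recognize speech", first_pass_logp=-4.0, lm_logp=-6.0),
    Hypothesis("wreck a nice beach", first_pass_logp=-3.5, lm_logp=-9.0),
]

for weight in (0.0, 0.5):
    best = rescore(nbest, weight)
    print(f"lm_weight={weight}: {best.text}")
```

In this toy case the selected hypothesis changes with the weight. A sequence-level weight estimator of the kind the abstract describes would, under this assumed setup, supply a different `lm_weight` for each utterance instead of one global constant.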
