Monotonic Infinite Lookback Attention for Simultaneous Machine Translation

Naveen Ari; Colin Andrew Cherry; Wolfgang Macherey; Chung-Cheng Chiu; Semih Yavuz; Ruoming Pang; Wei Li; Colin Raffel

Monotonic Infinite Lookback Attention for Simultaneous Machine Translation

Naveen Ari

Colin Andrew Cherry

Wolfgang Macherey

Chung-Cheng Chiu

Semih Yavuz

Ruoming Pang

Wei Li

Colin Raffel

Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL), Association for Computational Linguistics, Florence, Italy (2019), pp. 1313-1323

Download Google Scholar

Abstract

Simultaneous machine translation begins to translate each source sentence before the source speaker is finished speaking, with applications to live and streaming scenarios. Simultaneous systems must carefully schedule their reading of the source sentence to balance quality against latency. We present the first simultaneous translation system to learn an adaptive schedule jointly with a neural machine translation (NMT) model that attends over all source tokens read thus far. We do so by introducing Monotonic Infinite Lookback (MILk) attention, which maintains both a hard,monotonic attention head to schedule the read-ing of the source sentence, and a soft attention head that extends from the monotonic head back to the beginning of the source. We show that MILk’s adaptive schedule allows it to arrive at latency-quality trade-offs that are favorable to those of a recently proposed wait-k strategy for many latency values.

Explore our many areas of focus

Building a collaborative ecosystem

Shaping the future together

Translating discovery into real-world impact

Monotonic Infinite Lookback Attention for Simultaneous Machine Translation

Abstract

Meet the teams driving innovation

Google AI

Google Cloud

Google DeepMind

Google Labs