# Monotonic Infinite Lookback Attention for Simultaneous Machine   Translation

**Authors:** Naveen Arivazhagan, Colin Cherry, Wolfgang Macherey, Chung-Cheng Chiu,, Semih Yavuz, Ruoming Pang, Wei Li, Colin Raffel

arXiv: 1906.05218 · 2019-06-13

## TL;DR

This paper introduces MILk attention, a novel adaptive scheduling method for simultaneous machine translation that balances latency and quality by jointly learning to attend over source tokens.

## Contribution

It presents the first simultaneous translation system with an adaptive schedule learned jointly with a neural machine translation model using MILk attention.

## Key findings

- MILk achieves favorable latency-quality trade-offs compared to wait-k strategies.
- The system adapts its reading schedule effectively across different latency settings.
- Demonstrates improved performance in streaming translation scenarios.

## Abstract

Simultaneous machine translation begins to translate each source sentence before the source speaker is finished speaking, with applications to live and streaming scenarios. Simultaneous systems must carefully schedule their reading of the source sentence to balance quality against latency. We present the first simultaneous translation system to learn an adaptive schedule jointly with a neural machine translation (NMT) model that attends over all source tokens read thus far. We do so by introducing Monotonic Infinite Lookback (MILk) attention, which maintains both a hard, monotonic attention head to schedule the reading of the source sentence, and a soft attention head that extends from the monotonic head back to the beginning of the source. We show that MILk's adaptive schedule allows it to arrive at latency-quality trade-offs that are favorable to those of a recently proposed wait-k strategy for many latency values.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1906.05218/full.md

## Figures

27 figures with captions in the complete paper: https://tomesphere.com/paper/1906.05218/full.md

## References

22 references — full list in the complete paper: https://tomesphere.com/paper/1906.05218/full.md

---
Source: https://tomesphere.com/paper/1906.05218