Overcoming Non-monotonicity in Transducer-based Streaming Generation
Zhengrui Ma, Yang Feng, Min Zhang

TL;DR
This paper introduces MonoAttn-Transducer, a novel method that integrates learnable monotonic attention with Transducer models to effectively handle non-monotonic alignments in streaming generation tasks like simultaneous translation.
Contribution
It proposes a new approach combining Transducer decoding with a forward-backward algorithm for alignment inference, addressing non-monotonicity in streaming generation.
Findings
Effective handling of non-monotonic alignments in streaming scenarios
Improved performance over traditional Transducer models in complex tasks
Robustness demonstrated through extensive experiments
Abstract
Streaming generation models are utilized across fields, with the Transducer architecture being popular in industrial applications. However, its input-synchronous decoding mechanism presents challenges in tasks requiring non-monotonic alignments, such as simultaneous translation. In this research, we address this issue by integrating Transducer's decoding with the history of input stream via a learnable monotonic attention. Our approach leverages the forward-backward algorithm to infer the posterior probability of alignments between the predictor states and input timestamps, which is then used to estimate the monotonic context representations, thereby avoiding the need to enumerate the exponentially large alignment space during training. Extensive experiments show that our MonoAttn-Transducer effectively handles non-monotonic alignments in streaming scenarios, offering a robust solution…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Advanced Control Systems Optimization
MethodsSoftmax · Attention Is All You Need
