Fixing exposure bias with imitation learning needs powerful oracles

Luca Hormann; Artem Sokolov

arXiv:2109.04114·cs.CL·September 20, 2021·1 cites

Fixing exposure bias with imitation learning needs powerful oracles

Luca Hormann, Artem Sokolov

PDF

Open Access

TL;DR

This paper explores using imitation learning with error-correcting oracles to address exposure bias in neural machine translation, but finds that highly performant SMT-based oracles may be too specialized for effective IL training.

Contribution

It demonstrates the challenges of applying powerful SMT-based oracles in IL for NMT exposure bias correction due to their pruning and idiosyncrasies.

Findings

01

SMT lattice-based oracle performs well in unconstrained translation

02

Pruned and idiosyncratic SMT oracles are ineffective for IL

03

Imitation learning requires more generalizable oracles for NMT

Abstract

We apply imitation learning (IL) to tackle the NMT exposure bias problem with error-correcting oracles, and evaluate an SMT lattice-based oracle which, despite its excellent performance in an unconstrained oracle translation task, turned out to be too pruned and idiosyncratic to serve as the oracle for IL.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Multimodal Machine Learning Applications · Topic Modeling