Cost-Sensitive Training for Autoregressive Models
Irina Saparina, Anton Osokin

TL;DR
This paper explores cost-sensitive training for autoregressive models, proposing a reference policy based on alignment, optimizing test metrics, and replacing KL loss with ranking objectives to improve model performance.
Contribution
It introduces a novel reference policy construction, demonstrates benefits of test metric-based costs, and advocates replacing KL loss with ranking objectives for autoregressive training.
Findings
Reference policy based on alignment improves predictions.
Test metric-based costs enhance training effectiveness.
Ranking objectives outperform KL loss in learning high-probability tokens.
Abstract
Training autoregressive models to better predict under the test metric, instead of maximizing the likelihood, has been reported to be beneficial in several use cases but brings additional complications, which prevent wider adoption. In this paper, we follow the learning-to-search approach (Daum\'e III et al., 2009; Leblond et al., 2018) and investigate its several components. First, we propose a way to construct a reference policy based on an alignment between the model output and ground truth. Our reference policy is optimal when applied to the Kendall-tau distance between permutations (appear in the task of word ordering) and helps when working with the METEOR score for machine translation. Second, we observe that the learning-to-search approach benefits from choosing the costs related to the test metrics. Finally, we study the effect of different learning objectives and find that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Machine Learning and Data Classification
MethodsTest
