Cost-Sensitive Training for Autoregressive Models

Irina Saparina; Anton Osokin

arXiv:1912.03771·cs.LG·December 10, 2019

Cost-Sensitive Training for Autoregressive Models

Irina Saparina, Anton Osokin

PDF

Open Access

TL;DR

This paper explores cost-sensitive training for autoregressive models, proposing a reference policy based on alignment, optimizing test metrics, and replacing KL loss with ranking objectives to improve model performance.

Contribution

It introduces a novel reference policy construction, demonstrates benefits of test metric-based costs, and advocates replacing KL loss with ranking objectives for autoregressive training.

Findings

01

Reference policy based on alignment improves predictions.

02

Test metric-based costs enhance training effectiveness.

03

Ranking objectives outperform KL loss in learning high-probability tokens.

Abstract

Training autoregressive models to better predict under the test metric, instead of maximizing the likelihood, has been reported to be beneficial in several use cases but brings additional complications, which prevent wider adoption. In this paper, we follow the learning-to-search approach (Daum\'e III et al., 2009; Leblond et al., 2018) and investigate its several components. First, we propose a way to construct a reference policy based on an alignment between the model output and ground truth. Our reference policy is optimal when applied to the Kendall-tau distance between permutations (appear in the task of word ordering) and helps when working with the METEOR score for machine translation. Second, we observe that the learning-to-search approach benefits from choosing the costs related to the test metrics. Finally, we study the effect of different learning objectives and find that the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Machine Learning and Data Classification

MethodsTest