Language Generation with Strictly Proper Scoring Rules

Chenze Shao; Fandong Meng; Yijin Liu; Jie Zhou

arXiv:2405.18906·cs.CL·May 30, 2024

TL;DR

This paper introduces a method for training language generation models with strictly proper scoring rules in place of the traditional log-likelihood loss, and reports improved generation performance, including on large language models such as LLaMA-7B and LLaMA-13B.

Contribution

It proposes a novel strategy to incorporate non-local proper scoring rules into language modeling, enabling the use of Brier and Spherical scores as training objectives.
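
For reference, the scoring rules named above can be written out as follows. The logarithmic, Brier, and Spherical scores are the standard textbook definitions for a forecast distribution p and an observed outcome x. The last line is only a sketch of how such a rule could serve as a training objective: it assumes the rule is applied to each next-token distribution over the vocabulary V, and the symbols p_\theta, y_t, and \mathcal{L}_S are notation introduced here for illustration.

```latex
% Standard scoring rules (higher is better) for forecast p and observed outcome x
S_{\log}(p, x)      = \log p(x)
S_{\text{Brier}}(p, x) = 2\,p(x) - \sum_{x' \in V} p(x')^2
S_{\text{sph}}(p, x)   = \frac{p(x)}{\sqrt{\sum_{x' \in V} p(x')^2}}

% Assumed token-level objective: negate the score of each next-token distribution
\mathcal{L}_S(\theta) = -\sum_{t=1}^{T} S\bigl(p_\theta(\cdot \mid y_{<t}),\, y_t\bigr)
```

Only the logarithmic score depends on p(x) alone. The Brier and Spherical scores also need the sum of squared probabilities over every outcome, which is cheap over a vocabulary but intractable over the space of all possible sentences.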

Findings

01. Replacing log-likelihood with proper scoring rules improves generation quality.

02. The approach scales effectively to large language models like LLaMA-7B and 13B.

03. Substituting the loss function yields significant performance gains without hyperparameter tuning.

Abstract

Language generation based on maximum likelihood estimation (MLE) has become the fundamental approach for text generation. Maximum likelihood estimation is typically performed by minimizing the log-likelihood loss, also known as the logarithmic score in statistical decision theory. The logarithmic score is strictly proper in the sense that it encourages honest forecasts, where the expected score is maximized only when the model reports true probabilities. Although many strictly proper scoring rules exist, the logarithmic score is the only local scoring rule among them that depends exclusively on the probability of the observed sample, making it capable of handling the exponentially large sample space of natural text. In this work, we propose a straightforward strategy for adapting scoring rules to language generation, allowing for language modeling with any non-local scoring rules.…
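
The notion of strict propriety in the abstract can be checked numerically on a toy example. The sketch below is not taken from the paper's code; the three-outcome distribution and the function names are hypothetical, chosen only to show that the expected score under the true distribution is highest when the forecast equals that distribution, for the logarithmic, Brier, and Spherical scores alike.

```python
import math

def log_score(p, i):
    # Local: uses only the probability assigned to the observed outcome i.
    return math.log(p[i])

def brier_score(p, i):
    # Non-local: also needs the squared probabilities of all outcomes.
    return 2.0 * p[i] - sum(q * q for q in p)

def spherical_score(p, i):
    # Non-local: normalizes p[i] by the Euclidean norm of the forecast.
    return p[i] / math.sqrt(sum(q * q for q in p))

def expected_score(score, truth, forecast):
    # Expectation of the score when outcomes are drawn from `truth`.
    return sum(t * score(forecast, i) for i, t in enumerate(truth))

# Hypothetical toy distributions over three outcomes.
truth = [0.6, 0.3, 0.1]
hedged = [0.4, 0.4, 0.2]
overconfident = [0.9, 0.05, 0.05]

for score in (log_score, brier_score, spherical_score):
    for name, forecast in (("honest", truth), ("hedged", hedged),
                           ("overconfident", overconfident)):
        value = expected_score(score, truth, forecast)
        print(f"{score.__name__:16s} {name:14s} {value:.4f}")
```

For each rule, the honest forecast gets the largest expected score, which is what "strictly proper" means. A training loss can then be taken as the negative score.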

Code & Models

Repositories

shaochenze/scoringruleslm (PyTorch, official)

Taxonomy

Topics: Natural Language Processing Techniques · Speech and dialogue systems