Counterfactual Language Model Adaptation for Suggesting Phrases

Kenneth C. Arnold; Kai-Wei Chang; Adam T. Kalai

arXiv:1710.01799·cs.CL·October 6, 2017·5 cites

Counterfactual Language Model Adaptation for Suggesting Phrases

Kenneth C. Arnold, Kai-Wei Chang, Adam T. Kalai

PDF

Open Access 1 Repo

TL;DR

This paper introduces a counterfactual approach to adapt language models for suggesting phrases that writers are more likely to accept, improving text entry on mobile devices.

Contribution

It proposes a novel counterfactual training framework for phrase suggestion models, enabling offline optimization of suggestions based on acceptance likelihood.

Findings

01

Simple language models can effectively capture acceptability cues.

02

Counterfactual training improves suggestion acceptance rates.

03

The approach allows offline evaluation of interactive suggestion systems.

Abstract

Mobile devices use language models to suggest words and phrases for use in text entry. Traditional language models are based on contextual word frequency in a static corpus of text. However, certain types of phrases, when offered to writers as suggestions, may be systematically chosen more often than their frequency would predict. In this paper, we propose the task of generating suggestions that writers accept, a related but distinct task to making accurate predictions. Although this task is fundamentally interactive, we propose a counterfactual setting that permits offline training and evaluation. We find that even a simple language model can capture text characteristics that improve acceptability.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kcarnold/counterfactual-lm
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems