Controlling the Focus of Pretrained Language Generation Models

Jiabao Ji; Yoon Kim; James Glass; Tianxing He

arXiv:2203.01146·cs.AI·March 3, 2022

Controlling the Focus of Pretrained Language Generation Models

Jiabao Ji, Yoon Kim, James Glass, Tianxing He

PDF

Open Access 1 Repo

TL;DR

This paper introduces a method to control the focus of pretrained language models by using trainable focus vectors, enabling targeted content generation based on user-selected highlights.

Contribution

It proposes a novel mechanism with trainable focus vectors applied to fixed pretrained models, allowing explicit control over the model's attention to specific input spans.

Findings

01

Focus vectors effectively steer model output towards user-selected highlights.

02

The approach improves relevance in dialogue response generation.

03

The method enhances controllability without retraining the entire model.

Abstract

The finetuning of pretrained transformer-based language generation models are typically conducted in an end-to-end manner, where the model learns to attend to relevant parts of the input by itself. However, there does not exist a mechanism to directly control the model's focus. This work aims to develop a control mechanism by which a user can select spans of context as "highlights" for the model to focus on, and generate relevant output. To achieve this goal, we augment a pretrained model with trainable "focus vectors" that are directly applied to the model's embeddings, while the model itself is kept fixed. These vectors, trained on automatic annotations derived from attribution methods, act as indicators for context importance. We test our approach on two core generation tasks: dialogue response generation and abstractive summarization. We also collect evaluation data where the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

question406/learningtofocus
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems