Breakpoint Transformers for Modeling and Tracking Intermediate Beliefs

Kyle Richardson; Ronen Tamari; Oren Sultan; Reut Tsarfaty; Dafna Shahaf; Ashish Sabharwal

arXiv:2211.07950 · cs.CL · November 16, 2022 · 1 citation

TL;DR

This paper introduces breakpoint modeling, a framework enabling language models to track and query intermediate beliefs throughout text, improving reasoning and understanding in natural language tasks.

Contribution

The paper presents a novel breakpoint transformer based on T5 that efficiently learns to represent and query intermediate beliefs, outperforming traditional methods in accuracy and consistency.

Findings

01. Improved prediction accuracy over conventional approaches.
02. Achieved state-of-the-art results on the TRIP benchmark reasoning tasks.
03. Enhanced processing efficiency and belief tracking consistency.

Abstract

Can we teach natural language understanding models to track their beliefs through intermediate points in text? We propose a representation learning framework called breakpoint modeling that allows for learning of this type. Given any text encoder and data marked with intermediate states (breakpoints) along with corresponding textual queries viewed as true/false propositions (i.e., the candidate beliefs of a model, consisting of information changing through time) our approach trains models in an efficient and end-to-end fashion to build intermediate representations that facilitate teaching and direct querying of beliefs at arbitrary points alongside solving other end tasks. To show the benefit of our approach, we experiment with a diverse set of NLU tasks including relational reasoning on CLUTRR and narrative understanding on bAbI. Using novel belief prediction tasks for both tasks, we…
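
The data setup the abstract describes (text marked with breakpoints, each paired with true/false propositions) can be illustrated with a small sketch. This covers only the data side, not the model or its training, and it is not taken from the authors' code: the `[B]` marker, the `BreakpointExample` class, and the `expand` helper are all hypothetical names chosen for illustration.

```python
from dataclasses import dataclass
from typing import List, Tuple

# Hypothetical breakpoint marker; the actual token used by the authors may differ.
BREAKPOINT = "[B]"


@dataclass
class BreakpointExample:
    prefix: str       # story text up to and including this breakpoint's segment
    proposition: str  # textual query read as a true/false claim
    label: bool       # whether the claim holds at this point in the story


def expand(story: str, queries: List[List[Tuple[str, bool]]]) -> List[BreakpointExample]:
    """Pair the text preceding each breakpoint with the propositions queried there."""
    segments = [s.strip() for s in story.split(BREAKPOINT) if s.strip()]
    if len(segments) != len(queries):
        raise ValueError("expected one list of propositions per breakpoint")
    examples = []
    for i, props in enumerate(queries):
        prefix = " ".join(segments[: i + 1])
        for text, label in props:
            examples.append(BreakpointExample(prefix, text, label))
    return examples


# A toy bAbI-style story (invented for this sketch) with two breakpoints.
story = "John picked up the apple. [B] John went to the kitchen. [B]"
queries = [
    [("John is in the kitchen.", False)],
    [("John is in the kitchen.", True), ("John has the apple.", True)],
]

examples = expand(story, queries)
for ex in examples:
    print(f"{ex.prefix!r} | {ex.proposition!r} -> {ex.label}")
```

The same proposition ("John is in the kitchen.") is false at the first breakpoint and true at the second, which is the kind of belief changing through time that the framework is meant to track. A model built along the lines of the abstract would presumably score propositions against a representation taken at each breakpoint rather than re-encoding each prefix separately, but that part is left out here.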

Code & Models

Repositories

allenai/situation_modeling (PyTorch, official)

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Bayesian Modeling and Causal Inference

Methods: Gated Linear Unit · Multi-Head Attention · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Dropout · Attention Dropout · Dense Connections · Adafactor