Breakpoint Transformers for Modeling and Tracking Intermediate Beliefs
Kyle Richardson, Ronen Tamari, Oren Sultan, Reut Tsarfaty, Dafna, Shahaf, Ashish Sabharwal

TL;DR
This paper introduces breakpoint modeling, a framework enabling language models to track and query intermediate beliefs throughout text, improving reasoning and understanding in natural language tasks.
Contribution
The paper presents a novel breakpoint transformer based on T5 that efficiently learns to represent and query intermediate beliefs, outperforming traditional methods in accuracy and consistency.
Findings
Improved prediction accuracy over conventional approaches.
Achieved state-of-the-art results on TRIP benchmark reasoning tasks.
Enhanced processing efficiency and belief tracking consistency.
Abstract
Can we teach natural language understanding models to track their beliefs through intermediate points in text? We propose a representation learning framework called breakpoint modeling that allows for learning of this type. Given any text encoder and data marked with intermediate states (breakpoints) along with corresponding textual queries viewed as true/false propositions (i.e., the candidate beliefs of a model, consisting of information changing through time) our approach trains models in an efficient and end-to-end fashion to build intermediate representations that facilitate teaching and direct querying of beliefs at arbitrary points alongside solving other end tasks. To show the benefit of our approach, we experiment with a diverse set of NLU tasks including relational reasoning on CLUTRR and narrative understanding on bAbI. Using novel belief prediction tasks for both tasks, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Bayesian Modeling and Causal Inference
MethodsGated Linear Unit · Refunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Dropout · Attention Dropout · Dense Connections · Adafactor
