FLEx: Language Modeling with Few-shot Language Explanations
Adar Avsian, Christopher Richardson, Anirudh Sundar, Larry Heck

TL;DR
FLEx is a method that enhances language model accuracy by using a small set of explanatory examples, selected through clustering, to guide the model and reduce errors without changing its weights.
Contribution
FLEx introduces a novel approach for improving language model performance with few-shot explanations, leveraging embedding-based clustering and error correction summaries.
Findings
FLEx outperforms chain-of-thought prompting on multiple datasets.
FLEx reduces up to 83% of errors compared to CoT.
The method does not require modifying model weights.
Abstract
Language models have become effective at a wide range of tasks, from math problem solving to open-domain question answering. However, they still make mistakes, and these mistakes are often repeated across related queries. Natural language explanations can help correct these errors, but collecting them at scale may be infeasible, particularly in domains where expert annotators are required. To address this issue, we introduce FLEx (ew-shot anguage planations), a method for improving model behavior using a small number of explanatory examples. FLEx selects representative model errors using embedding-based clustering, verifies that the associated explanations correct those errors, and summarizes them into a prompt prefix that is prepended at inference-time. This summary guides the model to avoid similar errors on new inputs, without modifying model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI)
