Thinking About Thinking: SAGE-nano's Inverse Reasoning for Self-Aware Language Models
Basab Jha, Firoj Paudel, Ujjwal Puri, Zhang Yuting, Choi Donghyuk, Wang Junhao

TL;DR
This paper introduces inverse reasoning in LLMs, enabling models to explain their reasoning processes post-hoc, which enhances transparency and reasoning accuracy, demonstrated through a new framework and extensive evaluations.
Contribution
It presents the first rigorous framework for LLM self-reflection via inverse reasoning, along with a novel meta-learning approach and comprehensive evaluation methods.
Findings
SAGE-nano achieves 74.6% accuracy on AQUA-RAT
Explanation quality scored 92.1% on human preference
Inverse reasoning improves interpretability and reasoning performance
Abstract
Large Language Models (LLMs) have demonstrated remarkable capabilities at solving complex reasoning tasks with Chain-of-Thought (CoT) prompting, but their decision-making processes remain somewhat blackbox. We introduce textbfinverse reasoning, a novel paradigm enabling LLMs to decompose and explain their own reasoning chains post-hoc. Our approach, used in SAGE-nano, a 4-billion-parameter reasoning model, employs a metacognitive structure that reflects back via attention processes to identify major decision points and generate explanations of reasoning choices. While typical CoT approaches are directed towards forward reasoning generation, inverse reasoning provides insight into why specific reasoning chains were selected over others. Through thorough testing of logical reasoning puzzles, math problems and ethical dilemmas from AQUA-RAT, CommonsenseQA, and customized benchmarks, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Multimodal Machine Learning Applications · Artificial Intelligence in Healthcare and Education
