Thinking About Thinking: SAGE-nano's Inverse Reasoning for Self-Aware Language Models

Basab Jha; Firoj Paudel; Ujjwal Puri; Zhang Yuting; Choi Donghyuk; Wang Junhao

arXiv:2507.00092·cs.AI·July 2, 2025

Thinking About Thinking: SAGE-nano's Inverse Reasoning for Self-Aware Language Models

Basab Jha, Firoj Paudel, Ujjwal Puri, Zhang Yuting, Choi Donghyuk, Wang Junhao

PDF

Open Access

TL;DR

This paper introduces inverse reasoning in LLMs, enabling models to explain their reasoning processes post-hoc, which enhances transparency and reasoning accuracy, demonstrated through a new framework and extensive evaluations.

Contribution

It presents the first rigorous framework for LLM self-reflection via inverse reasoning, along with a novel meta-learning approach and comprehensive evaluation methods.

Findings

01

SAGE-nano achieves 74.6% accuracy on AQUA-RAT

02

Explanation quality scored 92.1% on human preference

03

Inverse reasoning improves interpretability and reasoning performance

Abstract

Large Language Models (LLMs) have demonstrated remarkable capabilities at solving complex reasoning tasks with Chain-of-Thought (CoT) prompting, but their decision-making processes remain somewhat blackbox. We introduce textbfinverse reasoning, a novel paradigm enabling LLMs to decompose and explain their own reasoning chains post-hoc. Our approach, used in SAGE-nano, a 4-billion-parameter reasoning model, employs a metacognitive structure that reflects back via attention processes to identify major decision points and generate explanations of reasoning choices. While typical CoT approaches are directed towards forward reasoning generation, inverse reasoning provides insight into why specific reasoning chains were selected over others. Through thorough testing of logical reasoning puzzles, math problems and ethical dilemmas from AQUA-RAT, CommonsenseQA, and customized benchmarks, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Multimodal Machine Learning Applications · Artificial Intelligence in Healthcare and Education