ODAR: Principled Adaptive Routing for LLM Reasoning via Active Inference

Siyuan Ma; Bo Gao; Xiaojun Jia; Simeng Qin; Tianlin Li; Ke Ma; Xiaoshuang Jia; Wenqi Ren; and Yang Liu

arXiv:2602.23681·cs.AI·March 2, 2026

ODAR: Principled Adaptive Routing for LLM Reasoning via Active Inference

Siyuan Ma, Bo Gao, Xiaojun Jia, Simeng Qin, Tianlin Li, Ke Ma, Xiaoshuang Jia, Wenqi Ren, and Yang Liu

PDF

Open Access

TL;DR

ODAR-Expert introduces an adaptive routing framework for LLM reasoning that dynamically allocates resources between heuristic and deliberative agents, improving accuracy and efficiency through active inference and free-energy principles.

Contribution

This work presents ODAR-Expert, a novel adaptive routing method for LLMs that balances accuracy and compute cost using active inference and a free-energy-based decision mechanism.

Findings

01

Achieves 98.2% accuracy on MATH benchmark.

02

Reduces computational costs by 82% compared to homogeneous sampling.

03

Improves the compute-accuracy trade-off across 23 benchmarks.

Abstract

The paradigm of large language model (LLM) reasoning is shifting from parameter scaling to test-time compute scaling, yet many existing approaches still rely on uniform brute-force sampling (for example, fixed best-of-N or self-consistency) that is costly, hard to attribute, and can trigger overthinking with diminishing returns. We propose ODAR-Expert, an adaptive routing framework that optimizes the accuracy-efficiency trade-off via principled resource allocation. ODAR uses a difficulty estimator grounded in amortized active inference to dynamically route queries between a heuristic Fast Agent and a deliberative Slow Agent. We further introduce a free-energy-principled, risk-sensitive fusion mechanism that selects answers by minimizing a variational free energy objective, balancing log-likelihood with epistemic uncertainty (varentropy) as a principled alternative to ad hoc voting over…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification