Active Inference and Reinforcement Learning: A unified inference on continuous state and action spaces under partial observability
Parvin Malekzadeh, Konstantinos N. Plataniotis

TL;DR
This paper unifies active inference and reinforcement learning to improve decision-making in continuous, partially observable environments, enabling better exploration and learning without explicit reward signals.
Contribution
It establishes a theoretical connection between active inference and reinforcement learning, allowing their integration in continuous space POMDPs and overcoming previous computational limitations.
Findings
Demonstrates superior learning in continuous POMDP tasks.
Enables reward-free exploration through information-seeking behavior.
Shows theoretical and empirical advantages over traditional methods.
Abstract
Reinforcement learning (RL) has garnered significant attention for developing decision-making agents that aim to maximize rewards, specified by an external supervisor, within fully observable environments. However, many real-world problems involve partial observations, formulated as partially observable Markov decision processes (POMDPs). Previous studies have tackled RL in POMDPs by either incorporating the memory of past actions and observations or by inferring the true state of the environment from observed data. However, aggregating observed data over time becomes impractical in continuous spaces. Moreover, inference-based RL approaches often require many samples to perform well, as they focus solely on reward maximization and neglect uncertainty in the inferred state. Active inference (AIF) is a framework formulated in POMDPs and directs agents to select actions by minimizing a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Reinforcement Learning in Robotics · Complex Systems and Decision Making
