AoI-MDP: An AoI Optimized Markov Decision Process (Student Abstract)

Yimian Ding; Jingzehua Xu; Yiyuan Yang; Guanwen Xie; Xinqi Wang; Shuai Zhang

arXiv:2605.16777·eess.SY·May 19, 2026

AoI-MDP: An AoI Optimized Markov Decision Process (Student Abstract)

Yimian Ding, Jingzehua Xu, Yiyuan Yang, Guanwen Xie, Xinqi Wang, Shuai Zhang

PDF

1 Repo

TL;DR

This paper introduces AoI-MDP, a reinforcement learning framework that models observation delays to optimize information freshness in underwater autonomous vehicle tasks.

Contribution

It presents a novel AoI-MDP model that incorporates delay modeling and wait times, improving decision-making for underwater exploration.

Findings

01

AoI-MDP outperforms standard MDP in simulations.

02

The approach enhances information freshness and decision accuracy.

03

Code is available at https://github.com/Xiboxtg/AoI-MDP.

Abstract

Ocean exploration places high demands on autonomous underwater vehicles, especially when there's observation delay. We propose age of information optimized Markov decision process (AoI-MDP) to enhance underwater tasks by modeling observation delay as signal delay and including it in the state space. AoI-MDP also introduces wait time in the action space and integrates AoI with reward functions, optimizing information freshness and decision-making using reinforcement learning. Simulations show AoI-MDP outperforms the standard MDP, demonstrating superior performance, feasibility, and generalization in underwater tasks. To accelerate relevant research, we have made the codes available as open-source at https://github.com/Xiboxtg/AoI-MDP.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Xiboxtg/AoI-MDP
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.