Anypath Routing Protocol Design via Q-Learning for Underwater Sensor Networks
Yuan Zhou, Tao Cao, and Wei Xiang

TL;DR
This paper introduces a Q-learning-based routing protocol for underwater sensor networks that aims to reduce latency and extend network lifetime by considering residual energy and depth information.
Contribution
It presents a novel localization-free anypath routing protocol using Q-learning with reward functions for energy and depth, improving delay and lifetime.
Findings
Reduces end-to-end delay in underwater sensor networks.
Extends network lifetime compared to existing protocols.
Demonstrates superior performance through extensive simulations.
Abstract
As a promising technology in the Internet of Underwater Things, underwater sensor networks have drawn a widespread attention from both academia and industry. However, designing a routing protocol for underwater sensor networks is a great challenge due to high energy consumption and large latency in the underwater environment. This paper proposes a Q-learning-based localization-free anypath routing (QLFR) protocol to prolong the lifetime as well as reduce the end-to-end delay for underwater sensor networks. Aiming at optimal routing policies, the Q-value is calculated by jointly considering the residual energy and depth information of sensor nodes throughout the routing process. More specifically, we define two reward functions (i.e., depth-related and energy-related rewards) for Q-learning with the objective of reducing latency and extending network lifetime. In addition, a new holding…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUnderwater Vehicles and Communication Systems · Energy Efficient Wireless Sensor Networks · Energy Harvesting in Wireless Networks
MethodsQ-Learning
