Age and Power Minimization via Meta-Deep Reinforcement Learning in UAV   Networks

Sankani Sarathchandra; Eslam Eldeeb; Mohammad Shehab; Hirley Alves,; Konstantin Mikhaylov; Mohamed-Slim Alouini

arXiv:2501.14603·cs.LG·January 27, 2025

Age and Power Minimization via Meta-Deep Reinforcement Learning in UAV Networks

Sankani Sarathchandra, Eslam Eldeeb, Mohammad Shehab, Hirley Alves,, Konstantin Mikhaylov, Mohamed-Slim Alouini

PDF

Open Access

TL;DR

This paper introduces a meta-deep reinforcement learning method combining DQNs and MAML to optimize UAV trajectories for minimizing age-of-information and power in IoT networks, demonstrating faster convergence and adaptability.

Contribution

It presents a novel meta-RL approach integrating DQNs with MAML for scalable, efficient UAV data collection optimization under varying objectives.

Findings

01

Faster convergence compared to traditional RL methods.

02

Effective adaptation to different AoI and power objectives.

03

Achieves lower AoI and power consumption overall.

Abstract

Age-of-information (AoI) and transmission power are crucial performance metrics in low energy wireless networks, where information freshness is of paramount importance. This study examines a power-limited internet of things (IoT) network supported by a flying unmanned aerial vehicle(UAV) that collects data. Our aim is to optimize the UAV flight trajectory and scheduling policy to minimize a varying AoI and transmission power combination. To tackle this variation, this paper proposes a meta-deep reinforcement learning (RL) approach that integrates deep Q-networks (DQNs) with model-agnostic meta-learning (MAML). DQNs determine optimal UAV decisions, while MAML enables scalability across varying objective functions. Numerical results indicate that the proposed algorithm converges faster and adapts to new objectives more effectively than traditional deep RL methods, achieving minimal AoI…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAge of Information Optimization · UAV Applications and Optimization · Distributed Control Multi-Agent Systems

MethodsModel-Agnostic Meta-Learning