Coordination of Bounded Rational Drones through Informed Prior Policy
Durgakant Pushp, Junhong Xu, and Lantao Liu

TL;DR
This paper presents a method for coordinating multiple drones using a prior policy based on bounded rationality, combining reinforcement learning and game theory to improve collective decision-making in familiar and unfamiliar environments.
Contribution
It introduces an adaptive prior policy for drones that enhances cooperative motion planning within a bounded rationality framework, integrating reinforcement learning and game theory.
Findings
Informed prior policies improve decision-making efficiency.
Drones navigate safely and coordinate effectively in simulations and real-world tests.
The approach adapts to environment familiarity, enhancing robustness.
Abstract
Biological agents, such as humans and animals, are capable of making decisions out of a very large number of choices in a limited time. They can do so because they use their prior knowledge to find a solution that is not necessarily optimal but good enough for the given task. In this work, we study the motion coordination of multiple drones under the above-mentioned paradigm, Bounded Rationality (BR), to achieve cooperative motion planning tasks. Specifically, we design a prior policy that provides useful goal-directed navigation heuristics in familiar environments and is adaptive in unfamiliar ones via Reinforcement Learning augmented with an environment-dependent exploration noise. Integrating this prior policy in the game-theoretic bounded rationality framework allows agents to quickly make decisions in a group considering other agents' computational constraints. Our investigation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Path Planning Algorithms · Reinforcement Learning in Robotics · Transportation and Mobility Innovations
