Temporal Action Representation Learning for Tactical Resource Control and Subsequent Maneuver Generation
Hoseong Jung, Sungil Son, Daesol Cho, Jonghae Park, Changhyun Choi, H. Jin Kim

TL;DR
This paper introduces TART, a framework that learns temporal representations of resource control and maneuvers in autonomous systems, improving decision-making in resource-constrained, fast-evolving scenarios.
Contribution
TART employs contrastive learning and quantization to capture temporal dependencies and multi-modal tactical patterns, addressing limitations of prior hybrid action space methods.
Findings
Outperforms hybrid-action baselines in maze navigation
Enhances resource-aware maneuver generation in air combat simulation
Demonstrates effective modeling of resource-maneuver dependencies
Abstract
Autonomous robotic systems should reason about resource control and its impact on subsequent maneuvers, especially when operating with limited energy budgets or restricted sensing. Learning-based control is effective in handling complex dynamics and represents the problem as a hybrid action space unifying discrete resource usage and continuous maneuvers. However, prior works on hybrid action space have not sufficiently captured the causal dependencies between resource usage and maneuvers. They have also overlooked the multi-modal nature of tactical decisions, both of which are critical in fast-evolving scenarios. In this paper, we propose TART, a Temporal Action Representation learning framework for Tactical resource control and subsequent maneuver generation. TART leverages contrastive learning based on a mutual information objective, designed to capture inherent temporal dependencies…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Guidance and Control Systems · Adversarial Robustness in Machine Learning
