Learning Computational Efficient Bots with Costly Features
Anthony Kobanda, Valliappan C.A., Joshua Romoff, Ludovic Denoyer

TL;DR
This paper introduces a cost-aware decision transformer that dynamically selects features to balance computational efficiency and task performance in real-time decision-making scenarios.
Contribution
It proposes the Budgeted Decision Transformer, extending the Decision Transformer to incorporate feature cost constraints for improved efficiency.
Findings
Achieves comparable performance with fewer computational resources.
Demonstrates effectiveness on D4RL benchmarks and 3D video game environments.
Reduces inference time by dynamically selecting features.
Abstract
Deep reinforcement learning (DRL) techniques have become increasingly used in various fields for decision-making processes. However, a challenge that often arises is the trade-off between both the computational efficiency of the decision-making process and the ability of the learned agent to solve a particular task. This is particularly critical in real-time settings such as video games where the agent needs to take relevant decisions at a very high frequency, with a very limited inference time. In this work, we propose a generic offline learning approach where the computation cost of the input features is taken into account. We derive the Budgeted Decision Transformer as an extension of the Decision Transformer that incorporates cost constraints to limit its cost at inference. As a result, the model can dynamically choose the best input features at each timestep. We demonstrate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Reinforcement Learning in Robotics · Anomaly Detection Techniques and Applications
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Adam · Label Smoothing · Layer Normalization · Absolute Position Encodings · Residual Connection
