Control-Optimized Deep Reinforcement Learning for Artificially Intelligent Autonomous Systems

Oren Fivel; Matan Rudman; Kobi Cohen

arXiv:2507.00268 · cs.RO · July 2, 2025

Open Access

TL;DR

This paper introduces a control-optimized deep reinforcement learning framework that explicitly models and compensates for action execution mismatches, improving robustness of AI agents in real-world engineering systems.

Contribution

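The paper develops a two-stage process, first determining the desired action and then selecting the control signal that executes it, and trains the agent with control errors taken into account, bridging the gap between idealized learning and real-world system uncertainties.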

Findings

01. Enhanced robustness against execution errors in simulation environments

02. Effective compensation for actuation mismatches during training

03. Improved decision-making accuracy under real-world uncertainties

Abstract

Deep reinforcement learning (DRL) has become a powerful tool for complex decision-making in machine learning and AI. However, traditional methods often assume perfect action execution, overlooking the uncertainties and deviations between an agent's selected actions and the actual system response. In real-world applications, such as robotics, mechatronics, and communication networks, execution mismatches arising from system dynamics, hardware constraints, and latency can significantly degrade performance. This work advances AI by developing a novel control-optimized DRL framework that explicitly models and compensates for action execution mismatches, a challenge largely overlooked in existing methods. Our approach establishes a structured two-stage process: determining the desired action and selecting the appropriate control signal to ensure proper execution. It trains the agent while…
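To make the idea of an action execution mismatch concrete, here is a minimal toy sketch of the two stages described in the abstract: a desired action is chosen, then a control signal is selected to make the system actually reach it. Everything in it is an assumption for illustration and is not taken from the paper: the first-order lag actuator, the `lag` and `u_max` values, and the function names `actuator_step`, `naive_control`, `compensating_control`, and `execution_error` are all hypothetical.

```python
# Toy illustration only: the actuator model, the constants, and all
# function names are hypothetical and are not taken from the paper.

def actuator_step(x, u, lag=0.8):
    """Assumed first-order lag: the executed action x moves only partway toward the control u."""
    return lag * x + (1.0 - lag) * u


def naive_control(desired, x):
    """Assume perfect execution: send the desired action directly as the control signal."""
    return desired


def compensating_control(desired, x, lag=0.8, u_max=5.0):
    """Invert the assumed lag model, then clip to a hypothetical actuator limit."""
    u = (desired - lag * x) / (1.0 - lag)
    return max(-u_max, min(u_max, u))


def execution_error(controller, desired, steps=3, x0=0.0):
    """Gap between the desired action and what the actuator delivers after a few control steps."""
    x = x0
    for _ in range(steps):
        x = actuator_step(x, controller(desired, x))
    return abs(desired - x)


naive_err = execution_error(naive_control, desired=1.0)
comp_err = execution_error(compensating_control, desired=1.0)
print(f"naive mismatch:       {naive_err:.3f}")
print(f"compensated mismatch: {comp_err:.3f}")
```

In this toy setting the naive controller leaves a large gap between the selected and the executed action, while the controller that accounts for the actuator dynamics closes it. The paper's framework addresses this kind of gap inside DRL training; the sketch shows only the mismatch itself, not the authors' algorithm.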

Taxonomy

Topics: Reinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Adaptive Dynamic Programming Control