Accelerating Visual-Policy Learning through Parallel Differentiable Simulation

Haoxiang You; Yilang Liu; Ian Abraham

arXiv:2505.10646·cs.LG·November 12, 2025

Accelerating Visual-Policy Learning through Parallel Differentiable Simulation

Haoxiang You, Yilang Liu, Ian Abraham

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a computationally efficient visual policy learning algorithm that uses differentiable simulation with decoupled rendering, resulting in faster training and improved performance on complex control tasks.

Contribution

The authors propose a novel decoupling approach for rendering in differentiable simulation, reducing computational overhead and enhancing policy gradient stability.

Findings

01

Significantly reduces training time on visual control benchmarks.

02

Outperforms baseline methods in final return metrics.

03

Achieves 4x improvement on humanoid locomotion tasks.

Abstract

In this work, we propose a computationally efficient algorithm for visual policy learning that leverages differentiable simulation and first-order analytical policy gradients. Our approach decouple the rendering process from the computation graph, enabling seamless integration with existing differentiable simulation ecosystems without the need for specialized differentiable rendering software. This decoupling not only reduces computational and memory overhead but also effectively attenuates the policy gradient norm, leading to more stable and smoother optimization. We evaluate our method on standard visual control benchmarks using modern GPU-accelerated simulation. Experiments show that our approach significantly reduces wall-clock training time and consistently outperforms all baseline methods in terms of final returns. Notably, on complex tasks such as humanoid locomotion, our method…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nm512/dreamerv3-torch
pytorchOfficial

Videos

Accelerating Visual-Policy Learning through Parallel Differentiable Simulation· slideslive

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Human Motion and Animation · Reinforcement Learning in Robotics