TL;DR
This paper investigates how position controller gains influence robot policy learning, revealing that optimal gains depend on the learning method rather than task-specific compliance or stiffness.
Contribution
It systematically analyzes the impact of controller gains on behavior cloning, reinforcement learning, and sim-to-real transfer, offering guidance for gain selection based on learning paradigms.
Findings
Behavior cloning benefits from compliant and overdamped gains.
Reinforcement learning succeeds across all gain regimes with proper hyperparameter tuning.
Stiff and overdamped gains impair sim-to-real transfer.
Abstract
Position controllers have become the dominant interface for executing learned manipulation policies. Yet a critical design decision remains understudied: how should we choose controller gains for policy learning? The conventional wisdom is to select gains based on desired task compliance or stiffness. However, this logic breaks down when controllers are paired with state-conditioned policies: effective stiffness emerges from the interplay between learned reactions and control dynamics, not from gains alone. We argue that gain selection should instead be guided by learnability: how amenable different gain settings are to the learning algorithm in use. In this work, we systematically investigate how position controller gains affect three core components of modern robot learning pipelines: behavior cloning, reinforcement learning from scratch, and sim-to-real transfer. Through extensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
