Input-Side Variance Suppression under Non-Normal Transient Amplification in Continuous-Control Reinforcement Learning
Wu Yue

TL;DR
This paper investigates how non-normal transient amplification in control loops causes high variance in reinforcement learning and proposes an input-side variance suppression layer to mitigate this effect.
Contribution
It introduces a novel input-side variance suppression layer and provides control-theoretic analysis to reduce variance caused by non-normal amplification in RL systems.
Findings
Input-side suppression reduces control jitter and variance.
Non-normal amplification significantly contributes to variance in RL.
Control interventions validate the mechanism on quadrotor tasks.
Abstract
Continuous-control reinforcement learning (RL) often exhibits large closed-loop variance, high-frequency control jitter, and sensitivity to disturbance injection. Existing explanations usually emphasize disturbance sources such as action noise, exploration perturbations, or policy nonsmoothness. This letter studies a complementary amplifier-side perspective: in nominally stable yet strongly non-normal closed loops, small input perturbations can undergo transient amplification and lead to disproportionately large state covariance. Motivated by this source--amplifier decomposition, we introduce an input-side variance suppression layer that operates between the learned policy and the plant input to reduce applied-input variance and step-to-step jitter. To separate mechanism from correlation, we use two control-theoretic interventions: one varies only eigenvector geometry under fixed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
