Input-Side Variance Suppression under Non-Normal Transient Amplification in Continuous-Control Reinforcement Learning

Wu Yue

arXiv:2604.17744·eess.SY·April 21, 2026

Input-Side Variance Suppression under Non-Normal Transient Amplification in Continuous-Control Reinforcement Learning

Wu Yue

PDF

TL;DR

This paper investigates how non-normal transient amplification in control loops causes high variance in reinforcement learning and proposes an input-side variance suppression layer to mitigate this effect.

Contribution

It introduces a novel input-side variance suppression layer and provides control-theoretic analysis to reduce variance caused by non-normal amplification in RL systems.

Findings

01

Input-side suppression reduces control jitter and variance.

02

Non-normal amplification significantly contributes to variance in RL.

03

Control interventions validate the mechanism on quadrotor tasks.

Abstract

Continuous-control reinforcement learning (RL) often exhibits large closed-loop variance, high-frequency control jitter, and sensitivity to disturbance injection. Existing explanations usually emphasize disturbance sources such as action noise, exploration perturbations, or policy nonsmoothness. This letter studies a complementary amplifier-side perspective: in nominally stable yet strongly non-normal closed loops, small input perturbations can undergo transient amplification and lead to disproportionately large state covariance. Motivated by this source--amplifier decomposition, we introduce an input-side variance suppression layer that operates between the learned policy and the plant input to reduce applied-input variance and step-to-step jitter. To separate mechanism from correlation, we use two control-theoretic interventions: one varies only eigenvector geometry under fixed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.