Functional Regularization for Reinforcement Learning via Learned Fourier Features
Alexander C. Li, Deepak Pathak

TL;DR
This paper introduces a Fourier basis embedding architecture for deep reinforcement learning that enhances sample efficiency and stability by controlling frequency fitting, supported by theoretical analysis and empirical results.
Contribution
It presents a novel learned Fourier feature architecture for RL, with theoretical insights linking initial variance tuning to functional regularization, improving learning stability and efficiency.
Findings
Improved sample efficiency in RL benchmarks.
Theoretically links Fourier basis variance to regularization.
Empirical benefits over baseline methods.
Abstract
We propose a simple architecture for deep reinforcement learning by embedding inputs into a learned Fourier basis and show that it improves the sample efficiency of both state-based and image-based RL. We perform infinite-width analysis of our architecture using the Neural Tangent Kernel and theoretically show that tuning the initial variance of the Fourier basis is equivalent to functional regularization of the learned deep network. That is, these learned Fourier features allow for adjusting the degree to which networks underfit or overfit different frequencies in the training data, and hence provide a controlled mechanism to improve the stability and performance of RL optimization. Empirically, this allows us to prioritize learning low-frequency functions and speed up learning by reducing networks' susceptibility to noise in the optimization process, such as during Bellman updates.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsReinforcement Learning in Robotics · Adaptive Dynamic Programming Control · Control Systems and Identification
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
