Fourier Controller Networks for Real-Time Decision-Making in Embodied Learning
Hengkai Tan, Songming Liu, Kai Ma, Chengyang Ying, Xingxing Zhang,, Hang Su, Jun Zhu

TL;DR
This paper introduces FCNet, a frequency domain-based neural network for real-time decision-making in embodied learning, which improves efficiency and performance over Transformers by leveraging Fourier transforms for feature encoding.
Contribution
The paper proposes FCNet, a novel frequency domain neural network architecture that enables efficient, real-time reinforcement learning for robotics, outperforming Transformer-based models.
Findings
FCNet outperforms Transformer on diverse robotics datasets.
FCNet achieves real-time inference with parallel training.
Frequency domain features are effective for robot trajectory modeling.
Abstract
Transformer has shown promise in reinforcement learning to model time-varying features for obtaining generalized low-level robot policies on diverse robotics datasets in embodied learning. However, it still suffers from the issues of low data efficiency and high inference latency. In this paper, we propose to investigate the task from a new perspective of the frequency domain. We first observe that the energy density in the frequency domain of a robot's trajectory is mainly concentrated in the low-frequency part. Then, we present the Fourier Controller Network (FCNet), a new network that uses Short-Time Fourier Transform (STFT) to extract and encode time-varying features through frequency domain interpolation. In order to do real-time decision-making, we further adopt FFT and Sliding DFT methods in the model architecture to achieve parallel training and efficient recurrent inference.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning · Online Learning and Analytics · Cognitive Science and Education Research
MethodsAttention Is All You Need · Linear Layer · Byte Pair Encoding · Label Smoothing · Adam · Residual Connection · Position-Wise Feed-Forward Layer · Multi-Head Attention · Dropout · Dense Connections
