Trust-Region Neural Moving Horizon Estimation for Robots
Bingheng Wang, Xuyang Chen, and Lin Zhao

TL;DR
This paper introduces a trust-region training method for neural moving horizon estimation (NeuroMHE) in robots, significantly improving disturbance estimation accuracy and efficiency with minimal data and computational resources.
Contribution
It develops a second-order derivative computation (MHE Hessian) for NeuroMHE, enabling trust-region optimization that enhances training speed and robustness over traditional gradient descent methods.
Findings
Achieves training in under 5 minutes with only 100 data points.
Outperforms state-of-the-art neural estimators by up to 68.1% in force estimation accuracy.
Uses only 1.4% of the parameters of previous models.
Abstract
Accurate disturbance estimation is essential for safe robot operations. The recently proposed neural moving horizon estimation (NeuroMHE), which uses a portable neural network to model the MHE's weightings, has shown promise in further pushing the accuracy and efficiency boundary. Currently, NeuroMHE is trained through gradient descent, with its gradient computed recursively using a Kalman filter. This paper proposes a trust-region policy optimization method for training NeuroMHE. We achieve this by providing the second-order derivatives of MHE, referred to as the MHE Hessian. Remarkably, we show that much of computation already used to obtain the gradient, especially the Kalman filter, can be efficiently reused to compute the MHE Hessian. This offers linear computational complexity relative to the MHE horizon. As a case study, we evaluate the proposed trust region NeuroMHE on real…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTarget Tracking and Data Fusion in Sensor Networks · Inertial Sensor and Navigation · Adaptive Control of Nonlinear Systems
