Trust-Region Neural Moving Horizon Estimation for Robots

Bingheng Wang; Xuyang Chen; and Lin Zhao

arXiv:2309.05955·cs.RO·March 8, 2024

Trust-Region Neural Moving Horizon Estimation for Robots

Bingheng Wang, Xuyang Chen, and Lin Zhao

PDF

Open Access 1 Repo

TL;DR

This paper introduces a trust-region training method for neural moving horizon estimation (NeuroMHE) in robots, significantly improving disturbance estimation accuracy and efficiency with minimal data and computational resources.

Contribution

It develops a second-order derivative computation (MHE Hessian) for NeuroMHE, enabling trust-region optimization that enhances training speed and robustness over traditional gradient descent methods.

Findings

01

Achieves training in under 5 minutes with only 100 data points.

02

Outperforms state-of-the-art neural estimators by up to 68.1% in force estimation accuracy.

03

Uses only 1.4% of the parameters of previous models.

Abstract

Accurate disturbance estimation is essential for safe robot operations. The recently proposed neural moving horizon estimation (NeuroMHE), which uses a portable neural network to model the MHE's weightings, has shown promise in further pushing the accuracy and efficiency boundary. Currently, NeuroMHE is trained through gradient descent, with its gradient computed recursively using a Kalman filter. This paper proposes a trust-region policy optimization method for training NeuroMHE. We achieve this by providing the second-order derivatives of MHE, referred to as the MHE Hessian. Remarkably, we show that much of computation already used to obtain the gradient, especially the Kalman filter, can be efficiently reused to compute the MHE Hessian. This offers linear computational complexity relative to the MHE horizon. As a case study, we evaluate the proposed trust region NeuroMHE on real…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

binghengnus/tr-neuromhe
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTarget Tracking and Data Fusion in Sensor Networks · Inertial Sensor and Navigation · Adaptive Control of Nonlinear Systems