Line of Sight Curvature for Missile Guidance using Reinforcement Meta-Learning
Brian Gaudet, Roberto Furfaro

TL;DR
This paper presents a reinforcement meta-learning approach to optimize a line of sight curvature policy for missile guidance, significantly improving accuracy against maneuvering targets without needing target acceleration estimates.
Contribution
The paper introduces a novel reinforcement meta-learning method that optimizes a line of sight curvature policy using a recurrent neural network, enhancing guidance system performance against maneuvering targets.
Findings
Outperforms augmented proportional navigation with perfect target acceleration knowledge.
Achieves higher accuracy with less control effort across various target maneuvers.
Does not require target acceleration estimates for effective guidance.
Abstract
We use reinforcement meta learning to optimize a line of sight curvature policy that increases the effectiveness of a guidance system against maneuvering targets. The policy is implemented as a recurrent neural network that maps navigation system outputs to a Euler 321 attitude representation. The attitude representation is then used to construct a direction cosine matrix that biases the observed line of sight vector. The line of sight rotation rate derived from the biased line of sight is then mapped to a commanded acceleration by the guidance system. By varying the bias as a function of navigation system outputs, the policy enhances accuracy against highly maneuvering targets. Importantly, our method does not require an estimate of target acceleration. In our experiments, we demonstrate that when our method is combined with proportional navigation, the system significantly outperforms…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGuidance and Control Systems · Target Tracking and Data Fusion in Sensor Networks · Aerospace and Aviation Technology
