Trajectory Optimization Under Stochastic Dynamics Leveraging Maximum Mean Discrepancy
Basant Sharma, Arun Kumar Singh

TL;DR
This paper introduces a risk-aware trajectory optimization method that reduces the number of required rollouts by distilling statistical information using RKHS and MMD, leading to safer trajectories with fewer samples.
Contribution
It develops an algorithm to distill statistical information from many rollouts into a smaller set and introduces a novel MMD-based surrogate for collision risk estimation.
Findings
Achieves safer trajectories with fewer rollouts in low-sample regimes.
Outperforms CVaR-based methods in risk estimation accuracy.
Demonstrates effectiveness through extensive benchmarking.
Abstract
This paper addresses sampling-based trajectory optimization for risk-aware navigation under stochastic dynamics. Typically such approaches operate by computing perturbed rollouts around the nominal dynamics to estimate the collision risk associated with a sequence of control commands. We consider a setting where it is expensive to estimate risk using perturbed rollouts, for example, due to expensive collision-checks. We put forward two key contributions. First, we develop an algorithm that distills the statistical information from a larger set of rollouts to a reduced-set with sample size . Consequently, we estimate collision risk using just rollouts instead of . Second, we formulate a novel surrogate for the collision risk that can leverage the distilled statistical information contained in the reduced-set. We formalize both algorithmic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsForecasting Techniques and Applications
