Discontinuity-Sensitive Optimal Control Learning by Mixture of Experts
Gao Tang, Kris Hauser

TL;DR
This paper introduces a mixture of experts model to learn solutions of parametric optimal control problems, effectively handling discontinuities caused by nonconvexities and control switching, resulting in more accurate and reliable trajectory predictions.
Contribution
It presents a novel discontinuity-sensitive learning approach using clustering and mixture of experts to improve accuracy and reliability in optimal control solution approximation.
Findings
Outperforms joint training of MoE in accuracy and data efficiency
Achieves lower prediction error with fewer parameters
Enhances trajectory tracking reliability
Abstract
This paper proposes a discontinuity-sensitive approach to learn the solutions of parametric optimal control problems with high accuracy. Many tasks, ranging from model predictive control to reinforcement learning, may be solved by learning optimal solutions as a function of problem parameters. However, nonconvexity, discrete homotopy classes, and control switching cause discontinuity in the parameter-solution mapping, thus making learning difficult for traditional continuous function approximators. A mixture of experts (MoE) model composed of a classifier and several regressors is proposed to address such an issue. The optimal trajectories of different parameters are clustered such that in each cluster the trajectories are continuous function of problem parameters. Numerical examples on benchmark problems show that training the classifier and regressors individually outperforms joint…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Control Systems Optimization · Reinforcement Learning in Robotics · Adaptive Dynamic Programming Control
