Offline Learning for Planning: A Summary
Giorgio Angelotti, Nicolas Drougard, Caroline Ponzoni Carvalho Chanel

TL;DR
This paper reviews offline learning methods for autonomous agents, focusing on how to leverage existing data to optimize policies without further environment interaction, and discusses practical and theoretical aspects of these approaches.
Contribution
It summarizes state-of-the-art offline learning algorithms that incorporate uncertainty constraints and discusses their practical utility and potential improvements using generative models.
Findings
Offline learning reduces the need for costly environment interactions.
Uncertainty-dependent constraints help mitigate distributional mismatch.
Generative Adversarial Networks can estimate distributional shifts.
Abstract
The training of autonomous agents often requires expensive and unsafe trial-and-error interactions with the environment. Nowadays several data sets containing recorded experiences of intelligent agents performing various tasks, spanning from the control of unmanned vehicles to human-robot interaction and medical applications are accessible on the internet. With the intention of limiting the costs of the learning procedure it is convenient to exploit the information that is already available rather than collecting new data. Nevertheless, the incapability to augment the batch can lead the autonomous agents to develop far from optimal behaviours when the sampled experiences do not allow for a good estimate of the true distribution of the environment. Offline learning is the area of machine learning concerned with efficiently obtaining an optimal policy with a batch of previously collected…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Gaussian Processes and Bayesian Inference · Machine Learning and Data Classification
