Amortized Safe Active Learning for Real-Time Data Acquisition: Pretrained Neural Policies From Simulated Nonparametric Functions
Cen-You Li, Marc Toussaint, Barbara Rakitsch, Christoph Zimmer

TL;DR
This paper introduces an amortized safe active learning approach that uses pretrained neural policies to enable real-time data acquisition while respecting safety constraints, significantly reducing computational costs.
Contribution
It replaces computationally intensive online GP updates with pretrained neural policies trained on simulated functions, enabling fast, safe, and effective active learning.
Findings
Achieves significant speed improvements over traditional methods.
Maintains learning quality while enabling real-time decision-making.
Framework is modular and adaptable to unconstrained active learning.
Abstract
Safe active learning (AL) is a sequential scheme for learning unknown systems while respecting safety constraints during data acquisition. Existing methods often rely on Gaussian processes (GPs) to model the task and safety constraints, requiring repeated GP updates and constrained acquisition optimization--incurring significant computations which are challenging for real-time decision-making. We propose amortized AL for regression and amortized safe AL, replacing expensive online computations with a pretrained neural policy. Inspired by recent advances in amortized Bayesian experimental design, we leverage GPs as pretraining simulators. We train our policy prior to the AL deployment on simulated nonparametric functions, using Fourier feature-based GP sampling and a differentiable acquisition objective that is safety-aware in the safe AL setting. At deployment, our policy selects…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
