Preferential Bayesian Optimization with Crash Feedback
Johanna Menn, David Stenger, Sebastian Trimpe

TL;DR
CrashPBO enhances preferential Bayesian optimization by allowing users to report crashes, significantly reducing crashes and improving data efficiency in robotics experiments.
Contribution
It introduces CrashPBO, a novel method enabling the incorporation of crash feedback into Bayesian optimization, improving safety and efficiency.
Findings
CrashPBO reduces crashes by 63% in synthetic benchmarks.
It increases data efficiency in robotics experiments.
The method is applicable and transferable across multiple robotics platforms.
Abstract
Bayesian optimization is a popular black-box optimization method for parameter learning in control and robotics. It typically requires an objective function that reflects the user's optimization goal. However, in practical applications, this objective function is often inaccessible due to complex or unmeasurable performance metrics. Preferential Bayesian optimization (PBO) overcomes this limitation by leveraging human feedback through pairwise comparisons, eliminating the need for explicit performance quantification. When applying PBO to hardware systems, such as in quadcopter control, crashes can cause time-consuming experimental resets, wear and tear, or otherwise undesired outcomes. Standard PBO methods cannot incorporate feedback from such crashed experiments, resulting in the exploration of parameters that frequently lead to experimental crashes. We thus introduce CrashPBO, a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
