Preferential Bayesian Optimization with Crash Feedback

Johanna Menn; David Stenger; Sebastian Trimpe

arXiv:2604.01776·cs.RO·April 3, 2026

Preferential Bayesian Optimization with Crash Feedback

Johanna Menn, David Stenger, Sebastian Trimpe

PDF

TL;DR

CrashPBO enhances preferential Bayesian optimization by allowing users to report crashes, significantly reducing crashes and improving data efficiency in robotics experiments.

Contribution

It introduces CrashPBO, a novel method enabling the incorporation of crash feedback into Bayesian optimization, improving safety and efficiency.

Findings

01

CrashPBO reduces crashes by 63% in synthetic benchmarks.

02

It increases data efficiency in robotics experiments.

03

The method is applicable and transferable across multiple robotics platforms.

Abstract

Bayesian optimization is a popular black-box optimization method for parameter learning in control and robotics. It typically requires an objective function that reflects the user's optimization goal. However, in practical applications, this objective function is often inaccessible due to complex or unmeasurable performance metrics. Preferential Bayesian optimization (PBO) overcomes this limitation by leveraging human feedback through pairwise comparisons, eliminating the need for explicit performance quantification. When applying PBO to hardware systems, such as in quadcopter control, crashes can cause time-consuming experimental resets, wear and tear, or otherwise undesired outcomes. Standard PBO methods cannot incorporate feedback from such crashed experiments, resulting in the exploration of parameters that frequently lead to experimental crashes. We thus introduce CrashPBO, a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.