Injecting Planning-Awareness into Prediction and Detection Evaluation
Boris Ivanovic, Marco Pavone

TL;DR
This paper proposes task-aware evaluation metrics for perception and forecasting in autonomous driving, aiming to better predict real-world planning outcomes than traditional accuracy metrics.
Contribution
It introduces novel task-aware metrics that improve the assessment of perception and prediction models in safety-critical autonomous systems.
Findings
Task-aware metrics better predict downstream planning success.
Traditional metrics can be misleading for real-world decision making.
Experiments validate improved correlation with closed-loop performance.
Abstract
Detecting other agents and forecasting their behavior is an integral part of the modern robotic autonomy stack, especially in safety-critical scenarios entailing human-robot interaction such as autonomous driving. Due to the importance of these components, there has been a significant amount of interest and research in perception and trajectory forecasting, resulting in a wide variety of approaches. Common to most works, however, is the use of the same few accuracy-based evaluation metrics, e.g., intersection-over-union, displacement error, log-likelihood, etc. While these metrics are informative, they are task-agnostic and outputs that are evaluated as equal can lead to vastly different outcomes in downstream planning and decision making. In this work, we take a step back and critically assess current evaluation metrics, proposing task-aware metrics as a better measure of performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Anomaly Detection Techniques and Applications · Bayesian Modeling and Causal Inference
