Uncertainty-sensitive Activity Recognition: a Reliability Benchmark and the CARING Models
Alina Roitberg, Monica Haurilet, Manuel Martinez, Rainer, Stiefelhagen

TL;DR
This paper evaluates the calibration of confidence scores in activity recognition models, introduces a new calibration method called CARING, and demonstrates its superior reliability over existing approaches across multiple datasets.
Contribution
It is the first to benchmark confidence calibration in activity recognition and proposes the CARING model for improved uncertainty estimation.
Findings
CARING outperforms native models and temperature scaling in calibration accuracy.
Standard models' confidence scores poorly reflect true uncertainty.
Benchmark datasets with reliability metrics are introduced for activity recognition.
Abstract
Beyond assigning the correct class, an activity recognition model should also be able to determine, how certain it is in its predictions. We present the first study of how welthe confidence values of modern action recognition architectures indeed reflect the probability of the correct outcome and propose a learning-based approach for improving it. First, we extend two popular action recognition datasets with a reliability benchmark in form of the expected calibration error and reliability diagrams. Since our evaluation highlights that confidence values of standard action recognition architectures do not represent the uncertainty well, we introduce a new approach which learns to transform the model output into realistic confidence estimates through an additional calibration network. The main idea of our Calibrated Action Recognition with Input Guidance (CARING) model is to learn an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
