Uncertainty-sensitive Activity Recognition: a Reliability Benchmark and   the CARING Models

Alina Roitberg; Monica Haurilet; Manuel Martinez; Rainer; Stiefelhagen

arXiv:2101.00468·cs.CV·January 5, 2021

Uncertainty-sensitive Activity Recognition: a Reliability Benchmark and the CARING Models

Alina Roitberg, Monica Haurilet, Manuel Martinez, Rainer, Stiefelhagen

PDF

TL;DR

This paper evaluates the calibration of confidence scores in activity recognition models, introduces a new calibration method called CARING, and demonstrates its superior reliability over existing approaches across multiple datasets.

Contribution

It is the first to benchmark confidence calibration in activity recognition and proposes the CARING model for improved uncertainty estimation.

Findings

01

CARING outperforms native models and temperature scaling in calibration accuracy.

02

Standard models' confidence scores poorly reflect true uncertainty.

03

Benchmark datasets with reliability metrics are introduced for activity recognition.

Abstract

Beyond assigning the correct class, an activity recognition model should also be able to determine, how certain it is in its predictions. We present the first study of how welthe confidence values of modern action recognition architectures indeed reflect the probability of the correct outcome and propose a learning-based approach for improving it. First, we extend two popular action recognition datasets with a reliability benchmark in form of the expected calibration error and reliability diagrams. Since our evaluation highlights that confidence values of standard action recognition architectures do not represent the uncertainty well, we introduce a new approach which learns to transform the model output into realistic confidence estimates through an additional calibration network. The main idea of our Calibrated Action Recognition with Input Guidance (CARING) model is to learn an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.