Loss Knows Best: Detecting Annotation Errors in Videos via Loss Trajectories

Praditha Alwis; Soumyadeep Chandra; Deepak Ravikumar; Kaushik Roy

arXiv:2602.15154·cs.CV·February 18, 2026

Loss Knows Best: Detecting Annotation Errors in Videos via Loss Trajectories

Praditha Alwis, Soumyadeep Chandra, Deepak Ravikumar, Kaushik Roy

PDF

Open Access

TL;DR

This paper introduces a model-agnostic method for detecting annotation errors in videos by analyzing loss trajectories across training checkpoints, effectively identifying mislabeling and ordering issues without ground truth.

Contribution

The authors propose a novel approach using cumulative sample loss trajectories to detect annotation errors in video datasets, applicable across different datasets and models.

Findings

01

Effective detection of mislabeling and disordering in videos

02

Strong performance demonstrated on EgoPER and Cholec80 datasets

03

No ground truth required for error detection

Abstract

High-quality video datasets are foundational for training robust models in tasks like action recognition, phase detection, and event segmentation. However, many real-world video datasets suffer from annotation errors such as *mislabeling*, where segments are assigned incorrect class labels, and *disordering*, where the temporal sequence does not follow the correct progression. These errors are particularly harmful in phase-annotated tasks, where temporal consistency is critical. We propose a novel, model-agnostic method for detecting annotation errors by analyzing the Cumulative Sample Loss (CSL)--defined as the average loss a frame incurs when passing through model checkpoints saved across training epochs. This per-frame loss trajectory acts as a dynamic fingerprint of frame-level learnability. Mislabeled or disordered frames tend to show consistently high or irregular loss patterns,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Domain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis