How to Correctly Make Mistakes: A Framework for Constructing and Benchmarking Mistake Aware Egocentric Procedural Videos

Olga Loginova; Frank Keller

arXiv:2604.15134·cs.CV·April 17, 2026

How to Correctly Make Mistakes: A Framework for Constructing and Benchmarking Mistake Aware Egocentric Procedural Videos

Olga Loginova, Frank Keller

PDF

TL;DR

This paper introduces PIE-V, a framework for creating and benchmarking mistake-aware egocentric procedural videos, enabling better evaluation of mistake detection and correction in video-based procedural tasks.

Contribution

PIE-V combines error injection, correction modeling, and video synthesis to generate realistic mistake scenarios and provides a comprehensive rubric for benchmarking procedural video quality.

Findings

01

PIE-V successfully injects 102 mistakes across 17 tasks.

02

The framework generates 27 recovery corrections.

03

A new taxonomy and human rubric evaluate procedural video quality.

Abstract

Reliable procedural monitoring in video requires exposure to naturally occurring human errors and the recoveries that follow. In egocentric recordings, mistakes are often partially occluded by hands and revealed through subtle object state changes, while existing procedural datasets provide limited and inconsistent mistake and correction traces. We present PIE-V (Psychologically Inspired Error injection for Videos), a framework for constructing and benchmarking mistake-aware egocentric procedural videos by augmenting clean keystep procedures with controlled, human-plausible deviations. PIE-V combines a psychology-informed error planner conditioned on procedure phase and semantic step load, a correction planner that models recovery behavior, an LLM writer that performs cascade-consistent rewrites, and an LLM judge that validates procedural coherence and repairs failures. For video…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.