IMPACT-CYCLE: A Contract-Based Multi-Agent System for Claim-Level Supervisory Correction of Long-Video Semantic Memory

Weitong Kong; Di Wen; Kunyu Peng; David Schneider; Zeyun Zhong; Alexander Jaus; Zdravko Marinov; Jiale Wei; Ruiping Liu; Junwei Zheng; Yufan Chen; Lei Qi; Rainer Stiefelhagen

arXiv:2604.20136·cs.CV·April 23, 2026

IMPACT-CYCLE: A Contract-Based Multi-Agent System for Claim-Level Supervisory Correction of Long-Video Semantic Memory

Weitong Kong, Di Wen, Kunyu Peng, David Schneider, Zeyun Zhong, Alexander Jaus, Zdravko Marinov, Jiale Wei, Ruiping Liu, Junwei Zheng, Yufan Chen, Lei Qi, Rainer Stiefelhagen

PDF

1 Repo

TL;DR

IMPACT-CYCLE introduces a multi-agent, claim-level supervisory system for long-video understanding that improves reasoning accuracy and reduces human correction effort by structuring semantic memory and enabling targeted corrections.

Contribution

The paper presents a novel multi-agent framework that reformulates long-video understanding as iterative claim maintenance with explicit supervision and correction, reducing manual effort.

Findings

01

Improved downstream reasoning scores from 0.71 to 0.79 in VQA.

02

Achieved a 4.8x reduction in human arbitration cost.

03

Significantly lowered workload compared to manual annotation.

Abstract

Correcting errors in long-video understanding is disproportionately costly: existing multimodal pipelines produce opaque, end-to-end outputs that expose no intermediate state for inspection, forcing annotators to revisit raw video and reconstruct temporal logic from scratch. The core bottleneck is not generation quality alone, but the absence of a supervisory interface through which human effort can be proportional to the scope of each error. We present IMPACT-CYCLE, a supervisory multi-agent system that reformulates long-video understanding as iterative claim-level maintenance of a shared semantic memory -- a structured, versioned state encoding typed claims, a claim dependency graph, and a provenance log. Role-specialized agents operating under explicit authority contracts decompose verification into local object-relation correctness, cross-temporal consistency, and global semantic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

MKong17/IMPACT_CYCLE
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.