ReCCur: A Recursive Corner-Case Curation Framework for Robust Vision-Language Understanding in Open and Edge Scenarios

Yihan Wei; Shenghai Yuan; Tianchen Deng; Boyang Lou; and Enwen Hu

arXiv:2601.03011·cs.CV·January 7, 2026

ReCCur: A Recursive Corner-Case Curation Framework for Robust Vision-Language Understanding in Open and Edge Scenarios

Yihan Wei, Shenghai Yuan, Tianchen Deng, Boyang Lou, and Enwen Hu

PDF

Open Access

TL;DR

ReCCur is a low-compute recursive framework that curates high-quality corner-case datasets for robust vision-language understanding, enabling effective training and evaluation in real-world, resource-constrained scenarios.

Contribution

The paper introduces ReCCur, a novel recursive pipeline that efficiently curates fine-grained, explainable corner-case datasets from noisy web data with minimal human supervision.

Findings

01

Steadily improves data purity and separability in corner cases.

02

Operates effectively on consumer-grade GPUs.

03

Requires minimal human supervision for high-quality curation.

Abstract

Corner cases are rare or extreme scenarios that drive real-world failures, but they are difficult to curate at scale: web data are noisy, labels are brittle, and edge deployments preclude large retraining. We present ReCCur (Recursive Corner-Case Curation), a low-compute framework that converts noisy web imagery into auditable fine-grained labels via a multi-agent recursive pipeline. First, large-scale data acquisition and filtering expands a domain vocabulary with a vision-language model (VLM), crawls the web, and enforces tri-modal (image, description, keyword) consistency with light human spot checks to yield refined candidates. Next, mixture-of-experts knowledge distillation uses complementary encoders (e.g., CLIP, DINOv2, BEiT) for kNN voting with dual-confidence activation and uncertainty sampling, converging to a high-precision set. Finally, region-evidence VLM adversarial…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Advanced Neural Network Applications · Adversarial Robustness in Machine Learning