Affordance Agent Harness: Verification-Gated Skill Orchestration

Haojian Huang; Jiahao Shi; Yinchuan Li; Yingcong Chen

arXiv:2605.00663·cs.RO·May 11, 2026

Affordance Agent Harness: Verification-Gated Skill Orchestration

Haojian Huang, Jiahao Shi, Yinchuan Li, Yingcong Chen

PDF

1 Repo

TL;DR

This paper introduces Affordance Agent Harness, a closed-loop system that adaptively orchestrates multiple skills for improved affordance grounding in complex scenes, balancing accuracy and inference cost.

Contribution

It presents a novel unified framework with evidence management, episodic memory, and adaptive skill routing, outperforming fixed pipelines in accuracy and efficiency.

Findings

01

Achieves better accuracy-cost trade-offs than fixed pipelines.

02

Reduces average skill calls and latency in affordance grounding.

03

Improves robustness in challenging, occluded, and ambiguous scenes.

Abstract

Affordance grounding requires identifying where and how an agent should interact in open-world scenes, where actionable regions are often small, occluded, reflective, and visually ambiguous. Recent systems therefore combine multiple skills (e.g., detection, segmentation, interaction-imagination), yet most orchestrate them with fixed pipelines that are poorly matched to per-instance difficulty, offer limited targeted recovery from intermediate errors, and fail to reuse experience from recurring objects. These failures expose a systems problem: test-time grounding must acquire the right evidence, decide whether that evidence is reliable enough to commit, and do so under bounded inference cost without access to labels. We propose Affordance Agent Harness, a closed-loop runtime that unifies heterogeneous skills with an evidence store and cost control, retrieves episodic memories to provide…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://tenplusgood.github.io/a-harness-page
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.