Learning What Helps: Task-Aligned Context Selection for Vision Tasks

Jingyu Guo; Emir Konuk; Fredrik Strand; Christos Matsoukas; Kevin Smith

arXiv:2512.00489·cs.CV·December 2, 2025

Learning What Helps: Task-Aligned Context Selection for Vision Tasks

Jingyu Guo, Emir Konuk, Fredrik Strand, Christos Matsoukas, Kevin Smith

PDF

Open Access

TL;DR

TACS is a novel framework that learns to select truly helpful contextual examples for vision tasks, improving performance across diverse datasets by aligning selection with task-specific rewards.

Contribution

It introduces a joint training method combining gradient supervision and reinforcement learning to optimize example selection for vision tasks.

Findings

01

TACS outperforms similarity-based retrieval across 18 datasets.

02

It improves performance especially in data-limited or challenging scenarios.

03

The method effectively aligns example selection with task-specific rewards.

Abstract

Humans often resolve visual uncertainty by comparing an image with relevant examples, but ViTs lack the ability to identify which examples would improve their predictions. We present Task-Aligned Context Selection (TACS), a framework that learns to select paired examples which truly improve task performance rather than those that merely appear similar. TACS jointly trains a selector network with the task model through a hybrid optimization scheme combining gradient-based supervision and reinforcement learning, making retrieval part of the learning objective. By aligning selection with task rewards, TACS enables discriminative models to discover which contextual examples genuinely help. Across 18 datasets covering fine-grained recognition, medical image classification, and medical image segmentation, TACS consistently outperforms similarity-based retrieval, particularly in challenging or…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Image Retrieval and Classification Techniques