Incentivizing Generative Zero-Shot Learning via Outcome-Reward Reinforcement Learning with Visual Cues

Wenjin Hou; Xiaoxiao Sun; Hehe Fan

arXiv:2603.21138·cs.CV·March 24, 2026

Incentivizing Generative Zero-Shot Learning via Outcome-Reward Reinforcement Learning with Visual Cues

Wenjin Hou, Xiaoxiao Sun, Hehe Fan

PDF

Open Access

TL;DR

This paper introduces RLVC, a reinforcement learning framework with visual cues that improves generative zero-shot learning by producing more task-relevant features, leading to state-of-the-art results.

Contribution

RLVC employs outcome-reward reinforcement learning with visual cues to enhance generative ZSL, addressing task-agnostic feature synthesis issues.

Findings

01

Achieves 4.7% performance gain on ZSL benchmarks.

02

Effectively aligns synthesized features with visual prototypes.

03

Demonstrates state-of-the-art results across three benchmarks.

Abstract

Recent advances in zero-shot learning (ZSL) have demonstrated the potential of generative models. Typically, generative ZSL synthesizes visual features conditioned on semantic prototypes to model the data distribution of unseen classes, followed by training a classifier on the synthesized data. However, the synthesized features often remain task-agnostic, leading to degraded performance. Moreover, inferring a faithful distribution from semantic prototypes alone is insufficient for classes that are semantically similar but visually distinct. To address these and advance ZSL, we propose RLVC, an outcome-reward reinforcement learning RL framework with visual cues for generative ZSL. At its core, RL empowers the generative model to self-evolve, implicitly enhancing its generation capability. In particular, RLVC updates the generative model using an outcome-based reward, encouraging the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Face recognition and analysis · Generative Adversarial Networks and Image Synthesis