Reinforcement Learning with Generative Models for Compact Support Sets
Nico Schiavone, Xingyu Li

TL;DR
This paper introduces a reinforcement learning framework that leverages generative models to create small, focused synthetic support sets, significantly improving classification accuracy without extra labeling costs.
Contribution
It presents a novel RL-based method for generating targeted support sets using foundation models, enhancing small-sample classification tasks.
Findings
Significant accuracy improvements on classification tasks.
Effective generation of small, focused support sets.
No additional labeling or data costs required.
Abstract
Foundation models contain a wealth of information from their vast number of training samples. However, most prior arts fail to extract this information in a precise and efficient way for small sample sizes. In this work, we propose a framework utilizing reinforcement learning as a control for foundation models, allowing for the granular generation of small, focused synthetic support sets to augment the performance of neural network models on real data classification tasks. We first allow a reinforcement learning agent access to a novel context based dictionary; the agent then uses this dictionary with a novel prompt structure to form and optimize prompts as inputs to generative models, receiving feedback based on a reward function combining the change in validation accuracy and entropy. A support set is formed this way over several exploration steps. Our framework produced excellent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScheduling and Optimization Algorithms · Elevator Systems and Control
MethodsSparse Evolutionary Training
