Reinforcement Learning with Generative Models for Compact Support Sets

Nico Schiavone; Xingyu Li

arXiv:2404.16300·cs.LG·April 26, 2024

Reinforcement Learning with Generative Models for Compact Support Sets

Nico Schiavone, Xingyu Li

PDF

Open Access 1 Repo

TL;DR

This paper introduces a reinforcement learning framework that leverages generative models to create small, focused synthetic support sets, significantly improving classification accuracy without extra labeling costs.

Contribution

It presents a novel RL-based method for generating targeted support sets using foundation models, enhancing small-sample classification tasks.

Findings

01

Significant accuracy improvements on classification tasks.

02

Effective generation of small, focused support sets.

03

No additional labeling or data costs required.

Abstract

Foundation models contain a wealth of information from their vast number of training samples. However, most prior arts fail to extract this information in a precise and efficient way for small sample sizes. In this work, we propose a framework utilizing reinforcement learning as a control for foundation models, allowing for the granular generation of small, focused synthetic support sets to augment the performance of neural network models on real data classification tasks. We first allow a reinforcement learning agent access to a novel context based dictionary; the agent then uses this dictionary with a novel prompt structure to form and optimize prompts as inputs to generative models, receiving feedback based on a reward function combining the change in validation accuracy and entropy. A support set is formed this way over several exploration steps. Our framework produced excellent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mesophil/deeprl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsScheduling and Optimization Algorithms · Elevator Systems and Control

MethodsSparse Evolutionary Training