RPS: Information Elicitation with Reinforcement Prompt Selection

Tao Wang; Jingyao Lu; Xibo Wang; Haonan Huang; Su Yao; Zhiqiang Hu; Xingyan Chen; Enmao Diao

arXiv:2604.13817·cs.LG·April 16, 2026

RPS: Information Elicitation with Reinforcement Prompt Selection

Tao Wang, Jingyao Lu, Xibo Wang, Haonan Huang, Su Yao, Zhiqiang Hu, Xingyan Chen, Enmao Diao

PDF

TL;DR

This paper introduces RPS, a reinforcement learning framework for adaptive prompt selection to improve information elicitation in open-ended dialogues with large language models.

Contribution

It proposes a novel RL-based approach for prompt selection and introduces IELegal, a new dataset for evaluating dialogue-based information elicitation.

Findings

01

RL agent outperforms random query baseline in synthetic experiments.

02

RPS outperforms static prompt baselines on the IELegal dataset.

03

Adaptive prompt selection enhances information elicitation in dialogue systems.

Abstract

Large language models (LLMs) have shown remarkable capabilities in dialogue generation and reasoning, yet their effectiveness in eliciting user-known but concealed information in open-ended conversations remains limited. In many interactive AI applications, such as personal assistants, tutoring systems, and legal or clinical support, users often withhold sensitive or uncertain information due to privacy concerns, ambiguity, or social hesitation. This makes it challenging for LLMs to gather complete and contextually relevant inputs. In this work, we define the problem of information elicitation in open-ended dialogue settings and propose Reinforcement Prompt Selection (RPS), a lightweight reinforcement learning framework that formulates prompt selection as a sequential decision-making problem. To analyze this problem in a controlled setting, we design a synthetic experiment, where a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.