SimRPD: Optimizing Recruitment Proactive Dialogue Agents through Simulator-Based Data Evaluation and Selection

Zhiyong Cao; Dunqiang Liu; Qi Dai; Haojun Xu; Huaiyan Xu; Huan He; Yafei Liu; Siyuan Liu; XiaoLin Lin; Ke Ma; Ruqian Shi; Sijia Yao; Hao Wang; Sicheng Zhou

arXiv:2601.02871·cs.AI·January 9, 2026

SimRPD: Optimizing Recruitment Proactive Dialogue Agents through Simulator-Based Data Evaluation and Selection

Zhiyong Cao, Dunqiang Liu, Qi Dai, Haojun Xu, Huaiyan Xu, Huan He, Yafei Liu, Siyuan Liu, XiaoLin Lin, Ke Ma, Ruqian Shi, Sijia Yao, Hao Wang, Sicheng Zhou

PDF

Open Access

TL;DR

SimRPD is a three-stage framework that enhances recruitment dialogue agents by using a high-fidelity simulator, a comprehensive evaluation method, and data selection to improve training quality and performance.

Contribution

The paper introduces a novel multi-stage approach combining a user simulator, a Chain-of-Intention evaluation framework, and data selection to improve goal-oriented dialogue agent training.

Findings

01

SimRPD outperforms existing data selection strategies in recruitment scenarios.

02

The high-fidelity simulator effectively generates large-scale conversational data.

03

The evaluation framework ensures high-quality data selection for training.

Abstract

Task-oriented proactive dialogue agents play a pivotal role in recruitment, particularly for steering conversations towards specific business outcomes, such as acquiring social-media contacts for private-channel conversion. Although supervised fine-tuning and reinforcement learning have proven effective for training such agents, their performance is heavily constrained by the scarcity of high-quality, goal-oriented domain-specific training data. To address this challenge, we propose SimRPD, a three-stage framework for training recruitment proactive dialogue agents. First, we develop a high-fidelity user simulator to synthesize large-scale conversational data through multi-turn online dialogue. Then we introduce a multi-dimensional evaluation framework based on Chain-of-Intention (CoI) to comprehensively assess the simulator and effectively select high-quality data, incorporating both…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Mobile Crowdsensing and Crowdsourcing