SAPIENT: Mastering Multi-turn Conversational Recommendation with   Strategic Planning and Monte Carlo Tree Search

Hanwen Du; Bo Peng; Xia Ning

arXiv:2410.09580·cs.CL·May 9, 2025

SAPIENT: Mastering Multi-turn Conversational Recommendation with Strategic Planning and Monte Carlo Tree Search

Hanwen Du, Bo Peng, Xia Ning

PDF

Open Access 1 Repo 1 Video

TL;DR

SAPIENT introduces a Monte Carlo Tree Search-based framework for conversational recommendation systems, enabling strategic planning and iterative self-improvement to outperform existing methods.

Contribution

The paper proposes a novel MCTS-based framework with a self-training loop for improved conversational planning in CRS, addressing suboptimal strategies of prior RL-based methods.

Findings

01

Outperforms state-of-the-art baselines on four benchmark datasets.

02

Demonstrates effective strategic planning in multi-turn conversations.

03

Balances training efficiency and performance with an efficient variant.

Abstract

Conversational Recommender Systems (CRS) proactively engage users in interactive dialogues to elicit user preferences and provide personalized recommendations. Existing methods train Reinforcement Learning (RL)-based agent with greedy action selection or sampling strategy, and may suffer from suboptimal conversational planning. To address this, we present a novel Monte Carlo Tree Search (MCTS)-based CRS framework SAPIENT. SAPIENT consists of a conversational agent (S-agent) and a conversational planner (S-planner). S-planner builds a conversational search tree with MCTS based on the initial actions proposed by S-agent to find conversation plans. The best conversation plans from S-planner are used to guide the training of S-agent, creating a self-training loop where S-agent can iteratively improve its capability for conversational planning. Furthermore, we propose an efficient variant…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ninglab/sapient
pytorchOfficial

Videos

SAPIENT: Mastering Multi-turn Conversational Recommendation with Strategic Planning and Monte Carlo Tree Search· underline

Taxonomy

TopicsAdvanced Text Analysis Techniques · Speech and dialogue systems