Steering Conversational Large Language Models for Long Emotional Support Conversations

Navid Madani; Sougata Saha; Rohini Srihari

arXiv:2402.10453·cs.CL·September 17, 2024·2 cites

TL;DR

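This paper investigates how large language models can be steered to follow emotional support strategies consistently in long conversations. It introduces an attention-based metric (SRA) for measuring strategy adherence, a strategy-conditioned synthetic dataset, and a fine-tuned model with improved steerability.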

Contribution

We propose the Strategy Relevant Attention (SRA) metric, create a strategy-conditioned dataset, and develop a fine-tuned model that improves adherence to emotional support strategies.

Findings

1. The SRA metric effectively measures strategy adherence.
2. Fine-tuning improves model steerability in extended conversations.
3. Publicly available code and data facilitate further research.

Abstract

In this study, we address the challenge of enabling large language models (LLMs) to consistently adhere to emotional support strategies in extended conversations. We focus on the steerability of the Llama-2 and Llama-3 suite of models, examining their ability to maintain these strategies throughout interactions. To assess this, we introduce the Strategy Relevant Attention (SRA) metric, which quantifies the model's adherence to the prompted strategy through attention maps. To facilitate our study, we create a strategy-conditioned synthetic conversational dataset derived from the ESConv dataset. We also propose several baselines informed by the SRA metric, along with a fine-tuned model that significantly enhances the steerability of the base model in following the strategy throughout the conversation. The code and data are publicly available on our GitHub.
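The abstract describes SRA only at a high level, as a measure of strategy adherence derived from attention maps. The sketch below is an illustrative proxy under stated assumptions, not the paper's definition: it assumes a single attention matrix that has already been averaged over heads and layers, and it takes the average share of attention that response tokens place on the strategy-prompt tokens. The function name `strategy_attention_share` and the toy matrix are hypothetical.

```python
from typing import List, Sequence


def strategy_attention_share(
    attention: Sequence[Sequence[float]],
    strategy_positions: Sequence[int],
    response_positions: Sequence[int],
) -> float:
    """Mean share of attention that response tokens place on strategy tokens.

    attention[i][j] is the weight token i assigns to token j. The matrix is
    assumed to be pre-averaged over heads and layers (an assumption made for
    this sketch; the abstract does not specify the aggregation).
    """
    if not strategy_positions or not response_positions:
        raise ValueError("both position lists must be non-empty")

    strategy = set(strategy_positions)
    shares: List[float] = []
    for i in response_positions:
        row = attention[i]
        total = sum(row)
        if total == 0:
            continue  # skip rows that carry no attention mass
        # Fraction of this token's attention that lands on the strategy span.
        shares.append(sum(row[j] for j in strategy) / total)

    return sum(shares) / len(shares) if shares else 0.0


# Toy causal attention map over six tokens (hypothetical values):
# positions 0-1 = strategy prompt, 2-3 = dialogue history, 4-5 = response.
toy_attention = [
    [1.00, 0.00, 0.00, 0.00, 0.00, 0.00],
    [0.50, 0.50, 0.00, 0.00, 0.00, 0.00],
    [0.25, 0.25, 0.50, 0.00, 0.00, 0.00],
    [0.25, 0.25, 0.25, 0.25, 0.00, 0.00],
    [0.20, 0.20, 0.20, 0.20, 0.20, 0.00],
    [0.10, 0.10, 0.20, 0.20, 0.20, 0.20],
]

score = strategy_attention_share(
    toy_attention, strategy_positions=[0, 1], response_positions=[4, 5]
)
print(f"strategy attention share: {score:.3f}")
```

Under this proxy, a higher value means the generated tokens attend more to the strategy instruction; a score that falls as the dialogue history grows would correspond to the loss of steerability in long conversations that the abstract targets.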

Code & Models

Repositories

navidmdn/esconv-sra
PyTorch · Official

Models

navidmadani/esconv_sra_llama3_8b (Hugging Face)
model · 99 dl

Datasets

navidmadani/extended_esc
dataset · 5 dl

Videos

Steering Conversational Large Language Models for Long Emotional Support Conversations · Underline

Taxonomy

Topics: Mental Health via Writing

Methods: Balanced Selection · Focus