Multi-expert Prompting Improves Reliability, Safety, and Usefulness of   Large Language Models

Do Xuan Long; Duong Ngoc Yen; Anh Tuan Luu; Kenji Kawaguchi; Min-Yen; Kan; Nancy F. Chen

arXiv:2411.00492·cs.CL·November 4, 2024·2 cites

Multi-expert Prompting Improves Reliability, Safety, and Usefulness of Large Language Models

Do Xuan Long, Duong Ngoc Yen, Anh Tuan Luu, Kenji Kawaguchi, Min-Yen, Kan, Nancy F. Chen

PDF

Open Access 1 Repo 1 Video

TL;DR

Multi-expert Prompting enhances large language model responses by simulating multiple experts, aggregating their outputs, and selecting the best, leading to improved truthfulness, safety, and usefulness without manual prompt tuning.

Contribution

It introduces a novel multi-expert prompting method that outperforms existing techniques in reliability, safety, and informativeness of LLM outputs, using a decision-making framework.

Findings

01

Outperforms ExpertPrompting and baselines in truthfulness and factuality.

02

Reduces toxicity and hurtfulness in responses.

03

Achieves 8.69% higher truthfulness with ChatGPT.

Abstract

We present Multi-expert Prompting, a novel enhancement of ExpertPrompting (Xu et al., 2023), designed to improve the large language model (LLM) generation. Specifically, it guides an LLM to fulfill an input instruction by simulating multiple experts, aggregating their responses, and selecting the best among individual and aggregated responses. This process is performed in a single chain of thoughts through our seven carefully designed subtasks derived from the Nominal Group Technique (Ven and Delbecq, 1974), a well-established decision-making framework. Our evaluations demonstrate that Multi-expert Prompting significantly outperforms ExpertPrompting and comparable baselines in enhancing the truthfulness, factuality, informativeness, and usefulness of responses while reducing toxicity and hurtfulness. It further achieves state-of-the-art truthfulness by outperforming the best baseline by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dxlong2000/multi-expert-prompting
pytorchOfficial

Videos

Multi-expert Prompting Improves Reliability, Safety and Usefulness of Large Language Models· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems