LLMs are Superior Feedback Providers: Bootstrapping Reasoning for Lie   Detection with Self-Generated Feedback

Tanushree Banerjee; Richard Zhu; Runzhe Yang; Karthik Narasimhan

arXiv:2408.13915·cs.CL·August 27, 2024

LLMs are Superior Feedback Providers: Bootstrapping Reasoning for Lie Detection with Self-Generated Feedback

Tanushree Banerjee, Richard Zhu, Runzhe Yang, Karthik Narasimhan

PDF

Open Access

TL;DR

This paper introduces a bootstrapping framework where LLMs generate and refine feedback to improve lie detection in complex dialogues, achieving significant performance gains without training data.

Contribution

The paper presents a novel self-feedback bootstrapping method that enhances LLM reasoning for lie detection, outperforming zero-shot baselines and rivaling supervised methods.

Findings

01

39% improvement in lying-F1 score over zero-shot baseline

02

LLM-generated feedback surpasses professional human feedback quality

03

Achieves state-of-the-art results without training data

Abstract

Large Language Models (LLMs) excel at generating human-like dialogues and comprehending text. However, understanding the subtleties of complex exchanges in language remains a challenge. We propose a bootstrapping framework that leverages self-generated feedback to enhance LLM reasoning capabilities for lie detection. The framework consists of three stages: suggestion, feedback collection, and modification. In the suggestion stage, a cost-effective language model generates initial predictions based on game state and dialogue. The feedback-collection stage involves a language model providing feedback on these predictions. In the modification stage, a more advanced language model refines the initial predictions using the auto-generated feedback. We investigate the application of the proposed framework for detecting betrayal and deception in Diplomacy games, and compare it with feedback…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Deception detection and forensic psychology