Detecting Response Generation Not Requiring Factual Judgment

Ryohei Kamei; Daiki Shiono; Reina Akama; Jun Suzuki

arXiv:2406.09702·cs.CL·June 17, 2024

Detecting Response Generation Not Requiring Factual Judgment

Ryohei Kamei, Daiki Shiono, Reina Akama, Jun Suzuki

PDF

Open Access 1 Video

TL;DR

This paper introduces a dataset and classification approach to identify dialogue responses that do not require factual correctness, enhancing dialogue systems' ability to balance attractiveness and factuality.

Contribution

It presents a new dataset (DDFC) and demonstrates effective classification models for detecting non-factual responses in dialogues.

Findings

01

Achieved up to 88% accuracy in classifying responses

02

Created a crowdsourced annotated dialogue dataset

03

Improved understanding of non-factual response detection

Abstract

With the remarkable development of large language models (LLMs), ensuring the factuality of output has become a challenge. However, having all the contents of the response with given knowledge or facts is not necessarily a good thing in dialogues. This study aimed to achieve both attractiveness and factuality in a dialogue response for which a task was set to predict sentences that do not require factual correctness judgment such as agreeing, or personal opinions/feelings. We created a dataset, dialogue dataset annotated with fact-check-needed label (DDFC), for this task via crowdsourcing, and classification tasks were performed on several models using this dataset. The model with the highest classification accuracy could yield about 88% accurate classification results.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Detecting Response Generation Not Requiring Factual Judgment· underline

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Topic Modeling · Deception detection and forensic psychology

MethodsSparse Evolutionary Training