Naturally Occurring Feedback is Common, Extractable and Useful

Shachar Don-Yehiya; Leshem Choshen; Omri Abend

arXiv:2407.10944·cs.CL·March 4, 2025·1 cites

Naturally Occurring Feedback is Common, Extractable and Useful

Shachar Don-Yehiya, Leshem Choshen, Omri Abend

PDF

Open Access 1 Repo 1 Datasets

TL;DR

This paper demonstrates that naturally occurring user feedback in conversations is common, extractable, and beneficial for training more aligned language models, reducing the need for costly manual feedback collection.

Contribution

It introduces a method to automatically extract natural feedback from conversations, showing its effectiveness in improving model alignment and reducing feedback collection costs.

Findings

01

Up to 30% of chats contain explicit feedback.

02

Automatically extracted feedback improves model alignment.

03

Feedback extraction from 1M conversations yields valuable training data.

Abstract

Human feedback data is a critical component in developing language models. However, collecting this feedback is costly and ultimately not scalable. Inspired by the way human interlocutors provide spontaneous unsolicited feedback to each other, we propose to extract feedback that users naturally include when interacting with chat models. We manually annotated conversations to confirm the presence of naturally occurring feedback in a standard corpus, finding that as much as 30% of the chats include explicit feedback. Comparing to older datasets, we find that naturally occurring feedback is more prevalent in recent conversation datasets, suggesting that more than ever, naturally occurring feedback can serve as a valuable resource for feedback data. We propose a method for automatically extracting this feedback, and apply it to over 1M conversations to obtain hundreds of thousands of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shachardon/naturally_occurring_feedback
pytorchOfficial

Datasets

shachardon/naturally_occurring_feedback
dataset· 5 dl
5 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEvolutionary Algorithms and Applications