HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM

Zhilin Wang; Yi Dong; Jiaqi Zeng; Virginia Adams; Makesh Narsimhan; Sreedhar; Daniel Egert; Olivier Delalleau; Jane Polak Scowcroft; Neel Kant,; Aidan Swope; Oleksii Kuchaiev

arXiv:2311.09528·cs.CL·November 17, 2023·1 cites

HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM

Zhilin Wang, Yi Dong, Jiaqi Zeng, Virginia Adams, Makesh Narsimhan, Sreedhar, Daniel Egert, Olivier Delalleau, Jane Polak Scowcroft, Neel Kant,, Aidan Swope, Oleksii Kuchaiev

PDF

Open Access 10 Models 5 Datasets 1 Video

TL;DR

HelpSteer is a multi-attribute dataset that annotates responses for correctness, coherence, complexity, and verbosity, enabling more nuanced training of helpfulness models like SteerLM, which achieves state-of-the-art scores without relying on proprietary data.

Contribution

This paper introduces HelpSteer, a comprehensive multi-attribute helpfulness dataset, and demonstrates its effectiveness in training models that outperform existing open models on helpfulness benchmarks.

Findings

01

SteerLM trained on HelpSteer achieves 7.54 on MT Bench.

02

HelpSteer dataset covers multiple helpfulness attributes.

03

Models trained on HelpSteer outperform previous open models.

Abstract

Existing open-source helpfulness preference datasets do not specify what makes some responses more helpful and others less so. Models trained on these datasets can incidentally learn to model dataset artifacts (e.g. preferring longer but unhelpful responses only due to their length). To alleviate this problem, we collect HelpSteer, a multi-attribute helpfulness dataset annotated for the various aspects that make responses helpful. Specifically, our 37k-sample dataset has annotations for correctness, coherence, complexity, and verbosity in addition to overall helpfulness of responses. Training Llama 2 70B using the HelpSteer dataset with SteerLM technique produces a model that scores 7.54 on MT Bench, which is currently the highest score for open models that do not require training data from more powerful models (e.g. GPT4). We release this dataset with CC-BY-4.0 license at…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Datasets

Videos

HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM· underline

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning · Multimodal Machine Learning Applications