HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM
Zhilin Wang, Yi Dong, Jiaqi Zeng, Virginia Adams, Makesh Narsimhan, Sreedhar, Daniel Egert, Olivier Delalleau, Jane Polak Scowcroft, Neel Kant,, Aidan Swope, Oleksii Kuchaiev

TL;DR
HelpSteer is a multi-attribute dataset that annotates responses for correctness, coherence, complexity, and verbosity, enabling more nuanced training of helpfulness models like SteerLM, which achieves state-of-the-art scores without relying on proprietary data.
Contribution
This paper introduces HelpSteer, a comprehensive multi-attribute helpfulness dataset, and demonstrates its effectiveness in training models that outperform existing open models on helpfulness benchmarks.
Findings
SteerLM trained on HelpSteer achieves 7.54 on MT Bench.
HelpSteer dataset covers multiple helpfulness attributes.
Models trained on HelpSteer outperform previous open models.
Abstract
Existing open-source helpfulness preference datasets do not specify what makes some responses more helpful and others less so. Models trained on these datasets can incidentally learn to model dataset artifacts (e.g. preferring longer but unhelpful responses only due to their length). To alleviate this problem, we collect HelpSteer, a multi-attribute helpfulness dataset annotated for the various aspects that make responses helpful. Specifically, our 37k-sample dataset has annotations for correctness, coherence, complexity, and verbosity in addition to overall helpfulness of responses. Training Llama 2 70B using the HelpSteer dataset with SteerLM technique produces a model that scores 7.54 on MT Bench, which is currently the highest score for open models that do not require training data from more powerful models (e.g. GPT4). We release this dataset with CC-BY-4.0 license at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗nvidia/Llama2-70B-SteerLM-Chatmodel· 4 dl· ♡ 234 dl♡ 23
- 🤗nvidia/Llama2-13B-SteerLM-RMmodel· 23 dl· ♡ 823 dl♡ 8
- 🤗nvidia/Llama-3.1-Nemotron-70B-Rewardmodel· 62 dl· ♡ 8162 dl♡ 81
- 🤗nvidia/Llama-3.1-Nemotron-70B-Reward-HFmodel· 1.5k dl· ♡ 921.5k dl♡ 92
- 🤗nvidia/Llama-3.1-Nemotron-70B-Instructmodel· 91 dl· ♡ 56891 dl♡ 568
- 🤗poisson-fish/Llama-3.1-Nemotron-70B-Instruct-GGUFmodel· 22 dl22 dl
- 🤗bigstorm/Llama-3.1-Nemotron-70B-Instruct-HF-8.0bpw-8hb-exl2model· 3 dl· ♡ 33 dl♡ 3
- 🤗bigstorm/Llama-3.1-Nemotron-70B-Instruct-HF-7.0bpw-8hb-exl2model· 4 dl· ♡ 14 dl♡ 1
- 🤗bigstorm/Llama-3.1-Nemotron-70B-Instruct-HF-6.0bpw-8hb-exl2model· 3 dl3 dl
- 🤗bigstorm/Llama-3.1-Nemotron-70B-Instruct-HF-4.3bpw-6hb-exl2model· 3 dl· ♡ 33 dl♡ 3
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning · Multimodal Machine Learning Applications
