OpenAssistant Conversations -- Democratizing Large Language Model Alignment
Andreas K\"opf, Yannic Kilcher, Dimitri von R\"utte, Sotiris, Anagnostidis, Zhi-Rui Tam, Keith Stevens, Abdullah Barhoum, Nguyen Minh Duc,, Oliver Stanley, Rich\'ard Nagyfi, Shahul ES, Sameer Suri, David Glushkov,, Arnav Dantuluri, Andrew Maguire, Christoph Schuhmann, Huu Nguyen

TL;DR
This paper introduces OpenAssistant Conversations, a large, multilingual, human-annotated dataset aimed at democratizing large language model alignment research, enabling broader access and development of aligned LLMs.
Contribution
It provides a comprehensive, publicly available conversation corpus with extensive annotations, facilitating research and development of alignment techniques without relying on proprietary human feedback data.
Findings
Models trained on the dataset outperform base models on standard benchmarks.
The dataset includes over 161,000 messages in 35 languages.
Crowdsourced annotations ensure high-quality, diverse human feedback.
Abstract
Aligning large language models (LLMs) with human preferences has proven to drastically improve usability and has driven rapid adoption as demonstrated by ChatGPT. Alignment techniques such as supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF) greatly reduce the required skill and domain knowledge to effectively harness the capabilities of LLMs, increasing their accessibility and utility across various domains. However, state-of-the-art alignment techniques like RLHF rely on high-quality human feedback data, which is expensive to create and often remains proprietary. In an effort to democratize research on large-scale alignment, we release OpenAssistant Conversations, a human-generated, human-annotated assistant-style conversation corpus consisting of 161,443 messages in 35 different languages, annotated with 461,292 quality ratings, resulting in over…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗OpenAssistant/oasst-sft-6-llama-30b-xormodel· ♡ 941♡ 941
- 🤗OpenAssistant/oasst-sft-7-llama-30b-xormodel· ♡ 63♡ 63
- 🤗avictus/oasst-sft-7-llama-30b-4bitmodel· 3 dl· ♡ 33 dl♡ 3
- 🤗TheBloke/OpenAssistant-SFT-7-Llama-30B-HFmodel· 868 dl· ♡ 14868 dl♡ 14
- 🤗TheBloke/OpenAssistant-SFT-7-Llama-30B-GGMLmodel· ♡ 39♡ 39
- 🤗TheBloke/OpenAssistant-SFT-7-Llama-30B-GPTQmodel· 796 dl· ♡ 35796 dl♡ 35
- 🤗MorningstarHead/MorningstarHeadmodel
- 🤗gsaivinay/OpenAssistant-SFT-7-Llama-30B-HFmodel· 5 dl· ♡ 25 dl♡ 2
- 🤗OpenAssistant/oasst-rlhf-2-llama-30b-7k-steps-xormodel· ♡ 25♡ 25
- 🤗timdettmers/guanaco-7bmodel· ♡ 24♡ 24
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques
MethodsBalanced Selection
