The Future of Open Human Feedback
Shachar Don-Yehiya, Ben Burtenshaw, Ramon Fernandez Astudillo, Cailean Osborne, Mimansa Jaiswal, Tzu-Sheng Kuo, Wenting Zhao, Idan Shenfeld, Andi Peng, Mikhail Yurochkin, Atoosa Kasirzadeh, Yangsibo Huang, Tatsunori Hashimoto, Yacine Jernite, Daniel Vila-Suero, Omri Abend

TL;DR
This paper explores the potential for creating an open ecosystem for human feedback on language models, emphasizing community involvement, challenges, and sustainable practices to improve AI safety and capabilities.
Contribution
It provides an interdisciplinary assessment of open human feedback, identifies key challenges, reviews current approaches, and proposes a framework for a sustainable, community-driven feedback ecosystem.
Findings
Successful open practices can be adapted from peer production, open source, and citizen science communities.
Main challenges include privacy, incentivization, and quality control.
Recommendations for building an open, sustainable feedback infrastructure.
Abstract
Human feedback on conversations with large language models (LLMs) is central to how these systems learn about the world, improve their capabilities, and are steered toward desirable and safe behaviors. However, this feedback is mostly collected by frontier AI labs and kept behind closed doors. In this work, we bring together interdisciplinary experts to assess the opportunities and challenges to realizing an open ecosystem of human feedback for AI. We first look for successful practices in peer production, open source, and citizen science communities. We then characterize the main challenges for open human feedback. For each, we survey current approaches and offer recommendations. We end by envisioning the components needed to underpin a sustainable and open human feedback ecosystem. At the center of this ecosystem are mutually beneficial feedback loops, between users and specialized…
Taxonomy
Topics: Human-Automation Interaction and Safety
