Scaling Human Judgment in Community Notes with LLMs
Haiwen Li, Soham De, Manon Revel, Andreas Haupt, Brad Miller, Keith Coleman, Jay Baxter, Martin Saveski, Michiel A. Bakker

TL;DR
This paper proposes a hybrid system where humans and LLMs collaboratively generate and evaluate Community Notes, enhancing note delivery speed and quality while maintaining trust through human oversight and feedback.
Contribution
It introduces a new paradigm integrating LLMs into Community Notes, emphasizing human oversight and a feedback loop to improve LLM performance and note quality.
Findings
LLMs can assist in delivering context quickly and efficiently.
A feedback loop from human raters can improve LLM accuracy and reduce bias.
The proposed system maintains trust through human moderation.
Abstract
This paper argues for a new paradigm for Community Notes in the LLM era: an open ecosystem where both humans and LLMs can write notes, and the decision of which notes are helpful enough to show remains in the hands of humans. This approach can accelerate the delivery of notes, while maintaining trust and legitimacy through Community Notes' foundational principle: A community of diverse human raters collectively serve as the ultimate evaluator and arbiter of what is helpful. Further, the feedback from this diverse community can be used to improve LLMs' ability to produce accurate, unbiased, broadly helpful notes--what we term Reinforcement Learning from Community Feedback (RLCF). This becomes a two-way street: LLMs serve as an asset to humans--helping deliver context quickly and with minimal effort--while human feedback, in turn, enhances the performance of LLMs. This paper describes how…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
