Scaling Human Judgment in Community Notes with LLMs

Haiwen Li; Soham De; Manon Revel; Andreas Haupt; Brad Miller; Keith Coleman; Jay Baxter; Martin Saveski; Michiel A. Bakker

arXiv:2506.24118·cs.CY·October 2, 2025

Scaling Human Judgment in Community Notes with LLMs

Haiwen Li, Soham De, Manon Revel, Andreas Haupt, Brad Miller, Keith Coleman, Jay Baxter, Martin Saveski, Michiel A. Bakker

PDF

TL;DR

This paper proposes a hybrid system where humans and LLMs collaboratively generate and evaluate Community Notes, enhancing note delivery speed and quality while maintaining trust through human oversight and feedback.

Contribution

It introduces a new paradigm integrating LLMs into Community Notes, emphasizing human oversight and a feedback loop to improve LLM performance and note quality.

Findings

01

LLMs can assist in delivering context quickly and efficiently.

02

A feedback loop from human raters can improve LLM accuracy and reduce bias.

03

The proposed system maintains trust through human moderation.

Abstract

This paper argues for a new paradigm for Community Notes in the LLM era: an open ecosystem where both humans and LLMs can write notes, and the decision of which notes are helpful enough to show remains in the hands of humans. This approach can accelerate the delivery of notes, while maintaining trust and legitimacy through Community Notes' foundational principle: A community of diverse human raters collectively serve as the ultimate evaluator and arbiter of what is helpful. Further, the feedback from this diverse community can be used to improve LLMs' ability to produce accurate, unbiased, broadly helpful notes--what we term Reinforcement Learning from Community Feedback (RLCF). This becomes a two-way street: LLMs serve as an asset to humans--helping deliver context quickly and with minimal effort--while human feedback, in turn, enhances the performance of LLMs. This paper describes how…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.