CoALFake: Collaborative Active Learning with Human-LLM Co-Annotation for Cross-Domain Fake News Detection
Esma A\"imeur, Gilles Brassard, Dorsaf Sallami

TL;DR
CoALFake introduces a cost-effective cross-domain fake news detection method combining human-LLM co-annotation with domain-aware active learning to improve generalization and reduce reliance on labeled data.
Contribution
The paper presents a novel framework integrating LLM-assisted annotation and domain-aware sampling to enhance cross-domain fake news detection.
Findings
Outperforms existing baselines across multiple datasets.
Maintains high performance with minimal human oversight.
Effectively captures domain-specific nuances and cross-domain patterns.
Abstract
The proliferation of fake news across diverse domains highlights critical limitations in current detection systems, which often exhibit narrow domain specificity and poor generalization. Existing cross-domain approaches face two key challenges: (1) reliance on labelled data, which is frequently unavailable and resource intensive to acquire and (2) information loss caused by rigid domain categorization or neglect of domain-specific features. To address these issues, we propose CoALFake, a novel approach for cross-domain fake news detection that integrates Human-Large Language Model (LLM) co-annotation with domain-aware Active Learning (AL). Our method employs LLMs for scalable, low-cost annotation while maintaining human oversight to ensure label reliability. By integrating domain embedding techniques, the CoALFake dynamically captures both domain specific nuances and cross-domain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
