Generating Scientific Claims for Zero-Shot Scientific Fact Checking
Dustin Wright, David Wadden, Kyle Lo, Bailey Kuehl, Arman Cohan,, Isabelle Augenstein, and Lucy Lu Wang

TL;DR
This paper introduces methods for generating atomic scientific claims from sentences to improve zero-shot biomedical fact checking, achieving high performance with limited annotated data.
Contribution
It presents CLAIMGEN-BART, KBIN, and CLAIMGEN-ENTITY, novel claim generation techniques that enhance zero-shot fact checking in scientific domains.
Findings
CLAIMGEN-BART and CLAIMGEN-ENTITY achieve up to 90% of fully supervised model performance.
The methods significantly improve claim and negation quality over baselines.
The approach reduces the need for extensive annotated training data.
Abstract
Automated scientific fact checking is difficult due to the complexity of scientific language and a lack of significant amounts of training data, as annotation requires domain expertise. To address this challenge, we propose scientific claim generation, the task of generating one or more atomic and verifiable claims from scientific sentences, and demonstrate its usefulness in zero-shot fact checking for biomedical claims. We propose CLAIMGEN-BART, a new supervised method for generating claims supported by the literature, as well as KBIN, a novel method for generating claim negations. Additionally, we adapt an existing unsupervised entity-centric method of claim generation to biomedical claims, which we call CLAIMGEN-ENTITY. Experiments on zero-shot fact checking demonstrate that both CLAIMGEN-ENTITY and CLAIMGEN-BART, coupled with KBIN, achieve up to 90% performance of fully supervised…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Topic Modeling · Natural Language Processing Techniques
