Extracting Fine-Grained Knowledge Graphs of Scientific Claims: Dataset and Transformer-Based Results
Ian H. Magnusson, Scott E. Friedman

TL;DR
This paper introduces SciClaim, a detailed dataset of scientific claims with fine-grained annotations, and demonstrates transformer-based methods to extract complex knowledge graphs from scientific texts, enhancing understanding of experimental relationships.
Contribution
The paper presents a new dataset with extensive fine-grained annotations and a transformer-based approach for extracting detailed scientific knowledge graphs.
Findings
Transformer models effectively infer complex schema annotations.
SciClaim dataset captures diverse causal, statistical, and comparative relations.
Enhanced label density improves the granularity of extracted scientific information.
Abstract
Recent transformer-based approaches demonstrate promising results on relational scientific information extraction. Existing datasets focus on high-level description of how research is carried out. Instead we focus on the subtleties of how experimental associations are presented by building SciClaim, a dataset of scientific claims drawn from Social and Behavior Science (SBS), PubMed, and CORD-19 papers. Our novel graph annotation schema incorporates not only coarse-grained entity spans as nodes and relations as edges between them, but also fine-grained attributes that modify entities and their relations, for a total of 12,738 labels in the corpus. By including more label types and more than twice the label density of previous datasets, SciClaim captures causal, comparative, predictive, statistical, and proportional associations over experimental variables along with their qualifications,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Topic Modeling · Advanced Text Analysis Techniques
