Extracting and Measuring Uncertain Biomedical Knowledge from Scientific Statements
Xin Guo, Yuming Chen, Jian Du, Erdan Dong

TL;DR
This study introduces a novel method for extracting and quantifying uncertain biomedical knowledge from scientific statements, focusing on cardiovascular research in China, using entropy-based metrics to identify key uncertain knowledge areas.
Contribution
The paper presents a new approach combining SPO triple extraction with entropy and uncertainty metrics to measure biomedical knowledge uncertainty at multiple levels.
Findings
Scientific publication and SPO triple counts grow linearly.
Uncertain cue words are more frequent in specific sentence types.
Major uncertain knowledge areas identified include biomarkers and therapies.
Abstract
Purpose: This study aims to develop a novel approach to extracting and measuring uncertain biomedical knowledge from scientific statements. Design/methodology/approach: Taking cardiovascular research publications in China as a sample, we extracted the SPO triples as knowledge unit and the hedging/conflicting uncertainties as the knowledge context. We introduced Information Entropy and Uncertainty Rate as potential metrics to quantity the uncertainty of biomedical knowledge claims represented at different levels, such as the SPO triples (micro level), as well as the semantic type pairs (micro-level). Findings: The results indicated that while the number of scientific publications and total SPO triples showed a liner growth, the novel SPO triples occurring per year remained stable. After examining the frequency of uncertain cue words in different part of scientific statements, we found…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies
