Evaluating Wikipedia as a source of information for disease understanding
Eduardo P. Garcia del Valle, Gerardo Lagunes Garcia, Lucia Prieto Santamaria, Massimiliano Zanin, Alejandro Rodriguez-Gonzalez, Ernestina Menasalvas Ruiz

TL;DR
This paper explores Wikipedia as an accessible and effective source of textual information for disease understanding, comparing its utility to PubMed and finding comparable relevance in disease relationship extraction.
Contribution
It proposes Wikipedia as an accessible alternative to medical publications for disease research, since restricted access to those publications and their lack of structure limit text-mining approaches.
Findings
Wikipedia's information is as relevant as PubMed abstracts for disease relationships
Wikipedia provides accessible textual data for disease understanding research
Further validation needed for medical reliability
Abstract
The increasing availability of biological data is improving our understanding of diseases and providing new insight into their underlying relationships. Thanks to improvements in both text mining techniques and computational capacity, the combination of biological data with semantic information obtained from medical publications has proven to be a very promising path. However, limited access to these data and their lack of structure pose challenges to this approach. In this document we propose the use of Wikipedia, the free online encyclopedia, as a source of accessible textual information for disease understanding research. To check its validity, we compare its performance in determining relationships between diseases with that of PubMed, one of the most consulted sources of medical texts. The obtained results suggest that the information extracted…
