LinkedCT: A Linked Data Space for Clinical Trials
Oktie Hassanzadeh, Anastasios Kementsietsidis, Lipyeow Lim, Renee J., Miller, Min Wang

TL;DR
LinkedCT creates an open semantic web data source for clinical trials by transforming existing data into RDF and discovering semantic links using advanced matching techniques, enhancing data integration and accessibility.
Contribution
This paper introduces a novel methodology for transforming clinical trial data into RDF and discovering semantic links using combined string matching and ontology-based techniques.
Findings
Effective semantic link discovery in clinical trial data
High performance of matching techniques in various scenarios
Enhanced data integration for clinical research
Abstract
The Linked Clinical Trials (LinkedCT) project aims at publishing the first open semantic web data source for clinical trials data. The database exposed by LinkedCT is generated by (1) transforming existing data sources of clinical trials into RDF, and (2) discovering semantic links between the records in the trials data and several other data sources. In this paper, we discuss several challenges involved in these two steps and present the methodology used in LinkedCT to overcome these challenges. Our approach for semantic link discovery involves using state-of-the-art approximate string matching techniques combined with ontology-based semantic matching of the records, all performed in a declarative and easy-to-use framework. We present an evaluation of the performance of our proposed techniques in several link discovery scenarios in LinkedCT.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Biomedical Text Mining and Ontologies · Semantic Web and Ontologies
