Data-Driven Information Extraction and Enrichment of Molecular Profiling Data for Cancer Cell Lines
Ellery Smith, Rahel Paloots, Dimitris Giagkos, Michael Baudis, Kurt, Stockinger

TL;DR
This paper presents a novel computational system that extracts deep semantic relations from biomedical literature to enrich cancer cell line data, facilitating rapid literature search and data exploration.
Contribution
It introduces a new data extraction system and a public portal that links genomic data with literature-derived relations for cancer cell lines.
Findings
Enables automatic linking of genomic variants with related entities.
Provides literature evidence for extracted relations.
Facilitates rapid literature search and data enrichment.
Abstract
With the proliferation of research means and computational methodologies, published biomedical literature is growing exponentially in numbers and volume. Cancer cell lines are frequently used models in biological and medical research that are currently applied for a wide range of purposes, from studies of cellular mechanisms to drug development, which has led to a wealth of related data and publications. Sifting through large quantities of text to gather relevant information on the cell lines of interest is tedious and extremely slow when performed by humans. Hence, novel computational information extraction and correlation mechanisms are required to boost meaningful knowledge extraction. In this work, we present the design, implementation and application of a novel data extraction and exploration system. This system extracts deep semantic relations between textual entities from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Computational Drug Discovery Methods · Machine Learning in Bioinformatics
