Scalable Ontological Query Processing over Semantically Integrated Life Science Datasets using MapReduce
HyeongSik Kim, Kemafor Anyanwu

TL;DR
This paper presents a scalable approach for processing complex ontological queries over large life sciences datasets using MapReduce, addressing challenges of inferencing and query complexity in big data environments.
Contribution
It introduces a novel method for handling complex, ontologically-rich queries on big data platforms like MapReduce, improving scalability and efficiency.
Findings
Effective query processing over large biomedical datasets
Demonstrated scalability on real-world data and benchmarks
Improved handling of complex ontological queries in cloud environments
Abstract
To address the requirement of enabling a comprehensive perspective of life-sciences data, Semantic Web technologies have been adopted for standardized representations of data and linkages between data. This has resulted in data warehouses such as UniProt, Bio2RDF, and Chem2Bio2RDF, that integrate different kinds of biological and chemical data using ontologies. Unfortunately, the ability to process queries over ontologically-integrated collections remains a challenge, particularly when data is large. The reason is that besides the traditional challenges of processing graph-structured data, complete query answering requires inferencing to explicate implicitly represented facts. Since traditional inferencing techniques like forward chaining are difficult to scale up, and need to be repeated each time data is updated, recent focus has been on inferencing that can be supported using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Scientific Computing and Data Management · Biomedical Text Mining and Ontologies
