Ontology Based Data Integration Over Document and Column Family Oriented NOSQL
Olivier Cur\'e, Myriam Lamolle, Chan Le Duc

TL;DR
This paper presents an ontology-based data integration framework for NOSQL databases, enabling reasoning and efficient distributed querying across heterogeneous web-scale data sources.
Contribution
It introduces a method to generate local ontologies from schemaless NOSQL sources, create a global ontology through correspondence discovery, and translate SPARQL queries to source-specific languages.
Findings
Generated local ontologies for MongoDB and Cassandra
Created a global ontology linking local schemas
Developed a SPARQL-to-source query translation
Abstract
The World Wide Web infrastructure together with its more than 2 billion users enables to store information at a rate that has never been achieved before. This is mainly due to the will of storing almost all end-user interactions performed on some web applications. In order to reply to scalability and availability constraints, many web companies involved in this process recently started to design their own data management systems. Many of them are referred to as NOSQL databases, standing for 'Not only SQL'. With their wide adoption emerges new needs and data integration is one of them. In this paper, we consider that an ontology-based representation of the information stored in a set of NOSQL sources is highly needed. The main motivation of this approach is the ability to reason on elements of the ontology and to retrieve information in an efficient and distributed manner. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Service-Oriented Architecture and Web Services · Advanced Database Systems and Queries
