Data Modeling for Connected Data -- A systematic literature review
Veronica Santos

TL;DR
This paper systematically reviews approaches to data modeling for connected data in NoSQL graph databases, highlighting the challenges and research gaps, especially in reverse engineering methods, over the period 2013-2020.
Contribution
It provides a comprehensive analysis of sixteen studies on connected data modeling, identifying research opportunities and characterizing existing approaches.
Findings
Reverse engineering of connected data is underexplored.
Most works focus on modeling techniques for specific applications.
There is a need for more complete solutions in data reverse engineering.
Abstract
A data model specifies how real-world entities and their relationships are represented and operated. In the NoSQL world data modeling usually begins from identifying application queries and designing the data model to efficiently answer them so each database is designed to meet requirements of just one or more applications. But this practice causes a strong coupling between the data model and application queries and promotes data silos. Newly developed applications that manipulate connected data, usually stored in NoSQL Graph Databases, suffer from this type of problem, which is a challenge for data integration projects in Big Data scenarios. This systematic literature review (SLR) was carried out to identify the known approaches for data modeling of connected data. The main contribution of this SLR is an analysis of sixteen works, from 2013 to 2020, in terms of three dimensions: type…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Database Systems and Queries · Semantic Web and Ontologies · Data Quality and Management
