The Forgotten Document-Oriented Database Management Systems: An Overview   and Benchmark of Native XML DODBMSes in Comparison with JSON DODBMSes

Ciprian-Octavian Truic\u{a}; Elena-Simona Apostol; J\'er\^ome; Darmont; Torben Bach Pedersen

arXiv:2102.02246·cs.DB·February 5, 2021

The Forgotten Document-Oriented Database Management Systems: An Overview and Benchmark of Native XML DODBMSes in Comparison with JSON DODBMSes

Ciprian-Octavian Truic\u{a}, Elena-Simona Apostol, J\'er\^ome, Darmont, Torben Bach Pedersen

PDF

1 Repo

TL;DR

This paper compares the performance and scalability of XML-based and JSON-based native document-oriented databases, highlighting the lack of benchmarks and analyzing their suitability for complex, heterogeneous data in Big Data contexts.

Contribution

It provides the first comprehensive comparison and benchmarking of XML and JSON document databases, addressing a gap in existing research.

Findings

01

XML databases perform well on complex queries.

02

JSON databases offer better scalability for large datasets.

03

Benchmark results highlight performance trade-offs between formats.

Abstract

In the current context of Big Data, a multitude of new NoSQL solutions for storing, managing, and extracting information and patterns from semi-structured data have been proposed and implemented. These solutions were developed to relieve the issue of rigid data structures present in relational databases, by introducing semi-structured and flexible schema design. As current data generated by different sources and devices, especially from IoT sensors and actuators, use either XML or JSON format, depending on the application, database technologies that store and query semi-structured data in XML format are needed. Thus, Native XML Databases, which were initially designed to manipulate XML data using standardized querying languages, i.e., XQuery and XPath, were rebranded as NoSQL Document-Oriented Databases Systems. Currently, the majority of these solutions have been replaced with the more…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cipriantruica/The-Forgotten-DODBMSes
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.