TL;DR
This paper compares the performance and scalability of XML-based and JSON-based native document-oriented databases, highlighting the lack of benchmarks and analyzing their suitability for complex, heterogeneous data in Big Data contexts.
Contribution
It provides the first comprehensive comparison and benchmarking of XML and JSON document databases, addressing a gap in existing research.
Findings
XML databases perform well on complex queries.
JSON databases offer better scalability for large datasets.
Benchmark results highlight performance trade-offs between formats.
Abstract
In the current context of Big Data, a multitude of new NoSQL solutions for storing, managing, and extracting information and patterns from semi-structured data have been proposed and implemented. These solutions were developed to relieve the issue of rigid data structures present in relational databases, by introducing semi-structured and flexible schema design. As current data generated by different sources and devices, especially from IoT sensors and actuators, use either XML or JSON format, depending on the application, database technologies that store and query semi-structured data in XML format are needed. Thus, Native XML Databases, which were initially designed to manipulate XML data using standardized querying languages, i.e., XQuery and XPath, were rebranded as NoSQL Document-Oriented Databases Systems. Currently, the majority of these solutions have been replaced with the more…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
