Multidimensional or Relational? / How to Organize an On-line Analytical Processing Database
Istv\'an Sz\'epk\'uti

TL;DR
This paper compares multidimensional and relational database structures for OLAP, demonstrating conditions where MD arrays are faster and more space-efficient, supported by theoretical proofs and experimental results.
Contribution
It provides new proofs and conditions for when MD arrays outperform relational tables, including compression techniques and experimental validation.
Findings
MD arrays can represent all relations.
MD arrays are faster under certain conditions.
MD arrays can be more space-efficient.
Abstract
In the past few years, the number of OLAP applications increased quickly. These applications use two significantly different DB structures: multidimensional (MD) and table-based. One can show that the traditional model of relational databases cannot make difference between these two structures. Another model is necessary to make the differences visible. One of these is the speed of the system. It can be proven that the multidimensional DB organization results in shorter response times. And it is crucial, since a manager may become impatient, if he or she has to wait say more than 20 seconds for the next screen. On the other hand, we have to pay for the speed with a bigger DB size. Why does the size of MD databases grow so quickly? The reason is the sparsity of data: The MD matrix contains many empty cells. Efficient handling of sparse matrices is indispensable in an OLAP application.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Database Systems and Queries · Data Management and Algorithms · Semantic Web and Ontologies
