HTAP Databases: A Survey
Chao Zhang, Guoliang Li, Jintao Zhang, Xinning Zhang, Jianhua Feng

TL;DR
This survey comprehensively reviews HTAP databases, classifying them by storage architecture, analyzing key techniques, benchmarks, and discussing future research challenges in combining transactional and analytical processing.
Contribution
It provides a systematic classification and analysis of state-of-the-art HTAP databases, highlighting their architectures, techniques, and open research issues.
Findings
Classified HTAP databases into four storage architecture categories.
Reviewed key techniques like workload processing and data synchronization.
Discussed existing benchmarks and future research challenges.
Abstract
Since Gartner coined the term, Hybrid Transactional and Analytical Processing (HTAP), numerous HTAP databases have been proposed to combine transactions with analytics in order to enable real-time data analytics for various data-intensive applications. HTAP databases typically process the mixed workloads of transactions and analytical queries in a unified system by leveraging both a row store and a column store. As there are different storage architectures and processing techniques to satisfy various requirements of diverse applications, it is critical to summarize the pros and cons of these key techniques. This paper offers a comprehensive survey of HTAP databases. We mainly classify state-of-the-art HTAP databases according to four storage architectures: (a) Primary Row Store and In-Memory Column Store; (b) Distributed Row Store and Column Store Replica; (c) Primary Row Store and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
