Systematic definition and classification of data anomalies in DBMS   (English Version)

Li Hai-Xiang; Li Xiao-Yan; Liu Chang; Du Xiao-Yong; Lu Wei; Pan An-Qun

arXiv:2110.14230·cs.DB·October 28, 2021·1 cites

Systematic definition and classification of data anomalies in DBMS (English Version)

Li Hai-Xiang, Li Xiao-Yan, Liu Chang, Du Xiao-Yong, Lu Wei, Pan An-Qun

PDF

Open Access

TL;DR

This paper systematically defines and classifies data anomalies in DBMS, introduces 22 new anomalies, and proposes two new isolation levels to improve transaction processing efficiency.

Contribution

It provides a comprehensive classification of data anomalies, reports 22 new anomalies, and proposes novel isolation levels based on anomaly classification.

Findings

01

22 new data anomalies identified

02

A miraculously comprehensive classification system established

03

Two new isolation levels proposed based on anomalies

Abstract

There is no unified definition of Data anomalies, which refers to the specific data operation mode that may violate the consistency of the database. Known data anomalies include Dirty Write, Dirty Read, Non-repeatable Read, Phantom, Read Skew and Write Skew, etc. In order to improve the efficiency of concurrency control algorithms, data anomalies are also used to define the isolation levels, because the weak isolation level can improve the efficiency of transaction processing systems. This paper systematically studies the data anomalies and the corresponding isolation levels. We report twenty-two new data anomalies that other papers have not reported, and all data anomalies are classified miraculously. Based on the classification of data anomalies, two new isolation levels systems with different granularity are proposed, which reveals the rule of defining isolation levels based on data…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDistributed systems and fault tolerance · Data Quality and Management · Advanced Data Storage Technologies