Handling Inconsistencies in Tables with Nulls and Functional Dependencies
Dominique Laurent, Nicolas Spyratos

TL;DR
This paper introduces a formal framework for managing inconsistencies and missing values in tables with functional dependencies, enabling improved query answering and data integration.
Contribution
It presents a new approach to handle inconsistencies with nulls, algorithms for tuple classification, and a novel solution for consistent query answering.
Findings
Framework associates tuples with truth values: true, false, inconsistent, unknown.
Algorithms compute true, inconsistent, and false tuples efficiently.
The approach relates to Four-valued logic and improves upon existing table repair methods.
Abstract
In this paper we address the problem of handling inconsistencies in tables with missing values (also called nulls) and functional dependencies. Although the traditional view is that table instances must respect all functional dependencies imposed on them, it is nevertheless relevant to develop theories about how to handle instances that violate some dependencies. Regarding missing values, we make no assumptions on their existence: a missing value exists only if it is inferred from the functional dependencies of the table. We propose a formal framework in which each tuple of a table is associated with a truth value among the following: true, false, inconsistent or unknown; and we show that our framework can be used to study important problems such as consistent query answering, table merging, and data quality measures - to mention just a few. In this paper, however, we focus mainly on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
