Open Data Quality

Anastasija Nikiforova

arXiv:2007.06540·cs.DB·June 16, 2022·1 cites

Open Data Quality

Anastasija Nikiforova

PDF

Open Access

TL;DR

This paper presents a method for assessing open data quality through a specification-based approach, enabling error detection and evaluation to ensure trustworthiness for decision-making.

Contribution

It introduces a novel data quality specification framework tailored for open data, facilitating systematic quality evaluation and error identification.

Findings

01

Applied to multiple open datasets, demonstrating effectiveness in quality assessment.

02

Identified common data quality issues in open datasets, highlighting the need for systematic evaluation.

03

Showed that the approach can improve trust in open data for stakeholders.

Abstract

The research discusses how (open) data quality could be described, what should be considered developing a data quality management solution and how it could be applied to open data to check its quality. The proposed approach focuses on development of data quality specification which can be executed to get data quality evaluation results, find errors in data and possible problems which must be solved. The proposed approach is applied to several open data sets to evaluate their quality. Open data is very popular, free available for every stakeholder - it is often used to make business decisions. It is important to be sure that this data is trustable and error-free as its quality problems can lead to huge losses.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Quality and Management · Big Data and Business Intelligence · Privacy-Preserving Technologies in Data