AI-Driven Frameworks for Enhancing Data Quality in Big Data Ecosystems: Error_Detection, Correction, and Metadata Integration
Widad Elouataoui

TL;DR
This paper proposes comprehensive AI-driven frameworks for assessing, detecting, correcting data quality issues, and enhancing metadata in big data ecosystems, aiming to improve decision-making accuracy across diverse domains.
Contribution
It introduces new quality metrics, a generic anomaly detection framework, an anomaly correction framework, and metadata quality enhancement methods for big data.
Findings
Frameworks effectively detect and correct data anomalies.
Enhanced data quality improves analysis accuracy.
Applicable across multiple big data domains.
Abstract
The widespread adoption of big data has ushered in a new era of data-driven decision-making, transforming numerous industries and sectors. However, the efficacy of these decisions hinges on the quality of the underlying data. Poor data quality can result in inaccurate analyses and deceptive conclusions. Managing the vast volume, velocity, and variety of data sources presents significant challenges, heightening the importance of addressing big data quality issues. While there has been increased attention from both academia and industry, current approaches often lack comprehensiveness and universality. They tend to focus on limited metrics, neglecting other dimensions of data quality. Moreover, existing methods are often context-specific, limiting their applicability across different domains. There is a clear need for intelligent, automated approaches leveraging artificial intelligence…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBig Data and Business Intelligence · Data Quality and Management
MethodsSparse Evolutionary Training · Focus
