Data Quality in Edge Machine Learning: A State-of-the-Art Survey
Mohammed Djameleddine Belgoumri, Mohamed Reda Bouadjenek, Sunil Aryal,, Hakim Hacid

TL;DR
This survey reviews the current state of data quality challenges and solutions in edge machine learning, emphasizing the importance of maintaining high data standards in resource-constrained, decentralized environments.
Contribution
It provides a comprehensive overview of data quality issues in edge ML, proposing a definition and dimensions of data quality specific to edge computing, and discusses mitigation strategies.
Findings
Identifies key data quality dimensions for edge ML.
Highlights challenges unique to resource-limited edge environments.
Summarizes existing solutions for data quality mitigation.
Abstract
Data-driven Artificial Intelligence (AI) systems trained using Machine Learning (ML) are shaping an ever-increasing (in size and importance) portion of our lives, including, but not limited to, recommendation systems, autonomous driving technologies, healthcare diagnostics, financial services, and personalized marketing. On the one hand, the outsized influence of these systems imposes a high standard of quality, particularly in the data used to train them. On the other hand, establishing and maintaining standards of Data Quality (DQ) becomes more challenging due to the proliferation of Edge Computing and Internet of Things devices, along with their increasing adoption for training and deploying ML models. The nature of the edge environment -- characterized by limited resources, decentralized data storage, and processing -- exacerbates data-related issues, making them more frequent,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBig Data and Business Intelligence · Face and Expression Recognition · Advanced Clustering Algorithms Research
MethodsSparse Evolutionary Training
