On the Use of Interpretable Machine Learning for the Management of Data   Quality

Anna Karanika; Panagiotis Oikonomou; Kostas Kolomvatsos; Christos; Anagnostopoulos

arXiv:2007.14677·cs.LG·July 30, 2020

On the Use of Interpretable Machine Learning for the Management of Data Quality

Anna Karanika, Panagiotis Oikonomou, Kostas Kolomvatsos, Christos, Anagnostopoulos

PDF

Open Access

TL;DR

This paper proposes using interpretable machine learning to identify significant features in IoT data, enhancing data quality management at the edge by enabling feature selection and dimensionality reduction.

Contribution

It introduces an ensemble-based interpretable machine learning approach for feature importance detection to improve data quality in IoT and edge computing environments.

Findings

01

Effective feature selection for IoT data quality management.

02

Enhanced dimensionality reduction with interpretability.

03

Robust performance across various simulated scenarios.

Abstract

Data quality is a significant issue for any application that requests for analytics to support decision making. It becomes very important when we focus on Internet of Things (IoT) where numerous devices can interact to exchange and process data. IoT devices are connected to Edge Computing (EC) nodes to report the collected data, thus, we have to secure data quality not only at the IoT but also at the edge of the network. In this paper, we focus on the specific problem and propose the use of interpretable machine learning to deliver the features that are important to be based for any data processing activity. Our aim is to secure data quality, at least, for those features that are detected as significant in the collected datasets. We have to notice that the selected features depict the highest correlation with the remaining in every dataset, thus, they can be adopted for dimensionality…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Data Stream Mining Techniques · Machine Learning and Data Classification

MethodsInterpretability