How Far are we from Data Mining Democratisation? A Systematic Review
Alfonso de la Vega, Diego Garc\'ia-Saiz, Marta Zorrilla, Pablo, S\'anchez

TL;DR
This systematic review assesses the progress towards democratizing data mining, revealing that current approaches still require data scientists and face challenges in automation and accuracy, indicating that full democratization remains distant.
Contribution
The paper provides a comprehensive evaluation of existing data mining democratization approaches, identifying gaps and challenges that hinder full accessibility for non-experts.
Findings
Only two of four categories offer effective solutions.
One approach requires minimal data scientist intervention.
Automated methods face accuracy and completeness issues.
Abstract
Context: Data mining techniques have demonstrated to be a powerful technique for discovering insights hidden in data from a domain. However, these techniques demand very specialised skills. People willing to analyse data often lack these skills, so they must rely on data scientists, which hinders data mining democratisation. Different approaches have appeared in the last years to address this issue. Objective: Analyse the state of the art to know how far are we from an effective data mining democratisation, what has already been accomplished, and what should be done in the upcoming years. Method: We performed a state-of-the-art review following a systematic and objective procedure, which included works both from the academia and the industry. The reviewed works were grouped in four categories. Each category was then evaluated in detail using a well-defined evaluation criteria to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Mining Algorithms and Applications · Imbalanced Data Classification Techniques · Data Stream Mining Techniques
