Correct classification for big/smart/fast data machine learning

Sander Stepanov

arXiv:1609.08550·cs.LG·September 28, 2016

Correct classification for big/smart/fast data machine learning

Sander Stepanov

PDF

Open Access

TL;DR

This paper explores the classification of big data using Boolean function minimization, proposing a mathematical approach that transforms data representation into Boolean functions and applies known algorithms.

Contribution

It introduces a novel perspective of data classification as Boolean function minimization and demonstrates how to leverage existing algorithms for this purpose.

Findings

01

Data can be represented as Boolean functions for classification.

02

Existing Boolean minimization algorithms can be applied to data classification.

03

The approach facilitates development of multivalued output classifiers.

Abstract

Table (database) / Relational database Classification for big/smart/fast data machine learning is one of the most important tasks of predictive analytics and extracting valuable information from data. It is core applied technique for what now understood under data science and/or artificial intelligence. Widely used Decision Tree (Random Forest) and rare used rule based PRISM , VFST, etc classifiers are empirical substitutions of theoretically correct to use Boolean functions minimization. Developing Minimization of Boolean functions algorithms is started long time ago by Edward Veitch's 1952. Since it, big efforts by wide scientific/industrial community was done to find feasible solution of Boolean functions minimization. In this paper we propose consider table data classification from mathematical point of view, as minimization of Boolean functions. It is shown that data representation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopological and Geometric Data Analysis · Anomaly Detection Techniques and Applications · Advanced Graph Neural Networks