# Do we still need fuzzy classifiers for Small Data in the Era of Big   Data?

**Authors:** Mikel Elkano, Humberto Bustince, Mikel Galar

arXiv: 1903.03324 · 2019-03-11

## TL;DR

This paper empirically compares fuzzy classifiers designed for Big Data with those for Small Data, revealing that Big Data classifiers are increasingly competitive in accuracy, hinting at the possibility of a unified approach for both data scales.

## Contribution

It provides the first empirical analysis of the performance tradeoffs between Big Data and Small Data fuzzy classifiers, highlighting their convergence in accuracy.

## Key findings

- Big Data classifiers are slightly less accurate but improving.
- Big Data classifiers are catching up with Small Data classifiers.
- Potential for a unified learning algorithm for both data scales.

## Abstract

The Era of Big Data has forced researchers to explore new distributed solutions for building fuzzy classifiers, which often introduce approximation errors or make strong assumptions to reduce computational and memory requirements. As a result, Big Data classifiers might be expected to be inferior to those designed for standard classification tasks (Small Data) in terms of accuracy and model complexity. To our knowledge, however, there is no empirical evidence to confirm such a conjecture yet. Here, we investigate the extent to which state-of-the-art fuzzy classifiers for Big Data sacrifice performance in favor of scalability. To this end, we carry out an empirical study that compares these classifiers with some of the best performing algorithms for Small Data. Assuming the latter were generally designed for maximizing performance without considering scalability issues, the results of this study provide some intuition around the tradeoff between performance and scalability achieved by current Big Data solutions. Our findings show that, although slightly inferior, Big Data classifiers are gradually catching up with state-of-the-art classifiers for Small data, suggesting that a unified learning algorithm for Big and Small Data might be possible.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1903.03324/full.md

## References

18 references — full list in the complete paper: https://tomesphere.com/paper/1903.03324/full.md

---
Source: https://tomesphere.com/paper/1903.03324