# Is 'Unsupervised Learning' a Misconceived Term?

**Authors:** Stephen G. Odaibo

arXiv: 1904.03259 · 2019-04-09

## TL;DR

This paper argues that all machine learning involves some supervision, challenging the traditional supervised vs unsupervised categorization, and proposes a new framework based on internal and external supervision.

## Contribution

It introduces a novel perspective that classifies learning algorithms as internally or externally supervised, redefining the fundamental understanding of supervision in machine learning.

## Key findings

- Clustering and dimensionality reduction algorithms are internally supervised by data.
- Such algorithms require external labeling of their outputs for external inference.
- The traditional supervised vs unsupervised categorization should be replaced with internal/external supervision.

## Abstract

Is all of machine learning supervised to some degree? The field of machine learning has traditionally been categorized pedagogically into $supervised~vs~unsupervised~learning$; where supervised learning has typically referred to learning from labeled data, while unsupervised learning has typically referred to learning from unlabeled data. In this paper, we assert that all machine learning is in fact supervised to some degree, and that the scope of supervision is necessarily commensurate to the scope of learning potential. In particular, we argue that clustering algorithms such as k-means, and dimensionality reduction algorithms such as principal component analysis, variational autoencoders, and deep belief networks are each internally supervised by the data itself to learn their respective representations of its features. Furthermore, these algorithms are not capable of external inference until their respective outputs (clusters, principal components, or representation codes) have been identified and externally labeled in effect. As such, they do not suffice as examples of unsupervised learning. We propose that the categorization `supervised vs unsupervised learning' be dispensed with, and instead, learning algorithms be categorized as either $internally~or~externally~supervised$ (or both). We believe this change in perspective will yield new fundamental insights into the structure and character of data and of learning algorithms.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.03259/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/1904.03259/full.md

## References

15 references — full list in the complete paper: https://tomesphere.com/paper/1904.03259/full.md

---
Source: https://tomesphere.com/paper/1904.03259