Extreme Entropy Machines: Robust information theoretic classification
Wojciech Marian Czarnecki, Jacek Tabor

TL;DR
This paper introduces Extreme Entropy Machines, a novel classification approach based on entropy measures, offering a robust and scalable alternative to traditional methods like SVMs and ELMs, with strong empirical performance.
Contribution
It proposes a new information theoretic classification model using quadratic Renyi's entropy and Cauchy-Schwarz Divergence, providing robustness and scalability.
Findings
Competitive accuracy with state-of-the-art classifiers
Effective on large and highly unbalanced datasets
Scales well to hundreds of thousands of samples
Abstract
Most of the existing classification methods are aimed at minimization of empirical risk (through some simple point-based error measured with loss function) with added regularization. We propose to approach this problem in a more information theoretic way by investigating applicability of entropy measures as a classification model objective function. We focus on quadratic Renyi's entropy and connected Cauchy-Schwarz Divergence which leads to the construction of Extreme Entropy Machines (EEM). The main contribution of this paper is proposing a model based on the information theoretic concepts which on the one hand shows new, entropic perspective on known linear classifiers and on the other leads to a construction of very robust method competetitive with the state of the art non-information theoretic ones (including Support Vector Machines and Extreme Learning Machines). Evaluation on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and ELM · Neural Networks and Applications · Face and Expression Recognition
