A New Information Theory of Certainty for Machine Learning

Arthur Jun Zhang

arXiv:2304.12833 · cs.IT · April 26, 2023 · 1 citation

Open Access

TL;DR

This paper introduces troenpy, a new measure dual to entropy, to quantify certainty in data distributions, with applications in document classification, language modeling, and quantum systems.

Contribution

It proposes troenpy as a novel concept dual to entropy, enabling new ways to model certainty in classical and quantum data.

Findings

01 Troenpy improves document classification weighting schemes.

02 Self-troenpy reduces perplexity in neural language models.

03 Quantum troenpy quantifies certainty in quantum systems.

Abstract

Claude Shannon introduced entropy to quantify the uncertainty of a random distribution for communication coding theory. We observe that entropy, being a measure of uncertainty, is limited in its direct use in mathematical modeling. We therefore propose a new concept, troenpy, as the canonical dual of entropy, to quantify the certainty of the underlying distribution. We demonstrate two applications in machine learning. The first is classical document classification, for which we develop a troenpy-based weighting scheme that leverages the document class labels. The second is a self-troenpy weighting scheme for sequential data, which we show can be easily included in neural-network-based language models and achieves a dramatic perplexity reduction. We also define quantum troenpy as the dual of the von Neumann entropy to quantify the certainty of quantum systems.
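
For readers who want a concrete reference point, the following is a minimal sketch of the standard Shannon entropy that the abstract takes as its starting point. It does not implement troenpy, whose definition is not given on this page. The function name `shannon_entropy` and the example distributions are illustrative assumptions, not taken from the paper.

```python
import math


def shannon_entropy(probs):
    """Shannon entropy, in bits, of a discrete probability distribution.

    `probs` is a sequence of non-negative numbers summing to 1.
    Zero-probability outcomes contribute nothing (0 * log 0 is taken as 0).
    This is the textbook definition, not the paper's troenpy.
    """
    if any(p < 0 for p in probs):
        raise ValueError("probabilities must be non-negative")
    if not math.isclose(sum(probs), 1.0, abs_tol=1e-9):
        raise ValueError("probabilities must sum to 1")
    return sum(-p * math.log2(p) for p in probs if p > 0)


# Illustrative distributions over four outcomes (hypothetical values).
uniform = [0.25, 0.25, 0.25, 0.25]   # maximally uncertain
skewed = [0.7, 0.1, 0.1, 0.1]        # one outcome dominates
point_mass = [1.0, 0.0, 0.0, 0.0]    # fully certain

h_uniform = shannon_entropy(uniform)
h_skewed = shannon_entropy(skewed)
h_point = shannon_entropy(point_mass)
```

Entropy is largest for the uniform distribution and falls to zero for the point mass, so it ranks distributions by uncertainty. The abstract describes troenpy as a dual quantity that measures certainty instead.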

Taxonomy

Topics: Neural Networks and Applications · Statistical Mechanics and Entropy