Information Theory and its Relation to Machine Learning

Bao-Gang Hu

arXiv:1501.04309·cs.IT·January 20, 2015

TL;DR

This paper presents a new perspective on machine learning by analyzing four fundamental problems, emphasizing the importance of learning target selection, and explores the connection between information theory and ML.

Contribution

It introduces a novel framework for understanding ML problems, reviews existing links between information theory and ML, and proposes a conjecture for a unified mathematical interpretation.

Findings

01. A theorem relating the empirically defined similarity measure to information measures.

02. A review of information-theoretic approaches in ML.

03. A conjecture toward a unified mathematical interpretation of learning target selection.

Abstract

In this position paper, I first describe a new perspective on machine learning (ML) in terms of four basic problems (or levels), namely "What to learn?", "How to learn?", "What to evaluate?", and "What to adjust?". The paper places most emphasis on the first level, "What to learn?", or "Learning Target Selection". For this primary problem among the four levels, I briefly review existing studies on the connection between information theoretical learning (ITL [1]) and machine learning. A theorem is given on the relation between the empirically defined similarity measure and information measures. Finally, a conjecture is proposed in pursuit of a unified mathematical interpretation of learning target selection.

Taxonomy

Topics: Machine Learning and Algorithms · Computability, Logic, AI Algorithms · Neural Networks and Applications