From Shannon's Channel to Semantic Channel via New Bayes' Formulas for   Machine Learning

Chenguang Lu

arXiv:1803.08979·cs.LG·March 28, 2018

From Shannon's Channel to Semantic Channel via New Bayes' Formulas for Machine Learning

Chenguang Lu

PDF

Open Access

TL;DR

This paper introduces a novel semantic information theory and the Channels' Matching algorithm, transforming Shannon's channel into a semantic channel for improved machine learning and natural language understanding.

Contribution

It develops a third-kind Bayes' theorem, defines semantic channels, and proposes the CM algorithm, offering a new framework for semantic information processing in machine learning.

Findings

01

Semantic channel conversion via Bayes' theorem

02

CM algorithm explains language evolution

03

Overcomes class imbalance in prediction

Abstract

A group of transition probability functions form a Shannon's channel whereas a group of truth functions form a semantic channel. By the third kind of Bayes' theorem, we can directly convert a Shannon's channel into an optimized semantic channel. When a sample is not big enough, we can use a truth function with parameters to produce the likelihood function, then train the truth function by the conditional sampling distribution. The third kind of Bayes' theorem is proved. A semantic information theory is simply introduced. The semantic information measure reflects Popper's hypothesis-testing thought. The Semantic Information Method (SIM) adheres to maximum semantic information criterion which is compatible with maximum likelihood criterion and Regularized Least Squares criterion. It supports Wittgenstein's view: the meaning of a word lies in its use. Letting the two channels mutually…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRough Sets and Fuzzy Logic · Text and Document Classification Technologies · Machine Learning and Algorithms