From Shannon's Channel to Semantic Channel via New Bayes' Formulas for Machine Learning
Chenguang Lu

TL;DR
This paper introduces a novel semantic information theory and the Channels' Matching algorithm, transforming Shannon's channel into a semantic channel for improved machine learning and natural language understanding.
Contribution
It develops a third-kind Bayes' theorem, defines semantic channels, and proposes the CM algorithm, offering a new framework for semantic information processing in machine learning.
Findings
Semantic channel conversion via Bayes' theorem
CM algorithm explains language evolution
Overcomes class imbalance in prediction
Abstract
A group of transition probability functions form a Shannon's channel whereas a group of truth functions form a semantic channel. By the third kind of Bayes' theorem, we can directly convert a Shannon's channel into an optimized semantic channel. When a sample is not big enough, we can use a truth function with parameters to produce the likelihood function, then train the truth function by the conditional sampling distribution. The third kind of Bayes' theorem is proved. A semantic information theory is simply introduced. The semantic information measure reflects Popper's hypothesis-testing thought. The Semantic Information Method (SIM) adheres to maximum semantic information criterion which is compatible with maximum likelihood criterion and Regularized Least Squares criterion. It supports Wittgenstein's view: the meaning of a word lies in its use. Letting the two channels mutually…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRough Sets and Fuzzy Logic · Text and Document Classification Technologies · Machine Learning and Algorithms
