Learning from What's Right and Learning from What's Wrong

Bart Jacobs (Institute for Computing; Information Sciences (iCIS),; Radboud University Nijmegen; The Netherlands)

arXiv:2112.14045·cs.LO·December 30, 2021·MFPS

Learning from What's Right and Learning from What's Wrong

Bart Jacobs (Institute for Computing, Information Sciences (iCIS),, Radboud University Nijmegen, The Netherlands)

PDF

TL;DR

This paper clarifies the mathematical distinction between Pearl's and Jeffrey's rules for updating probability distributions, framing them as learning from what's right versus what's wrong, with implications for machine learning and predictive coding.

Contribution

It provides the first precise mathematical characterization of the difference between Pearl's and Jeffrey's updating rules, linking them to validity and divergence minimization.

Findings

01

Pearl's rule increases validity (expected value).

02

Jeffrey's rule decreases Kullback-Leibler divergence.

03

The distinction is illustrated through a cognitive scenario.

Abstract

The concept of updating (or conditioning or revising) a probability distribution is fundamental in (machine) learning and in predictive coding theory. The two main approaches for doing so are called Pearl's rule and Jeffrey's rule. Here we make, for the first time, mathematically precise what distinguishes them: Pearl's rule increases validity (expected value) and Jeffrey's rule decreases (Kullback-Leibler) divergence. This forms an instance of a more general distinction between learning from what's right and learning from what's wrong. The difference between these two approaches is illustrated in a mock cognitive scenario.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.