Mean-field inference methods for neural networks

Marylou Gabri\'e

arXiv:1911.00890·cond-mat.dis-nn·March 24, 2020

Mean-field inference methods for neural networks

Marylou Gabri\'e

PDF

1 Datasets

TL;DR

This paper reviews classical and recent mean-field methods used for inference in neural networks, connecting statistical physics approaches with machine learning techniques to enhance understanding of neural network behavior.

Contribution

It provides a comprehensive overview of mean-field approximation strategies, including derivations, equivalences, and recent advances relevant for neural network inference.

Findings

01

Clarifies the connections between different mean-field methods.

02

Highlights recent progress in applying statistical physics to neural networks.

03

Provides references for ongoing research directions.

Abstract

Machine learning algorithms relying on deep neural networks recently allowed a great leap forward in artificial intelligence. Despite the popularity of their applications, the efficiency of these algorithms remains largely unexplained from a theoretical point of view. The mathematical description of learning problems involves very large collections of interacting random variables, difficult to handle analytically as well as numerically. This complexity is precisely the object of study of statistical physics. Its mission, originally pointed towards natural systems, is to understand how macroscopic behaviors arise from microscopic laws. Mean-field methods are one type of approximation strategy developed in this view. We review a selection of classical mean-field methods and recent progress relevant for inference in neural networks. In particular, we remind the principles of derivations of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

Kylan12/Synthetic-AI-ML-Dataset
dataset· 42 dl
42 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.