# Generalization from correlated sets of patterns in the perceptron

**Authors:** Francesco Borra, Marco Cosentino Lagomarsino, Pietro Rotondo, Marco, Gherardi

arXiv: 1903.06818 · 2020-01-08

## TL;DR

This paper introduces a framework to quantify how easily a neural network can unify interpretations of different pattern sets based on their dissimilarity, using the perceptron as a case study.

## Contribution

It proposes a novel distance-based measure of generalization and analytically computes the capacity of the perceptron to unify pattern sets at various dissimilarities.

## Key findings

- Generalization capacity decreases with pattern dissimilarity.
- Analytical expression for the perceptron's distance-based capacity.
- Generalization remains possible at any dissimilarity, but with reduced capacity.

## Abstract

Generalization is a central aspect of learning theory. Here, we propose a framework that explores an auxiliary task-dependent notion of generalization, and attempts to quantitatively answer the following question: given two sets of patterns with a given degree of dissimilarity, how easily will a network be able to "unify" their interpretation? This is quantified by the volume of the configurations of synaptic weights that classify the two sets in a similar manner. To show the applicability of our idea in a concrete setting, we compute this quantity for the perceptron, a simple binary classifier, using the classical statistical physics approach in the replica-symmetric ansatz. In this case, we show how an analytical expression measures the "distance-based capacity", the maximum load of patterns sustainable by the network, at fixed dissimilarity between patterns and fixed allowed number of errors. This curve indicates that generalization is possible at any distance, but with decreasing capacity. We propose that a distance-based definition of generalization may be useful in numerical experiments with real-world neural networks, and to explore computationally sub-dominant sets of synaptic solutions.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1903.06818/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/1903.06818/full.md

## References

30 references — full list in the complete paper: https://tomesphere.com/paper/1903.06818/full.md

---
Source: https://tomesphere.com/paper/1903.06818