The Copycat Perceptron: Smashing Barriers Through Collective Learning
Giovanni Catania, Aur\'elien Decelle, and Beatriz Seoane

TL;DR
This paper analyzes a model of coupled binary perceptrons with thermal noise, showing that collective learning via replicas smooths the landscape, enabling efficient learning and supporting the Bayes-optimality of replicated algorithms.
Contribution
It provides a detailed characterization of equilibrium properties of coupled perceptrons with thermal noise, demonstrating how replica coupling improves learning efficiency and landscape smoothness.
Findings
Replica coupling leads to a smoother free entropy landscape.
Replicated Simulated Annealing can efficiently reach the teacher solution.
Multiple students can learn faster and with fewer examples.
Abstract
We characterize the equilibrium properties of a model of coupled binary perceptrons in the teacher-student scenario, subject to a suitable cost function, with an explicit ferromagnetic coupling proportional to the Hamming distance between the students' weights. In contrast to recent works, we analyze a more general setting in which thermal noise is present that affects each student's generalization performance. In the nonzero temperature regime, we find that the coupling of replicas leads to a bend of the phase diagram towards smaller values of : This suggests that the free entropy landscape gets smoother around the solution with perfect generalization (i.e., the teacher) at a fixed fraction of examples, allowing standard thermal updating algorithms such as Simulated Annealing to easily reach the teacher solution and avoid getting trapped in metastable states as it happens…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Advanced Thermodynamics and Statistical Mechanics · Theoretical and Computational Physics
