PAC-learning gains of Turing machines over circuits and neural networks
Brieuc Pinon, Rapha\"el Jungers, Jean-Charles Delvenne

TL;DR
This paper explores how using Turing machines instead of neural networks or circuits can theoretically improve sample efficiency in machine learning, linking computational complexity to learning performance.
Contribution
It provides bounds on sample efficiency gains when applying the minimum description length principle with Turing machines, connecting complexity theory with learning models.
Findings
Bounds depend on input bit-size
Links to circuit complexity open problems
Potential for improved sample efficiency
Abstract
A caveat to many applications of the current Deep Learning approach is the need for large-scale data. One improvement suggested by Kolmogorov Complexity results is to apply the minimum description length principle with computationally universal models. We study the potential gains in sample efficiency that this approach can bring in principle. We use polynomial-time Turing machines to represent computationally universal models and Boolean circuits to represent Artificial Neural Networks (ANNs) acting on finite-precision digits. Our analysis unravels direct links between our question and Computational Complexity results. We provide lower and upper bounds on the potential gains in sample efficiency between the MDL applied with Turing machines instead of ANNs. Our bounds depend on the bit-size of the input of the Boolean function to be learned. Furthermore, we highlight close…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Computability, Logic, AI Algorithms · Ferroelectric and Negative Capacitance Devices
MethodsMinimum Description Length
