Strong Memory Lower Bounds for Learning Natural Models
Gavin Brown, Mark Bun, Adam Smith

TL;DR
This paper establishes fundamental memory lower bounds for streaming algorithms learning natural models, showing that minimal example usage requires space proportional to the product of data dimension and model complexity.
Contribution
It provides the first broad range memory lower bounds for natural learning problems in streaming settings, extending beyond specific cases like parity.
Findings
Memory lower bounds match the ambient space dimension
Bounds apply across a wide range of input sizes
Space complexity scales with data and model parameters
Abstract
We give lower bounds on the amount of memory required by one-pass streaming algorithms for solving several natural learning problems. In a setting where examples lie in and the optimal classifier can be encoded using bits, we show that algorithms which learn using a near-minimal number of examples, , must use bits of space. Our space bounds match the dimension of the ambient space of the problem's natural parametrization, even when it is quadratic in the size of examples and the final classifier. For instance, in the setting of -sparse linear classifiers over degree-2 polynomial features, for which , our space lower bound is . Our bounds degrade gracefully with the stream length , generally having the form . Bounds…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Complexity and Algorithms in Graphs · Algorithms and Data Compression
