Power-law Scaling to Assist with Key Challenges in Artificial   Intelligence

Yuval Meir; Shira Sardi; Shiri Hodassman; Karin Kisos; Itamar; Ben-Noam; Amir Goldental; Ido Kanter

arXiv:2211.08430·cs.LG·November 17, 2022

Power-law Scaling to Assist with Key Challenges in Artificial Intelligence

Yuval Meir, Shira Sardi, Shiri Hodassman, Karin Kisos, Itamar, Ben-Noam, Amir Goldental, Ido Kanter

PDF

TL;DR

This paper demonstrates how power-law scaling in deep learning can predict dataset size requirements, improve decision speed, and benchmark training complexity, thereby addressing key challenges in artificial intelligence.

Contribution

It introduces the application of power-law scaling to estimate dataset size, enhance rapid decision-making, and establish benchmarks in AI training complexity.

Findings

01

Test errors converge as a power-law to zero with database size.

02

Power-law exponent increases with the number of hidden layers.

03

Estimated test error approaches state-of-the-art for large epoch numbers.

Abstract

Power-law scaling, a central concept in critical phenomena, is found to be useful in deep learning, where optimized test errors on handwritten digit examples converge as a power-law to zero with database size. For rapid decision making with one training epoch, each example is presented only once to the trained network, the power-law exponent increased with the number of hidden layers. For the largest dataset, the obtained test error was estimated to be in the proximity of state-of-the-art algorithms for large epoch numbers. Power-law scaling assists with key challenges found in current artificial intelligence applications and facilitates an a priori dataset size estimation to achieve a desired test accuracy. It establishes a benchmark for measuring training complexity and a quantitative hierarchy of machine learning tasks and algorithms.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsTest