Q-Newton: Hybrid Quantum-Classical Scheduling for Accelerating Neural Network Training with Newton's Gradient Descent
Pingzhi Li, Junyu Liu, Hanrui Wang, Tianlong Chen

TL;DR
Q-Newton introduces a hybrid quantum-classical scheduling method to accelerate neural network training by leveraging quantum linear solvers for second-order optimization, promising significant reductions in training time.
Contribution
The paper proposes a novel hybrid quantum-classical scheduler that integrates quantum linear solvers into Newton's gradient descent for faster neural network training.
Findings
Q-Newton can significantly reduce training time compared to SGD.
The hybrid approach effectively estimates and reduces condition number $ppa$ and sparsity $d$.
Potential for exponential speedup with future quantum hardware advancements.
Abstract
Optimization techniques in deep learning are predominantly led by first-order gradient methodologies, such as SGD. However, neural network training can greatly benefit from the rapid convergence characteristics of second-order optimization. Newton's GD stands out in this category, by rescaling the gradient using the inverse Hessian. Nevertheless, one of its major bottlenecks is matrix inversion, which is notably time-consuming in time with weak scalability. Matrix inversion can be translated into solving a series of linear equations. Given that quantum linear solver algorithms (QLSAs), leveraging the principles of quantum superposition and entanglement, can operate within a time frame, they present a promising approach with exponential acceleration. Specifically, one of the most recent QLSAs demonstrates a complexity scaling of $O(d\cdot\kappa…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQuantum Computing Algorithms and Architecture
MethodsStochastic Gradient Descent
