Convergence of the mini-batch SIHT algorithm
Saeed Damadi, Jinglai Shen

TL;DR
This paper introduces a convergence analysis for the mini-batch Stochastic IHT algorithm, demonstrating its almost sure convergence without the need for increasing mini-batch sizes or strong convexity assumptions.
Contribution
It provides the first proof of almost sure convergence of stochastic function values with fixed mini-batch size in sparse optimization.
Findings
The mini-batch SIHT converges with probability one.
A critical sparse stochastic gradient descent property is established.
Convergence is proven without assuming restricted strong convexity.
Abstract
The Iterative Hard Thresholding (IHT) algorithm has been considered extensively as an effective deterministic algorithm for solving sparse optimizations. The IHT algorithm benefits from the information of the batch (full) gradient at each point and this information is a crucial key for the convergence analysis of the generated sequence. However, this strength becomes a weakness when it comes to machine learning and high dimensional statistical applications because calculating the batch gradient at each iteration is computationally expensive or impractical. Fortunately, in these applications the objective function has a summation structure that can be taken advantage of to approximate the batch gradient by the stochastic mini-batch gradient. In this paper, we study the mini-batch Stochastic IHT (SIHT) algorithm for solving the sparse optimizations. As opposed to previous works where…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
