ADMM for Structured Fractional Minimization
Ganzhao Yuan

TL;DR
This paper introduces FADMM, a novel ADMM-based algorithm for structured fractional minimization problems with nonconvex and nonsmooth components, demonstrating convergence and superior performance in machine learning applications.
Contribution
The paper develops the first ADMM-based method tailored for structured fractional minimization, with convergence guarantees and variants using Dinkelbach's and quadratic transform methods.
Findings
FADMM converges to ε-approximate critical points within O(1/ε^3) iterations.
The method outperforms existing subgradient and smoothing proximal gradient methods.
Experimental results show effectiveness in applications like sparse Fisher discriminant analysis and robust sparse recovery.
Abstract
This paper considers a class of structured fractional minimization problems. The numerator consists of a differentiable function, a simple nonconvex nonsmooth function, a concave nonsmooth function, and a convex nonsmooth function composed with a linear operator. The denominator is a continuous function that is either weakly convex or has a weakly convex square root. These problems are prevalent in various important applications in machine learning and data science. Existing methods, primarily based on subgradient methods and smoothing proximal gradient methods, often suffer from slow convergence and numerical stability issues. In this paper, we introduce {\sf FADMM}, the first Alternating Direction Method of Multipliers tailored for this class of problems. {\sf FADMM} decouples the original problem into linearized proximal subproblems, featuring two variants: one using Dinkelbach's…
Peer Reviews
Decision·ICLR 2025 Poster
The authors discuss a class of fractional minimization problems, which seem not to be well addressed in the literature. The authors provide corresponding theoretical analysis and numerical results.
The results in Lemmas 3.9 and 3.10 are standard in the literature. The authors do not need to prove them in the appendix and should not claim the results ``represent novel contributions.'' After using the smoothing techniques, the hard term $h(Ax)$ will become $h_{\mu}(Ax)$. The authors could then use some standard methods from the fractional minimization community to solve this problem. The corresponding complexity will also be $\mathcal{O}(\epsilon^{-3})$. The authors might want to comment on
1.The authors provide a comprehensive analysis of the proposed FADMM algorithm, including two specific variants: FADMM-D and FADMM-Q. 2.Comprehensive theoretical analysis, with proofs on convergence. 3.The authors conduct extensive experiments on both synthetic and real-world data, effectively demonstrating the efficiency of the FADMM algorithm.
I think the writing of this paper can be further improved.
- The class of problems addressed is broader than what was previously treated in the literature. - The theoretical results about the convergence of the methods and their rates are strong and significant. - The arguments made were rigorous and clearly stated. I wasn't able to review all the proofs in detail as there were many that were relegated to the appendix and the paper itself is quite long at nearly 40 pages. - The methods appear to perform well in the numerical experiments compared to
- The technical density of the presentation harms the readability. - The discussion after Remark 3.5, on the stationarity conditions for this problem, feels very rushed and I don't understand exactly the justifications for all the claims made. In particular there are many references but they are vague - I think it would improve a lot the clarity of the paper if you could specify a lemma or result from those papers that clearly justifies what you are claiming with the subdifferential caclulus he
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Control Systems Design · Fractional Differential Equations Solutions · Iterative Methods for Nonlinear Equations
MethodsAlternating Direction Method of Multipliers
