DFWLayer: Differentiable Frank-Wolfe Optimization Layer

Zixuan Liu; Liu Liu; Xueqian Wang; Peilin Zhao

arXiv:2308.10806·cs.LG·April 1, 2024

DFWLayer: Differentiable Frank-Wolfe Optimization Layer

Zixuan Liu, Liu Liu, Xueqian Wang, Peilin Zhao

PDF

Open Access 1 Repo

TL;DR

This paper introduces DFWLayer, a differentiable layer based on the Frank-Wolfe algorithm, enabling efficient large-scale constrained optimization within neural networks with competitive accuracy and constraint adherence.

Contribution

It presents a novel differentiable layer leveraging the Frank-Wolfe method, avoiding projections and Hessian computations for efficient constrained optimization in neural networks.

Findings

01

DFWLayer achieves competitive solution accuracy.

02

It maintains strict adherence to constraints.

03

The layer is efficient for large-scale problems.

Abstract

Differentiable optimization has received a significant amount of attention due to its foundational role in the domain of machine learning based on neural networks. This paper proposes a differentiable layer, named Differentiable Frank-Wolfe Layer (DFWLayer), by rolling out the Frank-Wolfe method, a well-known optimization algorithm which can solve constrained optimization problems without projections and Hessian matrix computations, thus leading to an efficient way of dealing with large-scale convex optimization problems with norm constraints. Experimental results demonstrate that the DFWLayer not only attains competitive accuracy in solutions and gradients but also consistently adheres to constraints.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

panda-shawn/dfwlayer
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvancements in Semiconductor Devices and Circuit Design