Expressing linear equality constraints in feedforward neural networks

Anand Rangarajan; Pan He; Jaemoon Lee; Tania Banerjee; Sanjay Ranka

arXiv:2211.04395·cs.LG·January 10, 2023·1 cites

Expressing linear equality constraints in feedforward neural networks

Anand Rangarajan, Pan He, Jaemoon Lee, Tania Banerjee, Sanjay Ranka

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel method to impose linear equality constraints in feedforward neural networks by using a saddle-point Lagrangian with auxiliary variables, enabling standard training methods to handle constraints effectively.

Contribution

A new saddle-point Lagrangian formulation with auxiliary predictor variables that simplifies imposing linear equality constraints in neural networks.

Findings

01

Lagrange parameters interpreted as fixed-weight hidden units

02

Enables standard optimization despite constraints

03

Applicable to multi-label classification and autoencoders

Abstract

We seek to impose linear, equality constraints in feedforward neural networks. As top layer predictors are usually nonlinear, this is a difficult task if we seek to deploy standard convex optimization methods and strong duality. To overcome this, we introduce a new saddle-point Lagrangian with auxiliary predictor variables on which constraints are imposed. Elimination of the auxiliary variables leads to a dual minimization problem on the Lagrange multipliers introduced to satisfy the linear constraints. This minimization problem is combined with the standard learning problem on the weight matrices. From this theoretical line of development, we obtain the surprising interpretation of Lagrange parameters as additional, penultimate layer hidden units with fixed weights stemming from the constraints. Consequently, standard minimization approaches can be used despite the inclusion of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

anandrajan0/smartalec
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Machine Learning in Materials Science · Model Reduction and Neural Networks