Tightening convex relaxations of trained neural networks: a unified approach for convex and S-shaped activations

Pablo Carrasco; Gonzalo Mu\~noz

arXiv:2410.23362·math.OC·April 1, 2026

Tightening convex relaxations of trained neural networks: a unified approach for convex and S-shaped activations

Pablo Carrasco, Gonzalo Mu\~noz

PDF

TL;DR

This paper introduces a recursive method to produce tight convex relaxations for neural networks with convex or S-shaped activations, enhancing optimization integration.

Contribution

It develops a unified recursive formula for convexifying a broad class of activation functions, improving upon previous convex relaxation techniques.

Findings

01

Provides a recursive formula for convexification of S-shaped activations.

02

Enables efficient computation of separating hyperplanes in neural network relaxations.

03

Extends convex relaxation methods to non-polyhedral activation functions.

Abstract

The non-convex nature of trained neural networks has created significant obstacles in their incorporation into optimization models. In this context, Anderson et al. (2020) provided a framework to obtain the convex hull of the graph of a piecewise linear convex activation function composed with an affine function; this effectively convexifies activations such as the ReLU together with the affine transformation that precedes it. In this article, we contribute to this line of work by developing a recursive formula that yields a tight convexification for the composition of an activation with an affine function for a wide scope of activation functions, namely, convex or ``S-shaped". Our approach can be used to efficiently compute separating hyperplanes or determine that none exists in various settings, including non-polyhedral cases.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.