Principled Weight Initialisation for Input-Convex Neural Networks

Pieter-Jan Hoedt; G\"unter Klambauer

arXiv:2312.12474·cs.LG·December 21, 2023·1 cites

Principled Weight Initialisation for Input-Convex Neural Networks

Pieter-Jan Hoedt, G\"unter Klambauer

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a new weight initialization method for Input-Convex Neural Networks (ICNNs) that improves training efficiency and generalization, enabling effective application in real-world tasks like drug discovery.

Contribution

We derive a principled weight initialization for ICNNs by extending signal propagation theory to non-negative weights, enhancing training and application performance.

Findings

01

Initialization accelerates learning in ICNNs.

02

Proper initialization allows training without skip-connections.

03

ICNNs improve molecular space exploration in drug discovery.

Abstract

Input-Convex Neural Networks (ICNNs) are networks that guarantee convexity in their input-output mapping. These networks have been successfully applied for energy-based modelling, optimal transport problems and learning invariances. The convexity of ICNNs is achieved by using non-decreasing convex activation functions and non-negative weights. Because of these peculiarities, previous initialisation strategies, which implicitly assume centred weights, are not effective for ICNNs. By studying signal propagation through layers with non-negative weights, we are able to derive a principled weight initialisation for ICNNs. Concretely, we generalise signal propagation theory by removing the assumption that weights are sampled from a centred distribution. In a set of experiments, we demonstrate that our principled initialisation effectively accelerates learning in ICNNs and leads to better…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ml-jku/convex-init
pytorchOfficial

Videos

Principled Weight Initialisation for Input-Convex Neural Networks· slideslive

Taxonomy

TopicsMachine Learning in Materials Science · Adversarial Robustness in Machine Learning · Computational Drug Discovery Methods

MethodsSparse Evolutionary Training