From Bricks to Bridges: Product of Invariances to Enhance Latent Space   Communication

Irene Cannistraci; Luca Moschella; Marco Fumero; Valentino Maiorca,; Emanuele Rodol\`a

arXiv:2310.01211·cs.LG·March 21, 2024

From Bricks to Bridges: Product of Invariances to Enhance Latent Space Communication

Irene Cannistraci, Luca Moschella, Marco Fumero, Valentino Maiorca,, Emanuele Rodol\`a

PDF

Open Access

TL;DR

This paper introduces a method to incorporate invariances into neural network representations, enabling better latent space communication and model merging across diverse modalities without prior invariance knowledge.

Contribution

It proposes a versatile approach to embed invariances directly into latent representations, facilitating model stitching and reuse without needing task-specific invariance estimation.

Findings

01

Consistent improvements in latent similarity and downstream performance.

02

Validated across vision, text, and graph modalities.

03

Effective in zero-shot model stitching scenarios.

Abstract

It has been observed that representations learned by distinct neural networks conceal structural similarities when the models are trained under similar inductive biases. From a geometric perspective, identifying the classes of transformations and the related invariances that connect these representations is fundamental to unlocking applications, such as merging, stitching, and reusing different neural modules. However, estimating task-specific transformations a priori can be challenging and expensive due to several factors (e.g., weights initialization, training hyperparameters, or data modality). To this end, we introduce a versatile method to directly incorporate a set of invariances into the representations, constructing a product space of invariant components on top of the latent representations without requiring prior knowledge about the optimal invariance to infuse. We validate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Human Pose and Action Recognition · Multimodal Machine Learning Applications