Multi-Modal Learning meets Genetic Programming: Analyzing Alignment in Latent Space Optimization
Benjamin L\'eger, Kazem Meidani, Christian Gagn\'e

TL;DR
This paper critically evaluates SNIP, a multi-modal latent space optimization approach for symbolic regression, revealing that its coarse alignment limits effective symbolic search despite promising potential.
Contribution
It provides an empirical analysis showing that SNIP's cross-modal alignment does not improve during optimization, questioning its effectiveness for symbolic search.
Findings
Cross-modal alignment does not improve with fitness increase.
SNIP's learned alignment is too coarse for effective symbolic search.
Effective fine-grained alignment is necessary for future improvements.
Abstract
Symbolic regression (SR) aims to discover mathematical expressions from data, a task traditionally tackled using Genetic Programming (GP) through combinatorial search over symbolic structures. Latent Space Optimization (LSO) methods use neural encoders to map symbolic expressions into continuous spaces, transforming the combinatorial search into continuous optimization. SNIP (Meidani et al., 2024), a contrastive pre-training model inspired by CLIP, advances LSO by introducing a multi-modal approach: aligning symbolic and numeric encoders in a shared latent space to learn the phenotype-genotype mapping, enabling optimization in the numeric space to implicitly guide symbolic search. However, this relies on fine-grained cross-modal alignment, whereas literature on similar models like CLIP reveals that such an alignment is typically coarse-grained. In this paper, we investigate whether SNIP…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
