Affinity-VAE: incorporating prior knowledge in representation learning from scientific images
Marjan Famili, Jola Mirecka, Camila Rangel Smith, Anna Kota\'nska,, Nikolai Juraschko, Beatriz Costa-Gomes, Colin M. Palmer, Jeyan Thiyagalingam,, Tom Burnley, Mark Basham, Alan R. Lowe

TL;DR
Affinity-VAE is a novel generative model that incorporates prior scientific knowledge into the learning process, producing interpretable, invariant representations of cryo-ET images to improve molecule identification and understanding.
Contribution
The paper introduces Affinity-VAE, a new model that integrates prior knowledge into representation learning for scientific images, enhancing interpretability and clustering of noisy cryo-ET data.
Findings
Creates rotationally-invariant, morphologically homogeneous clusters
Improves cluster separation over existing methods
Captures protein orientation for scientific analysis
Abstract
Learning compact and interpretable representations of data is a critical challenge in scientific image analysis. Here, we introduce Affinity-VAE, a generative model that enables us to impose our scientific intuition about the similarity of instances in the dataset on the learned representation during training. We demonstrate the utility of the approach in the scientific domain of cryo-electron tomography (cryo-ET) where a significant current challenge is to identify similar molecules within a noisy and low contrast tomographic image volume. This task is distinct from classification in that, at inference time, it is unknown whether an instance is part of the training set or not. We trained affinity-VAE using prior knowledge of protein structure to inform the latent space. Our model is able to create rotationally-invariant, morphologically homogeneous clusters in the latent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Physics and Python Applications · Fractal and DNA sequence analysis · Machine Learning in Bioinformatics
