Seeing the imagined: a latent functional alignment in visual imagery decoding from fMRI data
Fabrizio Spera, Tommaso Boccato, Michal Olak, Sara Cammarota, Matteo Ciferri, Michelangelo Tronti, Nicola Toschi, Matteo Ferrante

TL;DR
This paper introduces a latent functional alignment method to improve visual imagery decoding from fMRI data by leveraging perception-trained models and semantic retrieval augmentation, demonstrating consistent improvements across subjects.
Contribution
It proposes a novel latent alignment approach that enhances imagery decoding by mapping activity into a pretrained model's space and uses retrieval-based augmentation for better performance.
Findings
Latent functional alignment improves semantic reconstruction metrics.
The method enables above-chance decoding from multiple cortical regions.
It outperforms baseline methods in perception-to-imagery transfer tasks.
Abstract
Recent progress in visual brain decoding from fMRI has been enabled by large-scale datasets such as the Natural Scenes Dataset (NSD) and powerful diffusion-based generative models. While current pipelines are primarily optimized for perception, their performance under mental-imagery remains less well understood. In this work, we study how a state-of-the-art (SOTA) perception decoder (DynaDiff) can be adapted to reconstruct imagined content from the Imagery-NSD benchmark. We propose a latent functional alignment approach that maps imagery-evoked activity into the pretrained model's conditioning space, while keeping the remaining components frozen. To mitigate the limited amount of matched imagery-perception supervision, we further introduce a retrieval-based augmentation strategy that selects semantically related NSD perception trials. Across four subjects, latent functional alignment…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
