Exploration and Comparison of Deep Learning Architectures to Predict Brain Response to Realistic Pictures
Riccardo Chimisso, Sathya Bur\v{s}i\'c, Paolo Marocco, Giuseppe, Vizzari, Dimitri Ognibene

TL;DR
This study explores various deep learning architectures, especially using CLIP embeddings, to predict brain responses to realistic images, highlighting the challenges and effective strategies in modeling high-dimensional neural data.
Contribution
The paper systematically compares different deep learning models for brain response prediction, demonstrating that simple ROI-specific linear models with CLIP embeddings perform best.
Findings
Simple ROI-specific models outperform complex architectures.
Using CLIP embeddings improves prediction accuracy.
Challenges include high dimensionality and noisy data.
Abstract
We present an exploration of machine learning architectures for predicting brain responses to realistic images on occasion of the Algonauts Challenge 2023. Our research involved extensive experimentation with various pretrained models. Initially, we employed simpler models to predict brain activity but gradually introduced more complex architectures utilizing available data and embeddings generated by large-scale pre-trained models. We encountered typical difficulties related to machine learning problems, e.g. regularization and overfitting, as well as issues specific to the challenge, such as difficulty in combining multiple input encodings, as well as the high dimensionality, unclear structure, and noisy nature of the output. To overcome these issues we tested single edge 3D position-based, multi-region of interest (ROI) and hemisphere predictor models, but we found that employing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFunctional Brain Connectivity Studies · Optical Imaging and Spectroscopy Techniques
MethodsLinear Layer · Contrastive Language-Image Pre-training
