TL;DR
BOLD5000 is a large-scale, diverse fMRI dataset of nearly 5,000 real-world scene images designed to advance the integration of neuroscience and computer vision through extensive neural imaging data.
Contribution
It introduces a significantly larger and more diverse fMRI dataset of visual perception, bridging the gap between neuroscience and computer vision research.
Findings
Enables detailed analysis of neural responses to diverse visual features.
Facilitates comparison between biological and computer vision representations.
Supports large-scale statistical learning approaches in neuroimaging.
Abstract
Vision science, particularly machine vision, has been revolutionized by introducing large-scale image datasets and statistical learning approaches. Yet, human neuroimaging studies of visual perception still rely on small numbers of images (around 100) due to time-constrained experimental procedures. To apply statistical learning approaches that integrate neuroscience, the number of images used in neuroimaging must be significantly increased. We present BOLD5000, a human functional MRI (fMRI) study that includes almost 5,000 distinct images depicting real-world scenes. Beyond dramatically increasing image dataset size relative to prior fMRI studies, BOLD5000 also accounts for image diversity, overlapping with standard computer vision datasets by incorporating images from the Scene UNderstanding (SUN), Common Objects in Context (COCO), and ImageNet datasets. The scale and diversity of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
