How Much is Brain Data Worth for Machine Learning?
Lane Lewis, Zhixin Wang, David Schwab, Xaq Pitkow

TL;DR
This paper develops a theoretical framework to quantify the value of neural brain data in enhancing machine learning performance, considering factors like data quantity, noise, and task-brain alignment.
Contribution
It introduces a mathematical model and scaling laws to evaluate when and how brain data improves machine learning models, including robustness and data collection strategies.
Findings
Derived performance scaling laws for models trained on brain and task data.
Quantified the relative value of brain samples based on task-brain alignment and noise levels.
Identified conditions where brain data significantly enhances robustness and learning efficiency.
Abstract
If a person can solve a task, can measuring their brain make it easier to train a model to solve that task too? Recent NeuroAI work suggests that supplementing task training with neural recordings can modestly improve model performance and robustness. However, it is unclear when there should be a benefit from using neural data and how much benefit to expect. We formulate this question mathematically, and begin to address it theoretically using a simple, analytically tractable linear gaussian model of task targets and neural recordings. For a multimodal estimator trained on both brain data and task labels, we derive scaling laws for how performance scales with the numbers of brain and task samples. From these laws we derive relative value and exchange rates between brain samples and task samples, quantifying how much extra task samples neural data is worth as a function of task-brain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
