Text to brain: predicting the spatial distribution of neuroimaging observations from text reports
J\'er\^ome Dock\`es (PARIETAL), Demian Wassermann (PARIETAL), Russell, Poldrack, Fabian Suchanek, Bertrand Thirion (PARIETAL), Ga\"el Varoquaux, (PARIETAL)

TL;DR
This paper introduces a method to extract and predict the spatial distribution of brain regions from medical text reports, leveraging a novel risk minimization approach to improve accuracy in unseen documents.
Contribution
It presents a new distribution learning framework that models anatomical terms from text to brain spatial mappings, outperforming traditional methods.
Findings
Semantic variation affects model accuracy
Voxel-wise parameterization improves location prediction
Least-deviation cost outperforms least-square
Abstract
Despite the digital nature of magnetic resonance imaging, the resulting observations are most frequently reported and stored in text documents. There is a trove of information untapped in medical health records, case reports, and medical publications. In this paper, we propose to mine brain medical publications to learn the spatial distribution associated with anatomical terms. The problem is formulated in terms of minimization of a risk on distributions which leads to a least-deviation cost function. An efficient algorithm in the dual then learns the mapping from documents to brain structures. Empirical results using coordinates extracted from the brain-imaging literature show that i) models must adapt to semantic variation in the terms used to describe a given anatomical structure, ii) voxel-wise parameterization leads to higher likelihood of locations reported in unseen documents,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Machine Learning in Healthcare · Topic Modeling
