NDAI-NeuroMAP: A Neuroscience-Specific Embedding Model for Domain-Specific Retrieval
Devendra Patel, Aaditya Jain, Jayant Verma, Divyansh Rajput, Sunil Mahala, Ketki Suresh Khapare, Jayateja Kalla

TL;DR
NDAI-NeuroMAP is a specialized neuroscience embedding model designed for high-precision retrieval, leveraging extensive domain-specific data and advanced fine-tuning to outperform general models in neuroscience tasks.
Contribution
The paper introduces NDAI-NeuroMAP, the first neuroscience-specific dense embedding model optimized for information retrieval, using a large domain-specific corpus and multi-objective fine-tuning.
Findings
Significant performance improvements over general-purpose models.
Effective use of domain-specific triplets and ontologies.
Enhanced accuracy in neuroscience query retrieval.
Abstract
We present NDAI-NeuroMAP, the first neuroscience-domain-specific dense vector embedding model engineered for high-precision information retrieval tasks. Our methodology encompasses the curation of an extensive domain-specific training corpus comprising 500,000 carefully constructed triplets (query-positive-negative configurations), augmented with 250,000 neuroscience-specific definitional entries and 250,000 structured knowledge-graph triplets derived from authoritative neurological ontologies. We employ a sophisticated fine-tuning approach utilizing the FremyCompany/BioLORD-2023 foundation model, implementing a multi-objective optimization framework combining contrastive learning with triplet-based metric learning paradigms. Comprehensive evaluation on a held-out test dataset comprising approximately 24,000 neuroscience-specific queries demonstrates substantial performance improvements…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
