SpectraQuery: A Hybrid Retrieval-Augmented Conversational Assistant for Battery Science
Sreya Vangara, Jagjit Nanda, Yan-Kai Tzeng, Eric Darve

TL;DR
SpectraQuery is a hybrid conversational assistant that integrates structured experimental data with scientific literature, enabling accurate, grounded, and explainable reasoning in battery science through combined semantic parsing and retrieval methods.
Contribution
It introduces a novel hybrid retrieval-augmented framework that unifies structured data and literature for scientific reasoning, with high accuracy and expert-rated performance.
Findings
80% of SQL queries are fully correct
Answer groundedness reaches 93-97% with retrieved passages
High expert ratings on accuracy and relevance (4.1-4.6/5)
Abstract
Scientific reasoning increasingly requires linking structured experimental data with the unstructured literature that explains it, yet most large language model (LLM) assistants cannot reason jointly across these modalities. We introduce SpectraQuery, a hybrid natural-language query framework that integrates a relational Raman spectroscopy database with a vector-indexed scientific literature corpus using a Structured and Unstructured Query Language (SUQL)-inspired design. By combining semantic parsing with retrieval-augmented generation, SpectraQuery translates open-ended questions into coordinated SQL and literature retrieval operations, producing cited answers that unify numerical evidence with mechanistic explanation. Across SQL correctness, answer groundedness, retrieval effectiveness, and expert evaluation, SpectraQuery demonstrates strong performance: approximately 80 percent of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Spectroscopy Techniques in Biomedical and Chemical Research · Scientific Computing and Data Management
