BioModelsRAG: A Biological Modeling Assistant Using RAG (Retrieval Augmented Generation)
Bhavyahshree Navaneetha Krishnan, Adel Heydarabadipour, Herbert Sauro

TL;DR
BioModelsRAG leverages retrieval-augmented generation with large language models to enable efficient, accurate, and natural language-based analysis of complex biological models from the BioModels database, reducing analysis time and hallucinations.
Contribution
This paper introduces a novel LLM-based assistant that uses retrieval-augmented generation to analyze BioModels, improving accuracy and user interaction in systems biology modeling.
Findings
Enhanced model analysis speed and accuracy.
Reduced hallucination through focused retrieval.
Improved user interaction with biological models.
Abstract
The BioModels database is one of the premier databases for computational models in systems biology. The database contains over 1000 curated models and an even larger number of non-curated models. All the models are stored in the machine-readable format, SBML. Although SBML can be translated into the human readable Antimony format, analyzing the models can still be time consuming. In order to bridge this gap, a LLM (large language model) assistant was created to analyze the BioModels and allow interaction between the user and the model using natural language. By doing so, a user can easily and rapidly extract the salient points in a given model. Our analysis workflow involved 'chunking' BioModels and converting them to plain text using llama3, and then embedding them in a ChromaDB database. The user-provided query was also embedded, and a similarity search was performed between the query…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Gene Regulatory Network Analysis · Microbial Metabolic Engineering and Bioproduction
