Language Model Powered Digital Biology with BRAD
Joshua Pickard, Ram Prakash, Marc Andrew Choi, Natalie Oliven, Cooper, Stansbury, Jillian Cwycyshyn, Alex Gorodetsky, Alvaro Velasquez, and Indika, Rajapakse

TL;DR
BRAD is a novel AI-powered bioinformatics assistant that integrates diverse tools and databases, enabling efficient, context-aware, semi-autonomous biological data analysis through an intuitive GUI.
Contribution
The paper introduces BRAD, a configurable LLM-based bioinformatics system that combines multiple tools and databases for enhanced research workflows.
Findings
BRAD enables retrieval-augmented generation and database searches.
It connects local and online resources for comprehensive analysis.
Provides an intuitive GUI for user-friendly interaction.
Abstract
Recent advancements in Large Language Models (LLMs) are transforming biology, computer science, engineering, and every day life. However, integrating the wide array of computational tools, databases, and scientific literature continues to pose a challenge to biological research. LLMs are well-suited for unstructured integration, efficient information retrieval, and automating standard workflows and actions from these diverse resources. To harness these capabilities in bioinformatics, we present a prototype Bioinformatics Retrieval Augmented Digital assistant (BRAD). BRAD is a chatbot and agentic system that integrates a variety of bioinformatics tools. The Python package implements an AI \texttt{Agent} that is powered by LLMs and connects to a local file system, online databases, and a user's software. The \texttt{Agent} is highly configurable, enabling tasks such as Retrieval-Augmented…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetics, Bioinformatics, and Biomedical Research · Scientific Computing and Data Management
