Beyond SMILES: Evaluating Agentic Systems for Drug Discovery

Edward Wijaya

arXiv:2602.10163·q-bio.QM·February 12, 2026

Beyond SMILES: Evaluating Agentic Systems for Drug Discovery

Edward Wijaya

PDF

Open Access

TL;DR

This paper evaluates the generalization and limitations of agentic systems in drug discovery, highlighting capability gaps and proposing design requirements for future frameworks to serve as effective computational partners.

Contribution

It systematically assesses six agentic frameworks across diverse drug discovery tasks, identifying key capability gaps and proposing a capability matrix for future development.

Findings

01

Five major capability gaps identified in current frameworks.

02

Frontier LLMs can reason about peptides as well as small molecules.

03

Most frameworks do not leverage the full potential of advanced LLMs.

Abstract

Agentic systems for drug discovery have demonstrated autonomous synthesis planning, literature mining, and molecular design. We ask how well they generalize. Evaluating six frameworks against 15 task classes drawn from peptide therapeutics, in vivo pharmacology, and resource-constrained settings, we find five capability gaps: no support for protein language models or peptide-specific prediction, no bridges between in vivo and in silico data, reliance on LLM inference with no pathway to ML training or reinforcement learning, assumptions tied to large-pharma resources, and single-objective optimization that ignores safety-efficacy-stability trade-offs. A paired knowledge-probing experiment suggests the bottleneck is architectural rather than epistemic: four frontier LLMs reason about peptides at levels comparable to small molecules, yet no framework exposes this capability. We propose…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational Drug Discovery Methods · vaccines and immunoinformatics approaches · Chemical Synthesis and Analysis