Toward Reliable, Safe, and Secure LLMs for Scientific Applications

Saket Sanjeev Chaturvedi; Joshua Bergerson; Tanwi Mallick

arXiv:2603.18235·cs.CR·March 20, 2026

Toward Reliable, Safe, and Secure LLMs for Scientific Applications

Saket Sanjeev Chaturvedi, Joshua Bergerson, Tanwi Mallick

PDF

Open Access

TL;DR

This paper addresses the unique safety, security, and reliability challenges of deploying large language models in scientific research, proposing a new evaluation framework and defense strategies tailored for science-specific risks.

Contribution

It introduces a detailed threat taxonomy for LLMs in science and proposes a multi-agent system for generating domain-specific adversarial benchmarks and defense mechanisms.

Findings

01

Synthesized a taxonomy of LLM threats in scientific research.

02

Proposed a multi-agent system for generating security benchmarks.

03

Outlined a multilayered defense framework for trustworthy LLM deployment.

Abstract

As large language models (LLMs) evolve into autonomous "AI scientists," they promise transformative advances but introduce novel vulnerabilities, from potential "biosafety risks" to "dangerous explosions." Ensuring trustworthy deployment in science requires a new paradigm centered on reliability (ensuring factual accuracy and reproducibility), safety (preventing unintentional physical or biological harm), and security (preventing malicious misuse). Existing general-purpose safety benchmarks are poorly suited for this purpose, suffering from a fundamental domain mismatch, limited threat coverage of science-specific vectors, and benchmark overfitting, which create a critical gap in vulnerability evaluation for scientific applications. This paper examines the unique security and safety landscape of LLM agents in science. We begin by synthesizing a detailed taxonomy of LLM threats…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Artificial Intelligence in Healthcare and Education · Machine Learning in Materials Science