Unlearning as Ablation: Toward a Falsifiable Benchmark for Generative Scientific Discovery

Robert Yang

arXiv:2508.17681·cs.LG·November 26, 2025

Unlearning as Ablation: Toward a Falsifiable Benchmark for Generative Scientific Discovery

Robert Yang

PDF

Open Access

TL;DR

This paper proposes a systematic unlearning-as-ablation method to test whether large language models genuinely generate new scientific knowledge or simply remix memorized information, aiming to improve AI-for-Science evaluation.

Contribution

It introduces a conceptual and methodological framework for using ablation as a falsifiable probe to distinguish genuine knowledge generation from recall in AI models.

Findings

01

Proposes unlearning-as-ablation as a scientific discovery probe

02

Outlines a pilot in mathematics and algorithms

03

Suggests extension to physics and chemistry domains

Abstract

Bold claims about AI's role in science-from "AGI will cure all diseases" to promises of radically accelerated discovery-raise a central epistemic question: do large language models (LLMs) truly generate new knowledge, or do they merely remix memorized fragments? We propose unlearning-as-ablation as a falsifiable probe of constructive scientific discovery. The idea is to systematically remove a target result together with its forget-closure (supporting lemmas, paraphrases, and multi-hop entailments) and then evaluate whether the model can re-derive the result from only permitted axioms and tools. Success would indicate generative capability beyond recall; failure would expose current limits. Unlike prevailing motivations for unlearning-privacy, copyright, or safety-our framing repositions it as an epistemic probe for AI-for-Science. We outline a minimal pilot in mathematics and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Materials Science · Scientific Computing and Data Management · Explainable Artificial Intelligence (XAI)