No Test Cases, No Problem: Distillation-Driven Code Generation for Scientific Workflows

Siddeshwar Raghavan; Tanwi Mallick

arXiv:2604.23106·cs.SE·April 28, 2026

No Test Cases, No Problem: Distillation-Driven Code Generation for Scientific Workflows

Siddeshwar Raghavan, Tanwi Mallick

PDF

TL;DR

MOSAIC is a training-free multi-agent framework that generates scientific workflows without I/O test cases, using knowledge distillation and structured problem decomposition to improve accuracy and consistency.

Contribution

It introduces MOSAIC, a novel approach for scientific code generation that does not require I/O supervision and mitigates hallucinations with a Consolidated Context Window.

Findings

01

MOSAIC outperforms existing methods on the SciCode benchmark.

02

It improves accuracy, executability, and numerical precision.

03

Uses lightweight models with a knowledge distillation framework.

Abstract

Existing multi-agent Large Language Model (LLM) frameworks for code generation typically use execution feedback and improve iteratively using Input/Output (I/O) test cases. However, this does not work for scientific workflows, where I/O test cases do not exist, and generating them requires solving the very problem at hand. To address this, we introduce MOSAIC, a training-free multi-agent framework for scientific code generation without I/O supervision. Instead of execution feedback, MOSAIC employs a student-teacher knowledge distillation framework that grounds generation through domain-specific examples and structured problem decomposition. To further mitigate hallucinations across chained subproblems, we introduce a Consolidated Context Window (CCW) for maintaining consistent reasoning across agents. Experiments on the SciCode benchmark show that MOSAIC improves accuracy,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.