Multi-Task Fine-Tuning Enables Robust Out-of-Distribution Generalization in Atomistic Models
Chengqian Zhang, Duo Zhang, Anyang Peng, Mingyu Guo, Yuzhi Zhang, Lei Wang, Guolin Ke, Linfeng Zhang, Tiejun Li, Han Wang

TL;DR
This paper introduces multi-task fine-tuning (MFT) for atomistic models, which enhances out-of-distribution generalization by preserving chemical priors during adaptation, outperforming standard methods across benchmarks.
Contribution
The paper proposes MFT, a novel fine-tuning approach that jointly optimizes property prediction and a force-field objective to improve OOD robustness in atomistic models.
Findings
MFT consistently improves OOD generalization across benchmarks.
Standard fine-tuning causes representation collapse degrading OOD performance.
MFT approaches the theoretical limit of in-distribution accuracy for OOD tasks.
Abstract
Accurate de novo molecular and materials design requires structure-property models that generalize beyond known regimes. Although pretrained atomistic models achieve strong in-distribution accuracy after fine-tuning, their reliability under out-of-distribution (OOD) conditions remains unclear. We identify a critical failure mode in downstream adaptation: standard fine-tuning induces representation collapse, erasing pretrained chemical and structural priors and severely degrading OOD performance. To address this limitation, we propose multi-task fine-tuning (MFT), which jointly optimizes downstream property prediction with a physically grounded force-field objective inherited from pretraining. This approach preserves essential chemical priors while enabling task-specific adaptation. Across molecular and materials benchmarks, MFT consistently improves OOD generalization, approaching the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · Block Copolymer Self-Assembly
