Rethinking Molecular OOD Generalization via Target-Aware Source Selection

Zhuohao Lin; Kun Li; Jiameng Chen; Jiajun Yu; Duanhua Cao; Yizhen Zheng; Wenbin Hu

arXiv:2605.13932·cs.LG·May 15, 2026

TL;DR

This paper introduces SCOPE-BENCH, a benchmark for molecular OOD evaluation, and POMA, a policy-based framework for source selection and domain adaptation that reduces mean absolute error by up to 11.2%, improving prediction robustness in drug discovery.

Contribution

It proposes a novel benchmark and a reinforcement learning-based source selection framework to enhance molecular property prediction under extreme OOD conditions.

Findings

01. Prediction errors increase up to 8.0x on SCOPE-BENCH for state-of-the-art models.

02. POMA reduces mean absolute error by up to 11.2%.

03. Average relative improvement of 6.2% across backbone architectures.
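
The findings above are stated as an error inflation factor and as relative reductions in mean absolute error (MAE). As a reading aid, the sketch below shows how such figures are conventionally computed. The function names and all numeric inputs are hypothetical illustrations, not values or code from the paper, and the paper's exact evaluation protocol may differ.

```python
def mae(y_true, y_pred):
    """Mean absolute error between two equal-length sequences."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def error_inflation(in_dist_mae, ood_mae):
    """How many times larger the OOD error is than the in-distribution error."""
    return ood_mae / in_dist_mae


def relative_reduction(baseline_mae, new_mae):
    """Fraction of the baseline MAE removed by the new method."""
    return (baseline_mae - new_mae) / baseline_mae


# Hypothetical numbers, chosen only to illustrate the arithmetic.
inflation = error_inflation(in_dist_mae=0.2, ood_mae=1.6)
reduction = relative_reduction(baseline_mae=0.50, new_mae=0.45)
print(f"inflation: {inflation:.1f}x, relative MAE reduction: {reduction:.1%}")
```

Under this convention, an "up to" figure would be the maximum over tasks or backbones, while an "average relative improvement" would be the mean of the per-backbone reductions.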

Abstract

Robust prediction of molecular properties under extreme out-of-distribution (OOD) scenarios is a pivotal bottleneck in AI-driven drug discovery. Current scaffold-splitting protocols fail to obstruct microscopic semantic overlap, predisposing models to shortcut learning and overestimating their true extrapolation capability; meanwhile, conventional domain adaptation paradigms suffer under extreme structural shifts, as blindly aligning heterogeneous source libraries injects topological noise and triggers negative transfer. To address these two challenges, scaffold-cluster out-of-distribution performance evaluation benchmark (SCOPE-BENCH), a benchmark built on cluster-level partitioning in an explicit physicochemical descriptor space, is proposed alongside policy optimization for multi-source adaptation (POMA), a framework that formulates knowledge transfer as a retrieve-compose-adapt…
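
To make the idea of cluster-level partitioning in a descriptor space concrete, here is a minimal sketch. It assumes each molecule is already represented by a numeric descriptor vector, clusters those vectors with plain k-means, and holds out entire clusters as the test set. This is an illustration of the general technique only: the clustering algorithm, the choice of which clusters to hold out, and every name below are assumptions, not the SCOPE-BENCH construction.

```python
import math
import random


def standardize(rows):
    """Z-score each descriptor column so no single descriptor dominates distances."""
    cols = list(zip(*rows))
    means = [sum(c) / len(c) for c in cols]
    stds = [math.sqrt(sum((v - m) ** 2 for v in c) / len(c)) or 1.0
            for c, m in zip(cols, means)]
    return [[(v - m) / s for v, m, s in zip(row, means, stds)] for row in rows]


def kmeans(points, k, n_iter=50, seed=0):
    """Plain Lloyd's k-means; returns one cluster label per point."""
    rng = random.Random(seed)
    centers = [list(p) for p in rng.sample(points, k)]
    labels = [0] * len(points)
    for _ in range(n_iter):
        # Assign each point to its nearest center.
        labels = [min(range(k), key=lambda j: math.dist(p, centers[j]))
                  for p in points]
        # Move each center to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # an emptied cluster keeps its old center
                centers[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def cluster_split(descriptors, k=4, n_test_clusters=1, seed=0):
    """Hold out whole clusters so train and test never share a cluster."""
    X = standardize(descriptors)
    labels = kmeans(X, k, seed=seed)

    def centroid_norm(c):
        members = [x for x, lab in zip(X, labels) if lab == c]
        centroid = [sum(col) / len(members) for col in zip(*members)]
        return math.sqrt(sum(v * v for v in centroid))

    # Assumption: hold out the clusters farthest from the global mean
    # (the origin after standardization) as the most "extreme" regions.
    ranked = sorted(set(labels), key=centroid_norm)
    held_out = set(ranked[-n_test_clusters:])
    train = [i for i, lab in enumerate(labels) if lab not in held_out]
    test = [i for i, lab in enumerate(labels) if lab in held_out]
    return train, test, labels


# Usage with synthetic descriptor vectors (hypothetical data).
rng = random.Random(1)
descriptors = [[rng.gauss(0, 1) for _ in range(4)] for _ in range(60)]
train_idx, test_idx, labels = cluster_split(descriptors, k=4)
print(len(train_idx), "train molecules,", len(test_idx), "test molecules")
```

Because the split is made on cluster membership rather than on individual molecules, no test molecule shares a descriptor-space cluster with any training molecule, which is the property a cluster-level OOD split is meant to guarantee.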

Code & Models

Repositories

https://anonymous.4open.science/r/Molecular-OOD-Code-73F6