When Does Context Help? A Systematic Study of Target-Conditional Molecular Property Prediction
Bryan Cheng, Jasper Zhang

TL;DR
This study systematically evaluates when and how context conditioning improves molecular property prediction across diverse datasets and architectures, revealing key factors influencing success and failure.
Contribution
It introduces a comprehensive analysis of context conditioning methods, highlighting the importance of architecture choice and data regimes, and exposes flaws in standard benchmarking practices.
Findings
Fusion architecture significantly outperforms simpler methods.
Context enables predictions in data-scarce scenarios.
Distribution mismatch can cause systematic performance degradation.
Abstract
We present the first systematic study of when target context helps molecular property prediction, evaluating context conditioning across 10 diverse protein families, 4 fusion architectures, data regimes spanning 67-9,409 training compounds, and both temporal and random evaluation splits. Using NestDrug, a FiLM-based architecture that conditions molecular representations on target identity, we characterize both success and failure modes with three principal findings. First, fusion architecture dominates: FiLM outperforms concatenation by 24.2 percentage points and additive conditioning by 8.6 pp; how you incorporate context matters more than whether you include it. Second, context enables otherwise impossible predictions: on data-scarce CYP3A4 (67 training compounds), multi-task transfer achieves 0.686 AUC where per-target Random Forest collapses to 0.238. Third, context can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
