Tabular foundation models for in-context prediction of molecular properties
Karim K. Ben Hicham, Jan G. Rittig, Martin Grohe, Alexander Mitsos

TL;DR
Tabular foundation models enable accurate, cost-effective molecular property prediction through in-context learning, outperforming classical methods without task-specific fine-tuning in low- to medium-data regimes.
Contribution
This work demonstrates the effectiveness of tabular foundation models for molecular property prediction, highlighting their advantages over traditional fine-tuning approaches and classical descriptors.
Findings
TFMs achieve up to 100% win rates on MoleculeACE tasks.
Combining TFMs with CheMeleon embeddings improves predictive performance.
Molecular representation choice significantly impacts TFM effectiveness.
Abstract
Accurate molecular property prediction is central to drug discovery, catalysis, and process design, yet real-world applications are often limited by small datasets. Molecular foundation models provide a promising direction by learning transferable molecular representations; however, they typically involve task-specific fine-tuning, require machine learning expertise, and often fail to outperform classical baselines. Tabular foundation models (TFMs) offer a fundamentally different paradigm: they perform predictions through in-context learning, enabling inference without task-specific training. Here, we evaluate TFMs in the low- to medium-data regime across both standardized pharmaceutical benchmarks and chemical engineering datasets. We evaluate both frozen molecular foundation model representations, as well as classical descriptors and fingerprints. Across the benchmarks, the approach…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
