Tabular foundation models for in-context prediction of molecular properties

Karim K. Ben Hicham; Jan G. Rittig; Martin Grohe; Alexander Mitsos

arXiv:2604.16123 · cs.LG · April 21, 2026

TL;DR

Tabular foundation models predict molecular properties through in-context learning, with no task-specific fine-tuning, and outperform classical methods in low- to medium-data regimes while remaining cost-effective.

Contribution

This work evaluates tabular foundation models for molecular property prediction, using both frozen molecular foundation model representations and classical descriptors and fingerprints as inputs, and highlights their advantages over approaches that require task-specific fine-tuning.

Findings

01. TFMs achieve win rates of up to 100% on MoleculeACE tasks.

02. Combining TFMs with CheMeleon embeddings improves predictive performance.

03. Molecular representation choice significantly impacts TFM effectiveness.
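
The first finding is stated as a win rate, which this summary does not define. A minimal sketch of one common reading follows, assuming a win rate is the fraction of tasks on which a method reaches a strictly lower error than a baseline, with ties counted as non-wins. The function name `win_rate`, the task names, and all error values are hypothetical placeholders for illustration, not results from the paper.

```python
from typing import Mapping


def win_rate(candidate: Mapping[str, float], baseline: Mapping[str, float]) -> float:
    """Fraction of shared tasks on which the candidate has strictly lower error.

    Assumption: lower is better (e.g. RMSE) and ties count as non-wins.
    """
    tasks = sorted(set(candidate) & set(baseline))
    if not tasks:
        raise ValueError("no tasks in common")
    wins = sum(candidate[task] < baseline[task] for task in tasks)
    return wins / len(tasks)


# Placeholder per-task errors for three hypothetical tasks; NOT values from the paper.
tfm_errors = {"task_a": 0.61, "task_b": 0.70, "task_c": 0.55}
baseline_errors = {"task_a": 0.66, "task_b": 0.68, "task_c": 0.59}

print(f"win rate: {win_rate(tfm_errors, baseline_errors):.0%}")  # prints "win rate: 67%"
```

Under this reading, a 100% win rate means the method beats the baseline on every task in the comparison; it says nothing about the size of the margin on any single task.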

Abstract

Accurate molecular property prediction is central to drug discovery, catalysis, and process design, yet real-world applications are often limited by small datasets. Molecular foundation models provide a promising direction by learning transferable molecular representations; however, they typically involve task-specific fine-tuning, require machine learning expertise, and often fail to outperform classical baselines. Tabular foundation models (TFMs) offer a fundamentally different paradigm: they perform predictions through in-context learning, enabling inference without task-specific training. Here, we evaluate TFMs in the low- to medium-data regime across both standardized pharmaceutical benchmarks and chemical engineering datasets. We evaluate both frozen molecular foundation model representations and classical descriptors and fingerprints. Across the benchmarks, the approach…
