Tabular foundation models for robust calibration of near-infrared chemical sensing data
Robin Reiter, Denis Cornet, Fabien Michel, Lauriane Rouan, Gregory Beurier

TL;DR
This study evaluates tabular foundation models for calibrating near-infrared spectroscopy data, demonstrating their competitive performance and potential to complement traditional chemometric methods in chemical sensing applications.
Contribution
The paper introduces the use of tabular foundation models for NIR calibration, benchmarking their performance against traditional methods across diverse datasets.
Findings
TabPFN outperforms traditional models in regression tasks.
TabPFN applied directly to raw spectra performs well in classification.
Classical models remain competitive on spectral outliers and extrapolation.
Abstract
Near-infrared spectroscopy is increasingly used as a rapid, non-destructive chemical sensing technology for the analysis of food, pharmaceutical, biological, and environmental samples. However, the practical deployment of NIR sensors still depends on calibration models able to handle high-dimensional, collinear spectra, limited sample sizes, preprocessing dependence, spectral outliers, and extrapolation beyond the calibration domain. Here, we evaluate whether tabular foundation models can provide a new calibration strategy for NIR chemical sensing. We benchmark TabPFN on 66 NIR datasets covering 54 regression and 12 classification tasks, and compare direct inference on raw spectra with preprocessing-optimized inference against PLS/PLS-DA, Ridge, Catboost, and one-dimensional convolutional neural networks. The study uses a unified validation framework in which preprocessing and model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
