Fast and Accurate Zero-Training Classification for Tabular Engineering   Data

Cyril Picard; Faez Ahmed

arXiv:2401.06948·cs.CE·January 17, 2024·1 cites

Fast and Accurate Zero-Training Classification for Tabular Engineering Data

Cyril Picard, Faez Ahmed

PDF

Open Access

TL;DR

This paper demonstrates that TabPFN, a transformer-based model trained on synthetic data, offers fast, accurate, and domain-agnostic classification for engineering design, eliminating the need for dataset-specific training.

Contribution

The paper introduces TabPFN as a pre-trained, zero-training classifier for tabular data, showing its superior speed and accuracy in engineering applications without domain-specific tuning.

Findings

01

TabPFN outperforms seven algorithms in speed and accuracy.

02

It is data-efficient and provides uncertainty estimates.

03

It requires no dataset-specific training.

Abstract

In engineering design, navigating complex decision-making landscapes demands a thorough exploration of the design, performance, and constraint spaces, often impeded by resource-intensive simulations. Data-driven methods can mitigate this challenge by harnessing historical data to delineate feasible domains, accelerate optimization, or evaluate designs. However, the implementation of these methods usually demands machine-learning expertise and multiple trials to choose the right method and hyperparameters. This makes them less accessible for numerous engineering situations. Additionally, there is an inherent trade-off between training speed and accuracy, with faster methods sometimes compromising precision. In our paper, we demonstrate that a recently released general-purpose transformer-based classification model, TabPFN, is both fast and accurate. Notably, it requires no…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Software Engineering Research