Make Still Further Progress: Chain of Thoughts for Tabular Data Leaderboard
Si-Yang Liu, Qile Zhou, Han-Jia Ye

TL;DR
This paper introduces a novel in-context ensemble approach using large language models to dynamically integrate external model predictions for tabular data, achieving expert-level decision-making without raw feature access.
Contribution
The work presents Chain of Tabular Thoughts (CoT$^2$), a prompting strategy enabling LLMs to perform multi-step reasoning for instance-specific model aggregation in tabular prediction.
Findings
Outperforms traditional ensemble methods on various datasets
Enables instance-level adaptive model integration
Leverages LLMs for interpretable, multi-step reasoning
Abstract
Tabular data, a fundamental data format in machine learning, is predominantly utilized in competitions and real-world applications. The performance of tabular models--such as gradient boosted decision trees and neural networks--can vary significantly across datasets due to differences in feature distributions and task characteristics. Achieving top performance on each dataset often requires specialized expert knowledge. To address this variability, practitioners often aggregate the predictions of multiple models. However, conventional aggregation strategies typically rely on static combination rules and lack instance-level adaptability. In this work, we propose an in-context ensemble framework for tabular prediction that leverages large language models (LLMs) to perform dynamic, instance-specific integration of external model predictions. Without access to raw tabular features or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Ethics and Social Impacts of AI · Machine Learning and Data Classification
