BoostLLM: Boosting-inspired LLM Fine-tuning for Few-shot Tabular Classification

Yi-Siang Wang; Kuan-Yu Chen; Yu-Chen Den; Darby Tien-Hao Chang

arXiv:2605.06117·cs.LG·May 12, 2026


TL;DR

BoostLLM introduces a boosting-inspired fine-tuning framework for LLMs that enhances performance in low-data tabular classification by integrating decision-tree paths and residual optimization.

Contribution

This work applies the boosting paradigm to LLM fine-tuning: it turns parameter-efficient training into a multi-round residual process and incorporates a structured tabular inductive bias.
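To make the idea of a multi-round residual process concrete, here is a minimal sketch of generic gradient boosting on logits for binary classification. It is an illustration under stated assumptions, not the paper's implementation: each weak learner is a tiny linear model standing in for a PEFT adapter, and the function names, the `shrinkage` parameter, and the least-squares fit to the residual `y - sigmoid(logit)` are all hypothetical choices borrowed from standard boosting.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def fit_weak_learner(X, residuals, steps=200, lr=0.1):
    """Fit a small linear model to the residuals by gradient descent on squared error.

    Assumption: this stands in for training one PEFT adapter per round.
    """
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(steps):
        grad_w, grad_b = [0.0] * d, 0.0
        for x, r in zip(X, residuals):
            err = sum(wi * xi for wi, xi in zip(w, x)) + b - r
            for j in range(d):
                grad_w[j] += err * x[j] / n
            grad_b += err / n
        w = [wi - lr * g for wi, g in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b


def predict_weak(model, x):
    w, b = model
    return sum(wi * xi for wi, xi in zip(w, x)) + b


def boost(X, y, rounds=5, shrinkage=0.5):
    """Train weak learners sequentially, each on the residual left by the ensemble so far."""
    logits = [0.0] * len(X)
    ensemble = []
    for _ in range(rounds):
        # Negative gradient of the log loss with respect to the current logits.
        residuals = [yi - sigmoid(z) for yi, z in zip(y, logits)]
        model = fit_weak_learner(X, residuals)
        ensemble.append(model)
        logits = [z + shrinkage * predict_weak(model, x) for z, x in zip(logits, X)]
    return ensemble


def predict_proba(ensemble, x, shrinkage=0.5):
    return sigmoid(sum(shrinkage * predict_weak(m, x) for m in ensemble))


def log_loss(ensemble, X, y, shrinkage=0.5):
    total = 0.0
    for x, yi in zip(X, y):
        p = predict_proba(ensemble, x, shrinkage)
        total -= yi * math.log(p) + (1 - yi) * math.log(1 - p)
    return total / len(X)
```

In this sketch, calling `boost(X, y, rounds=5)` on a small labeled table returns a list of weak learners whose shrunken outputs are summed into a single logit. Adding rounds lowers the training loss on a toy separable dataset, which is the behavior the boosting analogy relies on.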

Findings

01. BoostLLM consistently outperforms standard fine-tuning across multiple datasets.

02. It matches or surpasses XGBoost in low-data regimes.

03. Scaling BoostLLM with stronger models and more boosting rounds improves results.

Abstract

Large language models (LLMs) have recently been adapted to tabular prediction by serializing structured features into natural language, but their performance in low-data regimes remains limited compared to gradient-boosted decision trees (GBDTs). In this work, we revisit the boosting paradigm, traditionally associated with tree ensembles, and ask whether it can be applied as a general training principle for LLM fine-tuning. We propose BoostLLM, a framework that transforms parameter-efficient fine-tuning into a multi-round residual optimization process by training sequential PEFT adapters as weak learners. To incorporate tabular inductive bias, BoostLLM integrates decision-tree paths as a second input view alongside raw features; analysis reveals that the path view acts as a structured teacher in early training steps before the model shifts toward feature-driven representations.…
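The two input views mentioned in the abstract can be pictured with a short sketch. The excerpt does not give the prompt template or say how paths are extracted, so everything below is an assumption made for illustration: the sentence template, the nested-dict tree format, the `" AND "` joiner, and the function names are all hypothetical.

```python
def serialize_features(row):
    """Feature view: turn a row (column name -> value) into plain sentences."""
    return " ".join(f"The {name} is {value}." for name, value in row.items())


def tree_path(tree, row):
    """Walk a binary threshold tree and collect the conditions the row satisfies."""
    conditions = []
    node = tree
    while "leaf" not in node:
        feature, threshold = node["feature"], node["threshold"]
        if row[feature] <= threshold:
            conditions.append(f"{feature} <= {threshold}")
            node = node["left"]
        else:
            conditions.append(f"{feature} > {threshold}")
            node = node["right"]
    return conditions


def build_views(tree, row):
    """Return both views of one row: raw features and the decision-tree path."""
    return {
        "feature_view": serialize_features(row),
        "path_view": " AND ".join(tree_path(tree, row)),
    }


# Hypothetical hand-written tree; in practice it would come from a fitted tree model.
TOY_TREE = {
    "feature": "age",
    "threshold": 40,
    "left": {"leaf": 0},
    "right": {
        "feature": "income",
        "threshold": 50000,
        "left": {"leaf": 0},
        "right": {"leaf": 1},
    },
}

views = build_views(TOY_TREE, {"age": 52, "income": 61000})
print(views["feature_view"])  # The age is 52. The income is 61000.
print(views["path_view"])     # age > 40 AND income > 50000
```

The path view restates the same row as the sequence of splits it passes through, which is one plausible way for a serialized input to carry tree-style structure alongside the raw feature text.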
