LLM-Inspired Pretrain-Then-Finetune for Small-Data, Large-Scale Optimization
Zishi Zhang, Jinhui Han, Ming Hu, Yijie Peng

TL;DR
This paper introduces a pretrain-then-finetune approach using Transformers for small-data, large-scale decision problems, leveraging synthetic domain-informed data and theoretical error analysis to improve operational decision-making.
Contribution
It proposes a novel Transformer-based pipeline with domain-specific design and provides the first comprehensive error analysis with nonasymptotic guarantees.
Findings
Pretraining injects domain knowledge and enables high-capacity models.
Fine-tuning adapts models to real data and improves accuracy.
Transfer learning effectiveness increases with more data.
Abstract
We consider small-data, large-scale decision problems in which a firm must make many operational decisions simultaneously (e.g., across a large product portfolio) while observing only a few, potentially noisy, data points per instance. Inspired by the success of large language models (LLMs), we propose a pretrain-then-finetune approach built on a designed Transformer model to address this challenge. The model is first pretrained on large-scale, domain-informed synthetic data that encode managerial knowledge and structural features of the decision environment, and is then fine-tuned on real observations. This new pipeline offers two complementary advantages: pretraining injects domain knowledge into the learning process and enables the training of high-capacity models using abundant synthetic data, while finetuning adapts the pretrained model to the operational environment and improves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Big Data and Business Intelligence · Big Data and Digital Economy
