Sample Design Engineering: An Empirical Study of What Makes Good Downstream Fine-Tuning Samples for LLMs
Biyang Guo, He Wang, Wenyilin Xiao, Hong Chen, Zhuxin Lee, Songqiao, Han, Hailiang Huang

TL;DR
This paper introduces Sample Design Engineering (SDE), a systematic approach to improve downstream fine-tuning of LLMs by optimizing input, output, and reasoning samples, demonstrating its effectiveness across various tasks.
Contribution
It is the first comprehensive study on sample design for downstream fine-tuning of LLMs, providing an effective SDE strategy validated through extensive experiments.
Findings
SDE consistently improves LLM performance on complex tasks
Effective sample design patterns are identified and validated
Good prompt engineering does not always equate to good sample design
Abstract
In the burgeoning field of Large Language Models (LLMs) like ChatGPT and LLaMA, Prompt Engineering (PE) is renowned for boosting zero-shot or in-context learning (ICL) through prompt modifications. Yet, the realm of the sample design for downstream fine-tuning, crucial for task-specific LLM adaptation, is largely unexplored. This paper introduces Sample Design Engineering (SDE), a methodical approach to enhancing LLMs' post-tuning performance by refining input, output, and reasoning designs. We conduct a series of in-domain (ID) and out-of-domain (OOD) experiments to assess the impact of various design options on LLMs' downstream performance, revealing several intriguing patterns that hold consistently across different LLMs. Based on these insights, we propose an integrated SDE strategy, combining the most effective options, and validate its consistent superiority over heuristic sample…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Materials Characterization Techniques · Mineral Processing and Grinding
MethodsLLaMA
