Learning from "Silly" Questions Improves Large Language Models, But Only Slightly
Tingyuan Zhu, Shudong Liu, Yidong Wang, Derek F. Wong, Han, Yu, Takahiro Shinozaki, Jindong Wang

TL;DR
This study investigates how incorporating rules derived from analyzing 'silly' questions on Ruozhiba can slightly enhance large language model fine-tuning, revealing mixed effects depending on task and rule applied.
Contribution
The paper introduces a method to analyze and apply educational and cognitive science rules to improve SFT datasets, with large-scale evaluation on MMLU tasks.
Findings
Rules can improve performance on some tasks by around 5%.
Certain rules cause performance drops up to 6.14%.
Effectiveness of rules varies across different tasks.
Abstract
Constructing high-quality Supervised Fine-Tuning (SFT) datasets is critical for the training of large language models (LLMs). Recent studies have shown that using data from a specific source, Ruozhiba, a Chinese website where users ask "silly" questions to better understand certain topics, can lead to better fine-tuning performance. This paper aims to explore some hidden factors: the potential interpretations of its success and a large-scale evaluation of the performance. First, we leverage GPT-4 to analyze the successful cases of Ruozhiba questions from the perspective of education, psychology, and cognitive science, deriving a set of explanatory rules. Then, we construct fine-tuning datasets by applying these rules to the MMLU training set. Surprisingly, our results indicate that rules can significantly improve model performance in certain tasks, while potentially diminishing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
MethodsAttention Is All You Need · Dense Connections · Label Smoothing · Layer Normalization · Dropout · Adam · Residual Connection · Byte Pair Encoding · Linear Layer · Softmax
