Learning from "Silly" Questions Improves Large Language Models, But Only   Slightly

Tingyuan Zhu; Shudong Liu; Yidong Wang; Derek F. Wong; Han; Yu; Takahiro Shinozaki; Jindong Wang

arXiv:2411.14121·cs.CL·November 22, 2024

Learning from "Silly" Questions Improves Large Language Models, But Only Slightly

Tingyuan Zhu, Shudong Liu, Yidong Wang, Derek F. Wong, Han, Yu, Takahiro Shinozaki, Jindong Wang

PDF

Open Access

TL;DR

This study investigates how incorporating rules derived from analyzing 'silly' questions on Ruozhiba can slightly enhance large language model fine-tuning, revealing mixed effects depending on task and rule applied.

Contribution

The paper introduces a method to analyze and apply educational and cognitive science rules to improve SFT datasets, with large-scale evaluation on MMLU tasks.

Findings

01

Rules can improve performance on some tasks by around 5%.

02

Certain rules cause performance drops up to 6.14%.

03

Effectiveness of rules varies across different tasks.

Abstract

Constructing high-quality Supervised Fine-Tuning (SFT) datasets is critical for the training of large language models (LLMs). Recent studies have shown that using data from a specific source, Ruozhiba, a Chinese website where users ask "silly" questions to better understand certain topics, can lead to better fine-tuning performance. This paper aims to explore some hidden factors: the potential interpretations of its success and a large-scale evaluation of the performance. First, we leverage GPT-4 to analyze the successful cases of Ruozhiba questions from the perspective of education, psychology, and cognitive science, deriving a set of explanatory rules. Then, we construct fine-tuning datasets by applying these rules to the MMLU training set. Surprisingly, our results indicate that rules can significantly improve model performance in certain tasks, while potentially diminishing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling

MethodsAttention Is All You Need · Dense Connections · Label Smoothing · Layer Normalization · Dropout · Adam · Residual Connection · Byte Pair Encoding · Linear Layer · Softmax