Diverse and Fine-Grained Instruction-Following Ability Exploration with Synthetic Data
Zihui Gu, Xingwu Sun, Fengzong Lian, Zhanhui Kang, Cheng-Zhong Xu, Ju, Fan

TL;DR
This paper introduces DINGO, a comprehensive evaluation dataset with diverse, fine-grained instructions derived from real-world requests, to better assess and improve large language models' instruction-following capabilities.
Contribution
The paper presents DINGO, a novel dataset with multi-level categories and diverse instructions generated by GPT-4 and humans, addressing evaluation shortcomings in existing methods.
Findings
DINGO offers more challenging evaluation scenarios.
It enables fine-grained, task-level assessment of LLMs.
Experiments show DINGO improves understanding of LLM capabilities.
Abstract
Instruction-following is particularly crucial for large language models (LLMs) to support diverse user requests. While existing work has made progress in aligning LLMs with human preferences, evaluating their capabilities on instruction following remains a challenge due to complexity and diversity of real-world user instructions. While existing evaluation methods focus on general skills, they suffer from two main shortcomings, i.e., lack of fine-grained task-level evaluation and reliance on singular instruction expression. To address these problems, this paper introduces DINGO, a fine-grained and diverse instruction-following evaluation dataset that has two main advantages: (1) DINGO is based on a manual annotated, fine-grained and multi-level category tree with 130 nodes derived from real-world user requests; (2) DINGO includes diverse instructions, generated by both GPT-4 and human…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEducational Technology and Assessment · Mobile Learning in Education · Education Methods and Technologies
MethodsAttention Is All You Need · Linear Layer · Multi-Head Attention · Softmax · Residual Connection · Byte Pair Encoding · Layer Normalization · Label Smoothing · Adam · Dropout
