DoG-Instruct: Towards Premium Instruction-Tuning Data via Text-Grounded   Instruction Wrapping

Yongrui Chen; Haiyun Jiang; Xinting Huang; Shuming Shi; Guilin Qi

arXiv:2309.05447·cs.CL·May 28, 2024

DoG-Instruct: Towards Premium Instruction-Tuning Data via Text-Grounded Instruction Wrapping

Yongrui Chen, Haiyun Jiang, Xinting Huang, Shuming Shi, Guilin Qi

PDF

Open Access 1 Repo 1 Models 1 Video

TL;DR

This paper introduces DoG-Instruct, a scalable method for generating high-quality instruction-response pairs by wrapping human-written documents with an LLM, significantly improving instruction-following performance while reducing hallucinations.

Contribution

It presents a novel instruction wrapping technique that leverages human documents and LLMs to produce high-quality data, outperforming existing methods on multiple benchmarks.

Findings

01

10% performance improvement on AlpacaEval

02

Uses only 1/5 of the training data of baseline

03

Manual evaluation confirms data quality

Abstract

The improvement of LLMs' instruction-following capabilities relies heavily on the availability of high-quality instruction-response pairs. Unfortunately, the current methods used to collect the pairs suffer from either unaffordable labor costs or severe hallucinations in the self-generation of LLM. To tackle these challenges, this paper proposes a scalable solution. It involves training LLMs to generate instruction-response pairs based on human-written documents, rather than relying solely on self-generation without context. Our proposed method not only exploits the advantages of human-written documents in reducing hallucinations but also utilizes an LLM to wrap the expression of documents, which enables us to bridge the gap between various document styles and the standard AI response. Experiments demonstrate that our method outperforms existing typical methods on multiple benchmarks.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

bahuia/dog-instruct
noneOfficial

Models

🤗
bahuia/dog-instruct-wrapper-7b-lora
model· ♡ 3
♡ 3

Videos

DoG-Instruct: Towards Premium Instruction-Tuning Data via Text-Grounded Instruction Wrapping· underline

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification