Self-QA: Unsupervised Knowledge Guided Language Model Alignment

Xuanyu Zhang; Qing Yang

arXiv:2305.11952·cs.CL·May 23, 2023·5 cites

Self-QA: Unsupervised Knowledge Guided Language Model Alignment

Xuanyu Zhang, Qing Yang

PDF

Open Access 1 Repo 1 Models

TL;DR

Self-QA is an unsupervised framework that leverages knowledge to generate large-scale, domain-specific instruction data for language model alignment, reducing reliance on human-annotated datasets.

Contribution

We introduce Self-QA, a novel unsupervised approach that replaces human instruction seeds with knowledge-based data to improve language model instruction tuning.

Findings

01

Self-QA produces high-quality, domain-specific instruction data.

02

The method reduces the need for extensive human annotation.

03

Experiments show improved model alignment with less supervision.

Abstract

Large-scale language models like ChatGPT and GPT-4 have gained attention for their impressive conversational and generative capabilities. However, the creation of supervised paired question-answering data for instruction tuning presents formidable challenges. This endeavor necessitates substantial human effort for data annotation and wrestles with issues concerning data quality, diversity, accuracy, and other related factors. To overcome these obstacles, we introduce an innovative framework named Self-QA, which replaces the traditional practice of human-written instruction seeds with a vast amount of unsupervised knowledge, enabling the model to generate a larger quantity of correct and domain-specific instruction data. The effectiveness of our proposed method is demonstrated through experiments conducted on unsupervised corpora from various domains.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

duxiaoman-di/xuanyuan
pytorch

Models

🤗
xyz-nlp/XuanYuan2.0
model· ♡ 150
♡ 150

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

MethodsAttention Is All You Need · Absolute Position Encodings · Softmax · Layer Normalization · Byte Pair Encoding · Dropout · Linear Layer · Label Smoothing · Multi-Head Attention · Adam