I-SHEEP: Self-Alignment of LLM from Scratch through an Iterative Self-Enhancement Paradigm

Yiming Liang; Ge Zhang; Xingwei Qu; Tianyu Zheng; Jiawei Guo; Xinrun Du; Zhenzhu Yang; Jiaheng Liu; Chenghua Lin; Lei Ma; Wenhao Huang; Jiajun Zhang

arXiv:2408.08072·cs.CL·December 18, 2024

TL;DR

I-SHEEP introduces an iterative self-enhancement paradigm enabling LLMs to self-align continuously from scratch, significantly improving their performance across multiple benchmarks compared to one-time alignment methods.

Contribution

This paper presents I-SHEEP, a novel human-like iterative self-alignment approach for LLMs that surpasses prior one-time alignment techniques in various tasks.

Findings

01. Achieves up to 78.2% improvement on AlpacaEval

02. Surpasses base models on code generation, TriviaQA, and SQuAD tasks

03. Demonstrates continuous capacity enhancement over iterations

Abstract

Large Language Models (LLMs) have achieved significant advancements; however, the common learning paradigm treats LLMs as passive information repositories, neglecting their potential for active learning and alignment. Some approaches train LLMs on their own generated synthetic data, exploring the possibility of active alignment. However, a large gap remains between these one-time alignment methods and the continuous, automatic alignment of humans. In this paper, we introduce I-SHEEP, an Iterative Self-EnHancEmEnt Paradigm. This human-like paradigm enables LLMs to continuously self-align from scratch, starting from nothing. Compared to the one-time alignment method Dromedary (Sun et al., 2023), which corresponds to the first iteration in this paper, I-SHEEP can significantly enhance capacities on both Qwen and…
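The contrast the abstract draws between one-time and iterative alignment can be illustrated with a small control-flow sketch. The abstract does not spell out the individual steps, so the loop below assumes a generic synthesize / self-score / filter / fine-tune cycle; `ToyModel`, `iterate_self_alignment`, and the three callables are hypothetical names introduced only for illustration and are not taken from the paper or its repository. Whether each round fine-tunes the previous model or restarts from the base model is also an assumption here.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple

Pair = Tuple[str, str]  # (instruction, response), both self-generated


@dataclass
class ToyModel:
    """Hypothetical stand-in for an LLM; `seen` records its tuning data."""
    name: str
    seen: List[Pair] = field(default_factory=list)


def iterate_self_alignment(
    base: ToyModel,
    synthesize: Callable[[ToyModel], List[Pair]],
    self_score: Callable[[ToyModel, Pair], float],
    fine_tune: Callable[[ToyModel, List[Pair]], ToyModel],
    rounds: int,
    threshold: float,
) -> List[ToyModel]:
    """Return the model after every round; rounds=1 is one-time alignment."""
    models = [base]
    for _ in range(rounds):
        current = models[-1]
        candidates = synthesize(current)              # model writes its own data
        kept = [p for p in candidates
                if self_score(current, p) >= threshold]  # model filters its own data
        models.append(fine_tune(current, kept))       # assumed: tune the latest model
    return models


# Toy stand-ins so the loop can run without any real model.
def toy_synthesize(model: ToyModel) -> List[Pair]:
    n = len(model.seen)
    return [(f"task {n + i}", "x" * (n + i)) for i in range(4)]


def toy_score(model: ToyModel, pair: Pair) -> float:
    return float(len(pair[1]))  # longer toy responses score higher


def toy_fine_tune(model: ToyModel, data: List[Pair]) -> ToyModel:
    return ToyModel(name=model.name + "+", seen=model.seen + data)


history = iterate_self_alignment(
    ToyModel("base"), toy_synthesize, toy_score, toy_fine_tune,
    rounds=3, threshold=2.0,
)
```

In this sketch, `history[1]` plays the role of a one-time aligned model, and later entries are the further iterations that the paradigm adds on top of it.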

Code & Models

Repositories

multimodal-art-projection/I-SHEEP (Official)

Taxonomy

Topics: Digital Rights Management and Security · Semantic Web and Ontologies

Methods: LLaMA · Balanced Selection