Not Everything is All You Need: Toward Low-Redundant Optimization for   Large Language Model Alignment

Zhipeng Chen; Kun Zhou; Wayne Xin Zhao; Jingyuan Wang; Ji-Rong Wen

arXiv:2406.12606·cs.CL·October 4, 2024

Not Everything is All You Need: Toward Low-Redundant Optimization for Large Language Model Alignment

Zhipeng Chen, Kun Zhou, Wayne Xin Zhao, Jingyuan Wang, Ji-Rong Wen

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces ALLO, a low-redundant alignment method for large language models that selectively optimizes key neurons related to human preferences, improving alignment efficiency and performance.

Contribution

The paper proposes a novel neuron selection strategy and a two-stage learning process to reduce redundancy and enhance LLM alignment with human preferences.

Findings

01

Selective neuron optimization improves alignment performance.

02

Reducing redundant neurons accelerates convergence.

03

ALLO outperforms baseline methods on multiple datasets.

Abstract

Large language models (LLMs) are still struggling in aligning with human preference in complex tasks and scenarios. They are prone to overfit into the unexpected patterns or superficial styles in the training data. We conduct an empirical study that only selects the top-10\% most updated parameters in LLMs for alignment training, and see improvements in the convergence process and final performance. It indicates the existence of redundant neurons in LLMs for alignment training. To reduce its influence, we propose a low-redundant alignment method named \textbf{ALLO}, focusing on optimizing the most related neurons with the most useful supervised signals. Concretely, we first identify the neurons that are related to the human preference data by a gradient-based strategy, then identify the alignment-related key tokens by reward models for computing loss. Besides, we also decompose the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rucaibox/allo
pytorchOfficial

Videos

Not Everything is All You Need: Toward Low-Redundant Optimization for Large Language Model Alignment· underline

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling