LOGO -- Long cOntext aliGnment via efficient preference Optimization

Zecheng Tang; Zechen Sun; Juntao Li; Qiaoming Zhu; Min Zhang

arXiv:2410.18533·cs.CL·October 25, 2024

LOGO -- Long cOntext aliGnment via efficient preference Optimization

Zecheng Tang, Zechen Sun, Juntao Li, Qiaoming Zhu, Min Zhang

PDF

Open Access 1 Repo

TL;DR

LOGO introduces an efficient preference optimization training strategy that significantly improves long-context models' generation performance and context window size with minimal data and computational resources.

Contribution

The paper proposes LOGO, a novel training method using preference optimization and position synthesis to enhance long-context alignment efficiently.

Findings

01

LOGO enables a 8B model to match GPT-4's performance on long-context tasks.

02

Training with only 0.3B data on a single GPU, LOGO achieves high efficiency.

03

LOGO extends context window size while maintaining original task capabilities.

Abstract

Long-context models(LCMs) have shown great potential in processing long input sequences(even more than 100M tokens) conveniently and effectively. With significant progress, recent research has pointed out that LCMs can accurately locate token-level salient information within the context. Yet, the generation performance of these LCMs is far from satisfactory and might result in misaligned responses, such as hallucinations. To enhance the generation capability of LCMs, existing works have investigated the effects of data size and quality for both pre-training and instruction tuning. Though achieving meaningful improvement, previous methods fall short in either effectiveness or efficiency. In this paper, we introduce LOGO(Long cOntext aliGnment via efficient preference Optimization), a training strategy that first introduces preference optimization for long-context alignment. To overcome…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zetangforward/lcm_stack
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Analysis and Summarization · Advanced Image and Video Retrieval Techniques

MethodsAttention Is All You Need · Dense Connections · Label Smoothing · Byte Pair Encoding · Layer Normalization · Residual Connection · Linear Layer · Multi-Head Attention · Softmax · Adam