LOGO -- Long cOntext aliGnment via efficient preference Optimization
Zecheng Tang, Zechen Sun, Juntao Li, Qiaoming Zhu, Min Zhang

TL;DR
LOGO introduces an efficient preference optimization training strategy that significantly improves long-context models' generation performance and context window size with minimal data and computational resources.
Contribution
The paper proposes LOGO, a novel training method using preference optimization and position synthesis to enhance long-context alignment efficiently.
Findings
LOGO enables a 8B model to match GPT-4's performance on long-context tasks.
Training with only 0.3B data on a single GPU, LOGO achieves high efficiency.
LOGO extends context window size while maintaining original task capabilities.
Abstract
Long-context models(LCMs) have shown great potential in processing long input sequences(even more than 100M tokens) conveniently and effectively. With significant progress, recent research has pointed out that LCMs can accurately locate token-level salient information within the context. Yet, the generation performance of these LCMs is far from satisfactory and might result in misaligned responses, such as hallucinations. To enhance the generation capability of LCMs, existing works have investigated the effects of data size and quality for both pre-training and instruction tuning. Though achieving meaningful improvement, previous methods fall short in either effectiveness or efficiency. In this paper, we introduce LOGO(Long cOntext aliGnment via efficient preference Optimization), a training strategy that first introduces preference optimization for long-context alignment. To overcome…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Analysis and Summarization · Advanced Image and Video Retrieval Techniques
MethodsAttention Is All You Need · Dense Connections · Label Smoothing · Byte Pair Encoding · Layer Normalization · Residual Connection · Linear Layer · Multi-Head Attention · Softmax · Adam
