COrAL: Order-Agnostic Language Modeling for Efficient Iterative   Refinement

Yuxi Xie; Anirudh Goyal; Xiaobao Wu; Xunjian Yin; Xiao Xu; Min-Yen; Kan; Liangming Pan; William Yang Wang

arXiv:2410.09675·cs.CL·October 15, 2024

COrAL: Order-Agnostic Language Modeling for Efficient Iterative Refinement

Yuxi Xie, Anirudh Goyal, Xiaobao Wu, Xunjian Yin, Xiao Xu, Min-Yen, Kan, Liangming Pan, William Yang Wang

PDF

Open Access 1 Repo

TL;DR

COrAL introduces an order-agnostic, architecture-integrated iterative refinement method for large language models, significantly improving reasoning accuracy and inference speed while enabling parallel output refinement.

Contribution

It proposes a novel order-agnostic language modeling architecture with sliding blockwise decoding for efficient parallel iterative refinement.

Findings

01

Achieves 4.6% accuracy improvement on GSM8K

02

Realizes up to 3.9x inference speedup

03

Demonstrates promising results on reasoning tasks

Abstract

Iterative refinement has emerged as an effective paradigm for enhancing the capabilities of large language models (LLMs) on complex tasks. However, existing approaches typically implement iterative refinement at the application or prompting level, relying on autoregressive (AR) modeling. The sequential token generation in AR models can lead to high inference latency. To overcome these challenges, we propose Context-Wise Order-Agnostic Language Modeling (COrAL), which incorporates iterative refinement directly into the LLM architecture while maintaining computational efficiency. Our approach models multiple token dependencies within manageable context windows, enabling the model to perform iterative refinement internally during the generation process. Leveraging the order-agnostic nature of COrAL, we introduce sliding blockwise order-agnostic decoding, which performs multi-token forward…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yuxixie/coral
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Handwritten Text Recognition Techniques

MethodsCorrelation Alignment for Deep Domain Adaptation