UniICL: An Efficient Unified Framework Unifying Compression, Selection, and Generation

Jun Gao; Qi Lv; Zili Wang; Tianxiang Wu; Ziqiang Cao; Wenjie Li

arXiv:2405.17062·cs.CL·May 27, 2025

UniICL: An Efficient Unified Framework Unifying Compression, Selection, and Generation

Jun Gao, Qi Lv, Zili Wang, Tianxiang Wu, Ziqiang Cao, Wenjie Li

PDF

Open Access

TL;DR

UniICL introduces a unified framework that combines demonstration compression, selection, and response generation to improve the efficiency and effectiveness of in-context learning in large language models.

Contribution

It proposes a novel unified approach that integrates demonstration compression, selection, and generation, with a caching strategy to enhance inference efficiency.

Findings

01

Outperforms existing methods in effectiveness on out-of-domain tasks.

02

Reduces computational cost through demonstration caching.

03

Maintains high performance with compressed demonstrations.

Abstract

In-context learning (ICL) enhances the reasoning abilities of Large Language Models (LLMs) by prepending a few demonstrations. It motivates researchers to introduce more examples to provide additional contextual information for the generation. However, existing methods show a significant limitation due to the problem of excessive growth in context length, which causes a large hardware burden. In addition, shallow-relevant examples selected by off-the-shelf tools hinder LLMs from capturing useful contextual information for generation. In this paper, we propose \textbf{UniICL}, a novel \textbf{Uni}fied \textbf{ICL} framework that unifies demonstration compression, demonstration selection, and final response generation. Furthermore, to boost inference efficiency, we design a tailored compression strategy that allows UniICL to cache compression results into \textbf{Demonstration Bank}…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Stream Mining Techniques · Anomaly Detection Techniques and Applications · Machine Learning and Data Classification