Taking a Deep Breath: Enhancing Language Modeling of Large Language Models with Sentinel Tokens

Weiyao Luo; Suncong Zheng; Heming Xia; Weikang Wang; Yan Lei; Tianyu Liu; Shuang Chen; Zhifang Sui

arXiv:2406.10985 · cs.CL · March 23, 2026

TL;DR

This paper introduces a method to improve large language models' ability to handle long contexts by using sentinel tokens to summarize and integrate chunked text information, enhancing performance on language modeling and downstream tasks.

Contribution

The paper proposes a novel chunking and sentinel token mechanism that enables LLMs to better capture long-term dependencies without increasing computational costs.

Findings

01 Improved language modeling performance on long texts

02 Enhanced downstream task accuracy

03 Effective chunk summarization with sentinel tokens

Abstract

Large language models (LLMs) have shown promising efficacy across various tasks, becoming powerful tools in numerous aspects of human life. However, Transformer-based LLMs suffer performance degradation when modeling long-term contexts because they discard some information to reduce computational overhead. In this work, we propose a simple yet effective method to enable LLMs to take a deep breath, encouraging them to summarize information contained within discrete text chunks. Specifically, we segment the text into multiple chunks and insert a special token <SR> at the end of each chunk. We then modify the attention mask to integrate the chunk's information into the corresponding <SR> token. This enables LLMs to interpret information not only from historical individual tokens but also from the <SR> tokens, which aggregate each chunk's semantic information. Experiments on language modeling…
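The chunking and masking steps described in the abstract can be illustrated with a small sketch. The abstract does not give the exact mask, so the rule used here is an assumption: each <SR> token attends only to the tokens of its own chunk (and itself), while ordinary tokens keep a standard causal mask and can therefore see earlier <SR> tokens. The function names `insert_sentinels` and `build_mask` and the fixed `chunk_size` are hypothetical, chosen only for illustration.

```python
from typing import List

SR = "<SR>"  # sentinel token named in the abstract


def insert_sentinels(tokens: List[str], chunk_size: int) -> List[str]:
    """Split tokens into fixed-size chunks and append SR after each chunk.

    Fixed-size chunking is an assumption; the abstract only says the text
    is segmented into multiple chunks.
    """
    out: List[str] = []
    for start in range(0, len(tokens), chunk_size):
        out.extend(tokens[start:start + chunk_size])
        out.append(SR)
    return out


def build_mask(seq: List[str]) -> List[List[bool]]:
    """Return mask[i][j] == True when position i may attend to position j."""
    n = len(seq)

    # chunk_start[i] is the index of the first token in position i's chunk.
    chunk_start: List[int] = []
    start = 0
    for i, tok in enumerate(seq):
        chunk_start.append(start)
        if tok == SR:
            start = i + 1  # the next chunk begins after this sentinel

    mask = [[False] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):  # never attend to future positions
            if seq[i] == SR:
                # Assumed rule: a sentinel sees only its own chunk.
                mask[i][j] = j >= chunk_start[i]
            else:
                # Ordinary tokens use a plain causal mask.
                mask[i][j] = True
    return mask


seq = insert_sentinels(["a", "b", "c", "d", "e"], chunk_size=2)
mask = build_mask(seq)
```

With this input, `seq` has a sentinel after "b", after "d", and after the trailing partial chunk "e". The row of `mask` for the second sentinel is True only for "c", "d", and the sentinel itself, which is what forces that position to aggregate a single chunk under the stated assumption.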

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques