BoundRL: Efficient Structured Text Segmentation through Reinforced Boundary Generation

Haoyuan Li; Zhengyuan Shen; Sullam Jeoung; Yueyan Chen; Jiayu Li; Qi Zhu; Shuai Wang; Vassilis Ioannidis; Huzefa Rangwala

arXiv:2510.20151·cs.CL·April 17, 2026

BoundRL: Efficient Structured Text Segmentation through Reinforced Boundary Generation

Haoyuan Li, Zhengyuan Shen, Sullam Jeoung, Yueyan Chen, Jiayu Li, Qi Zhu, Shuai Wang, Vassilis Ioannidis, Huzefa Rangwala

PDF

TL;DR

BoundRL introduces an efficient token-level segmentation method for structured texts, leveraging reinforcement learning to improve boundary detection and outperform larger models in complex prompt tasks.

Contribution

It proposes BoundRL, a novel reinforcement learning approach that reduces token output and hallucination in structured text segmentation, enabling smaller models to excel.

Findings

01

Reduces output tokens by 90% compared to full text generation.

02

Enables small models to outperform larger models in complex prompt tasks.

03

Uses reinforcement learning with verifiable rewards for boundary generation.

Abstract

Structured texts refer to texts containing structured elements beyond plain texts, such as code snippets and placeholders. Such structured texts increasingly require segmentation into semantically meaningful components, which cannot be effectively handled by conventional sentence-level segmentation methods. To address this, we propose BoundRL, a novel approach that jointly performs efficient token-level text segmentation and label prediction for long structured texts. Instead of generating full texts for each segment, it generates only starting tokens and reconstructs the complete texts by locating these tokens within the original texts, thereby reducing output tokens by 90% and minimizing hallucination. To train the models for the boundary generation, BoundRL~performs reinforcement learning with verifiable rewards (RLVR) that jointly optimizes document reconstruction fidelity and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.