TextBox 2.0: A Text Generation Library with Pre-trained Language Models
Tianyi Tang, Junyi Li, Zhipeng Chen, Yiwen Hu, Zhuohao Yu, Wenxun Dai,, Zican Dong, Xiaoxue Cheng, Yuhao Wang, Wayne Xin Zhao, Jian-Yun Nie, and, Ji-Rong Wen

TL;DR
TextBox 2.0 is a comprehensive, user-friendly library that supports 13 text generation tasks, 83 datasets, 45 pre-trained models, and integrates the entire research pipeline for efficient text generation research.
Contribution
It introduces a unified, easy-to-use library for text generation that covers diverse tasks, datasets, models, and training strategies, streamlining research workflows.
Findings
Validated effectiveness through extensive experiments
Demonstrated support for diverse research scenarios
Showcased ease of use via Python API and command line
Abstract
To facilitate research on text generation, this paper presents a comprehensive and unified library, TextBox 2.0, focusing on the use of pre-trained language models (PLMs). To be comprehensive, our library covers common text generation tasks and their corresponding datasets and further incorporates PLMs covering general, translation, Chinese, dialogue, controllable, distilled, prompting, and lightweight PLMs. We also implement efficient training strategies and provide generation objectives for pre-training new PLMs from scratch. To be unified, we design the interfaces to support the entire research pipeline (from data loading to training and evaluation), ensuring that each step can be fulfilled in a unified way. Despite the rich functionality, it is easy to use our library, either through the friendly Python API or command line. To validate the effectiveness of our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
MethodsLib
