StringLLM: Understanding the String Processing Capability of Large Language Models

Xilong Wang; Hao Fu; Jindong Wang; Neil Zhenqiang Gong

arXiv:2410.01208·cs.CL·January 28, 2025



TL;DR

This paper systematically evaluates large language models' ability to process strings, introduces datasets for benchmarking, analyzes their limitations, and proposes fine-tuning methods to improve their string processing skills.

Contribution

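The paper presents StringLLM, a method for constructing benchmark datasets, and StringBench, the datasets built with it, for evaluating string processing in LLMs. It also offers insights into why LLMs struggle with strings and fine-tuning methods to improve that capability.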

Findings

01 LLMs struggle with string processing compared to humans

02 The proposed fine-tuning approach significantly improves LLMs' string capabilities

03 The study provides a foundation for future research in LLM string processing

Abstract

String processing, which mainly involves the analysis and manipulation of strings, is a fundamental component of modern computing. Despite the significant advancements of large language models (LLMs) in various natural language processing (NLP) tasks, their capability in string processing remains underexplored and underdeveloped. To bridge this gap, we present a comprehensive study of LLMs' string processing capability. In particular, we first propose StringLLM, a method to construct datasets for benchmarking string processing capability of LLMs. We use StringLLM to build a series of datasets, referred to as StringBench. It encompasses a wide range of string processing tasks, allowing us to systematically evaluate LLMs' performance in this area. Our evaluations indicate that LLMs struggle with accurately processing strings compared to humans. To uncover the underlying reasons for this…
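The abstract does not spell out how StringBench items are built or scored, so the following is only an illustrative sketch of what a string processing benchmark could look like, not the authors' StringLLM method. It assumes each item pairs a random string with an operation, that the ground truth is computed by running ordinary Python string code, and that a model's answer is scored by exact match. The task names, `make_item`, and `exact_match_accuracy` are all hypothetical.

```python
import random
import string

# Hypothetical task table: these names and operations are illustrative
# assumptions, not the task set used in StringBench.
TASKS = {
    "reverse": lambda s: s[::-1],
    "uppercase": lambda s: s.upper(),
    "length": lambda s: str(len(s)),
    "count_a": lambda s: str(s.count("a")),
}


def make_item(task, rng, length=8):
    """Build one benchmark item; the ground truth comes from executing code."""
    text = "".join(rng.choice(string.ascii_lowercase) for _ in range(length))
    return {"task": task, "input": text, "expected": TASKS[task](text)}


def exact_match_accuracy(items, predictions):
    """Fraction of predictions that equal the expected string exactly."""
    if not items:
        return 0.0
    correct = sum(
        pred.strip() == item["expected"]
        for item, pred in zip(items, predictions)
    )
    return correct / len(items)


# Seeded generator so the same items are produced on every run.
rng = random.Random(0)
items = [make_item(task, rng) for task in TASKS]
```

In such a setup, `predictions` would hold the model's answers to prompts derived from each item. Computing the expected output by execution rather than by hand keeps the labels reliable even for long or random strings.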


Code & Models

Repositories

wxl-lxw/stringllm
Official


Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis