Synthesizing and Adapting Error Correction Data for Mobile Large Language Model Applications

Yanxiang Zhang; Zheng Xu; Shanshan Wu; Yuanbo Zhang; Daniel Ramage

arXiv:2505.18488·cs.LG·May 27, 2025

Synthesizing and Adapting Error Correction Data for Mobile Large Language Model Applications

Yanxiang Zhang, Zheng Xu, Shanshan Wu, Yuanbo Zhang, Daniel Ramage

PDF

TL;DR

This paper presents a method to synthesize and adapt error correction data for mobile LLM applications, improving their performance through domain-specific data augmentation and reweighting based on live metrics.

Contribution

It introduces a scalable data synthesis pipeline using LLMs, with domain adaptation via reweighting based on live deployment metrics, enhancing mobile error correction models.

Findings

01

Synthetic data improves offline error correction performance.

02

Reweighting based on live metrics enhances real-world application results.

03

Best practices for combining synthetic and real data are identified.

Abstract

Error correction is an important capability when applying large language models (LLMs) to facilitate user typing on mobile devices. In this paper, we use LLMs to synthesize a high-quality dataset of error correction pairs to evaluate and improve LLMs for mobile applications. We first prompt LLMs with error correction domain knowledge to build a scalable and reliable addition to the existing data synthesis pipeline. We then adapt the synthetic data distribution to match the mobile application domain by reweighting the samples. The reweighting model is learnt by predicting (a handful of) live A/B test metrics when deploying LLMs in production, given the LLM performance on offline evaluation data and scores from a small privacy-preserving on-device language model. Finally, we present best practices for mixing our synthetic data with other data sources to improve model performance on error…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.