Tuning Language Models for Robust Prediction of Diverse User Behaviors

Fanjin Meng; Jingtao Ding; Jiahui Gong; Chen Yang; Hong Chen; Zuojian Wang; Haisheng Lu; Yong Li

arXiv:2505.17682·cs.CL·April 14, 2026

Tuning Language Models for Robust Prediction of Diverse User Behaviors

Fanjin Meng, Jingtao Ding, Jiahui Gong, Chen Yang, Hong Chen, Zuojian Wang, Haisheng Lu, Yong Li

PDF

TL;DR

This paper introduces BehaviorLM, a two-stage fine-tuning method for large language models that improves the prediction of both common and rare user behaviors in intelligent systems.

Contribution

BehaviorLM's progressive fine-tuning approach enhances tail behavior prediction while maintaining performance on frequent behaviors, leveraging LLMs' behavioral knowledge.

Findings

01

BehaviorLM outperforms existing methods on real-world datasets.

02

It effectively predicts rare tail behaviors with few-shot learning.

03

The approach preserves general behavioral knowledge during fine-tuning.

Abstract

Predicting user behavior is essential for intelligent assistant services, yet deep learning models often struggle to capture long-tailed behaviors. Large language models (LLMs), with their pretraining on vast corpora containing rich behavioral knowledge, offer promise. However, existing fine-tuning approaches tend to overfit to frequent ``anchor'' behaviors, reducing their ability to predict less common ``tail'' behaviors. In this paper, we introduce BehaviorLM, a progressive fine-tuning approach that addresses this issue. In the first stage, LLMs are fine-tuned on anchor behaviors while preserving general behavioral knowledge. In the second stage, fine-tuning uses a balanced subset of all behaviors based on sample difficulty to improve tail behavior predictions without sacrificing anchor performance. Experimental results on two real-world datasets demonstrate that BehaviorLM robustly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.