BehaveGPT: A Foundation Model for Large-scale User Behavior Modeling

Jiahui Gong; Jingtao Ding; Fanjin Meng; Chen Yang; Hong Chen; Zuojian Wang; Haisheng Lu; Yong Li

arXiv:2505.17631·cs.IR·May 26, 2025

BehaveGPT: A Foundation Model for Large-scale User Behavior Modeling

Jiahui Gong, Jingtao Ding, Fanjin Meng, Chen Yang, Hong Chen, Zuojian Wang, Haisheng Lu, Yong Li

PDF

TL;DR

BehaveGPT is a transformer-based foundational model tailored for large-scale user behavior prediction, leveraging a novel pretraining paradigm to improve generalization and outperform existing methods on real-world datasets.

Contribution

The paper introduces BehaveGPT, a new model with a DRO-based pretraining paradigm specifically designed for user behavior modeling, enabling better capture of complex patterns and transferability.

Findings

01

Achieves over 10% improvement in macro and weighted recall over baselines.

02

Demonstrates effective modeling of complex user behavior patterns.

03

Provides the first scaling law analysis for user behavior models.

Abstract

In recent years, foundational models have revolutionized the fields of language and vision, demonstrating remarkable abilities in understanding and generating complex data; however, similar advances in user behavior modeling have been limited, largely due to the complexity of behavioral data and the challenges involved in capturing intricate temporal and contextual relationships in user activities. To address this, we propose BehaveGPT, a foundational model designed specifically for large-scale user behavior prediction. Leveraging transformer-based architecture and a novel pretraining paradigm, BehaveGPT is trained on vast user behavior datasets, allowing it to learn complex behavior patterns and support a range of downstream tasks, including next behavior prediction, long-term generation, and cross-domain adaptation. Our approach introduces the DRO-based pretraining paradigm tailored…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.