RedOne 2.0: Rethinking Domain-specific LLM Post-Training in Social Networking Services

Fei Zhao; Chonggang Lu; Haofu Qian; Fangcheng Shi; Zijie Meng; Jianzhao Huang; Xu Tang; Zheyong Xie; Zheyu Ye; Zhe Xu; Yao Hu; Shaosheng Cao

arXiv:2511.07070·cs.AI·November 11, 2025

RedOne 2.0: Rethinking Domain-specific LLM Post-Training in Social Networking Services

Fei Zhao, Chonggang Lu, Haofu Qian, Fangcheng Shi, Zijie Meng, Jianzhao Huang, Xu Tang, Zheyong Xie, Zheyu Ye, Zhe Xu, Yao Hu, Shaosheng Cao

PDF

Open Access

TL;DR

RedOne 2.0 introduces a progressive RL-prioritized post-training approach for social networking services-oriented LLMs, achieving significant performance gains, data efficiency, and robustness at a smaller scale.

Contribution

The paper presents a novel three-stage post-training paradigm for SNS-specific LLMs, combining exploratory learning, targeted fine-tuning, and reinforcement learning for improved adaptation.

Findings

01

Achieves 2.41 average performance improvement over a 7B baseline.

02

Attains 8.74 average performance lift with less than half the data.

03

Demonstrates superior data efficiency and robustness at 4B scale.

Abstract

As a key medium for human interaction and information exchange, social networking services (SNS) pose unique challenges for large language models (LLMs): heterogeneous workloads, fast-shifting norms and slang, and multilingual, culturally diverse corpora that induce sharp distribution shift. Supervised fine-tuning (SFT) can specialize models but often triggers a ``seesaw'' between in-distribution gains and out-of-distribution robustness, especially for smaller models. To address these challenges, we introduce RedOne 2.0, an SNS-oriented LLM trained with a progressive, RL-prioritized post-training paradigm designed for rapid and stable adaptation. The pipeline consist in three stages: (1) Exploratory Learning on curated SNS corpora to establish initial alignment and identify systematic weaknesses; (2) Targeted Fine-Tuning that selectively applies SFT to the diagnosed gaps while mixing a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Mental Health via Writing · Advanced Graph Neural Networks