An End-to-End Framework for Building Large Language Models for Software Operations

Jingkai He; Pengfei Chen; Chenghui Wu; Shuang Liang; Ye Li; Gou Tan; Xiadao Wen; Chuanfu Zhang

arXiv:2605.02906·cs.LG·May 13, 2026

An End-to-End Framework for Building Large Language Models for Software Operations

Jingkai He, Pengfei Chen, Chenghui Wu, Shuang Liang, Ye Li, Gou Tan, Xiadao Wen, Chuanfu Zhang

PDF

TL;DR

This paper introduces OpsLLM, a domain-specific large language model for software operations, developed through a detailed end-to-end process including data curation, fine-tuning, and reinforcement learning, outperforming existing models.

Contribution

The paper presents a comprehensive workflow for building domain-specific LLMs in software operations, including data curation, fine-tuning, and reinforcement learning, with open-source models and datasets.

Findings

01

OpsLLM outperforms existing LLMs in accuracy on QA and RCA tasks.

02

The models demonstrate strong transferability across tasks.

03

Open-source models include 7B, 14B, and 32B parameter versions with a 15K dataset.

Abstract

In the field of software operations, Large Language Models (LLMs) have attracted increasing attention. However, existing research has not yet achieved efficient and effective end-to-end intelligent operations due to low-quality data, fragmented knowledge and insufficient learning. To explore the potential of LLMs in software operations, we propose OpsLLM, a domain-specific LLM that supports both knowledge-based question answering (QA) and root cause analysis (RCA). Moreover, we disclose the detailed workflow for building LLMs specifically in the software operations domain. First, a Human-in-the-Loop mechanism is introduced to curate highquality data from a large collection of operational raw data and construct a fine-tuning dataset. Then, based on the data, supervised fine-tuning is conducted to achieve a base model. Furthermore, we introduce a domain process reward model (DPRM) during…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.