An End-to-End Framework for Building Large Language Models for Software Operations
Jingkai He, Pengfei Chen, Chenghui Wu, Shuang Liang, Ye Li, Gou Tan, Xiadao Wen, Chuanfu Zhang

TL;DR
This paper introduces OpsLLM, a domain-specific large language model for software operations, developed through a detailed end-to-end process including data curation, fine-tuning, and reinforcement learning, outperforming existing models.
Contribution
The paper presents a comprehensive workflow for building domain-specific LLMs in software operations, including data curation, fine-tuning, and reinforcement learning, with open-source models and datasets.
Findings
OpsLLM outperforms existing LLMs in accuracy on QA and RCA tasks.
The models demonstrate strong transferability across tasks.
Open-source models include 7B, 14B, and 32B parameter versions with a 15K dataset.
Abstract
In the field of software operations, Large Language Models (LLMs) have attracted increasing attention. However, existing research has not yet achieved efficient and effective end-to-end intelligent operations due to low-quality data, fragmented knowledge and insufficient learning. To explore the potential of LLMs in software operations, we propose OpsLLM, a domain-specific LLM that supports both knowledge-based question answering (QA) and root cause analysis (RCA). Moreover, we disclose the detailed workflow for building LLMs specifically in the software operations domain. First, a Human-in-the-Loop mechanism is introduced to curate highquality data from a large collection of operational raw data and construct a fine-tuning dataset. Then, based on the data, supervised fine-tuning is conducted to achieve a base model. Furthermore, we introduce a domain process reward model (DPRM) during…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
