Are LLMs Correctly Integrated into Software Systems?
Yuchen Shao, Yuheng Huang, Jiawei Shen, Lei Ma, Ting Su, Chengcheng, Wan

TL;DR
This paper investigates the challenges of integrating large language models with retrieval-augmented generation into software systems, identifying common defect patterns and proposing guidelines to improve integration quality.
Contribution
It provides a comprehensive analysis of 100 open-source applications, identifies 18 defect patterns, and offers systematic guidelines and an open-source defect library for better LLM integration.
Findings
77% of applications have more than three types of integration defects
Identified 18 common defect patterns affecting functionality, efficiency, and security
Proposed systematic guidelines and developed the Hydrangea defect library
Abstract
Large language models (LLMs) provide effective solutions in various application scenarios, with the support of retrieval-augmented generation (RAG). However, developers face challenges in integrating LLM and RAG into software systems, due to lacking interface specifications, various requirements from software context, and complicated system management. In this paper, we have conducted a comprehensive study of 100 open-source applications that incorporate LLMs with RAG support, and identified 18 defect patterns. Our study reveals that 77% of these applications contain more than three types of integration defects that degrade software functionality, efficiency, and security. Guided by our study, we propose systematic guidelines for resolving these defects in software life cycle. We also construct an open-source defect library Hydrangea.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInertial Sensor and Navigation
MethodsAttention Is All You Need · Linear Warmup With Linear Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Weight Decay · WordPiece · Attention Dropout · Byte Pair Encoding · Layer Normalization · Residual Connection · Dense Connections
