An Empirical Study on Using Large Language Models to Analyze Software   Supply Chain Security Failures

Tanmay Singla; Dharun Anandayuvaraj; Kelechi G. Kalu; Taylor R.; Schorlemmer; James C. Davis

arXiv:2308.04898·cs.CR·August 10, 2023·1 cites

An Empirical Study on Using Large Language Models to Analyze Software Supply Chain Security Failures

Tanmay Singla, Dharun Anandayuvaraj, Kelechi G. Kalu, Taylor R., Schorlemmer, James C. Davis

PDF

Open Access

TL;DR

This study evaluates the effectiveness of Large Language Models in analyzing and categorizing software supply chain security failures, highlighting their potential to assist but not replace human analysts.

Contribution

It demonstrates the capabilities and limitations of LLMs in analyzing real-world security breach reports, providing a foundation for future improvements.

Findings

01

GPT 3.5 achieved 68% accuracy in categorization

02

Bard achieved 58% accuracy in categorization

03

LLMs effectively characterize failures with detailed source articles

Abstract

As we increasingly depend on software systems, the consequences of breaches in the software supply chain become more severe. High-profile cyber attacks like those on SolarWinds and ShadowHammer have resulted in significant financial and data losses, underlining the need for stronger cybersecurity. One way to prevent future breaches is by studying past failures. However, traditional methods of analyzing these failures require manually reading and summarizing reports about them. Automated support could reduce costs and allow analysis of more failures. Natural Language Processing (NLP) techniques such as Large Language Models (LLMs) could be leveraged to assist the analysis of failures. In this study, we assessed the ability of Large Language Models (LLMs) to analyze historical software supply chain breaches. We used LLMs to replicate the manual analysis of 69 software supply chain…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware System Performance and Reliability · Software Engineering Research · Software Reliability and Analysis Research

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Cosine Annealing · Softmax · Layer Normalization · Discriminative Fine-Tuning · Adam · Residual Connection · Dense Connections