FAIL: Analyzing Software Failures from the News Using LLMs
Dharun Anandayuvaraj, Matthew Campbell, Arav Tewari, James C. Davis

TL;DR
FAIL leverages large language models to automate the collection, grouping, and analysis of news-reported software failures, enabling large-scale insights into failure patterns, recurrence, and severity over time.
Contribution
This work introduces FAIL, an automated system that analyzes news reports of software failures using LLMs, filling the gap of manual, small-scale failure analysis methods.
Findings
LLMs can effectively identify and analyze software failure news articles.
High recurrence of similar failures occurs within and across organizations.
Software failure severity has increased over the past decade.
Abstract
Software failures inform engineering work, standards, regulations. For example, the Log4J vulnerability brought government and industry attention to evaluating and securing software supply chains. Accessing private engineering records is difficult, so failure analyses tend to use information reported by the news media. However, prior works in this direction have relied on manual analysis. That has limited the scale of their analyses. The community lacks automated support to enable such analyses to consider a wide range of news sources and incidents. In this paper, we propose the Failure Analysis Investigation with LLMs (FAIL) system to fill this gap. FAIL collects, analyzes, and summarizes software failures as reported in the news. FAIL groups articles that describe the same incidents. It then analyzes incidents using existing taxonomies for postmortems, faults, and system…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software System Performance and Reliability · Software Reliability and Analysis Research
