Learning Tractable Probabilistic Models for Fault Localization

Aniruddh Nath; Pedro Domingos

arXiv:1507.01698·cs.SE·July 8, 2015

Learning Tractable Probabilistic Models for Fault Localization

Aniruddh Nath, Pedro Domingos

PDF

Open Access

TL;DR

This paper introduces Tractable Fault Localization Models (TFLMs), which learn from data across multiple buggy programs to improve bug localization by modeling dependencies between code lines, outperforming existing methods.

Contribution

The paper proposes TFLMs that leverage recent tractable probabilistic models to generalize fault localization across programs, incorporating multiple features and dependencies.

Findings

01

TFLMs outperform previous statistical debugging methods.

02

Incorporating TARANTULA scores improves bug localization.

03

TFLMs effectively model dependencies between code lines.

Abstract

In recent years, several probabilistic techniques have been applied to various debugging problems. However, most existing probabilistic debugging systems use relatively simple statistical models, and fail to generalize across multiple programs. In this work, we propose Tractable Fault Localization Models (TFLMs) that can be learned from data, and probabilistically infer the location of the bug. While most previous statistical debugging methods generalize over many executions of a single program, TFLMs are trained on a corpus of previously seen buggy programs, and learn to identify recurring patterns of bugs. Widely-used fault localization techniques such as TARANTULA evaluate the suspiciousness of each line in isolation; in contrast, a TFLM defines a joint probability distribution over buggy indicator variables for each line. Joint distributions with rich dependency structure are often…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Research · Software Testing and Debugging Techniques · Software Reliability and Analysis Research