DeepAnalyze: Learning to Localize Crashes at Scale
Manish Shetty, Chetan Bansal, Suman Nath, Sean Bowles, Henry Wang,, Ozgur Arman, Siamak Ahari

TL;DR
DeepAnalyze is a scalable, data-driven approach that uses machine learning to accurately localize crash causes in large-scale error reporting systems, reducing manual rule creation and maintenance.
Contribution
We introduce DeepAnalyze, a novel multi-task sequence labeling model that effectively localizes crash causes across diverse applications with minimal additional training.
Findings
Accurately localizes crashes in large-scale real-world data
Generalizes to new applications with little to no retraining
Outperforms rule-based crash localization methods
Abstract
Crash localization, an important step in debugging crashes, is challenging when dealing with an extremely large number of diverse applications and platforms and underlying root causes. Large-scale error reporting systems, e.g., Windows Error Reporting (WER), commonly rely on manually developed rules and heuristics to localize blamed frames causing the crashes. As new applications and features are routinely introduced and existing applications are run under new environments, developing new rules and maintaining existing ones become extremely challenging. We propose a data-driven solution to address the problem. We start with the first large-scale empirical study of 362K crashes and their blamed methods reported to WER by tens of thousands of applications running in the field. The analysis provides valuable insights on where and how the crashes happen and what methods to blame for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Software Engineering Research · Software System Performance and Reliability
