Linnaeus: A highly reusable and adaptable ML based log classification   pipeline

Armin Catovic; Carolyn Cartwright; Yasmin Tesfaldet Gebreyesus and; Simone Ferlin

arXiv:2103.06927·cs.LG·March 15, 2021

Linnaeus: A highly reusable and adaptable ML based log classification pipeline

Armin Catovic, Carolyn Cartwright, Yasmin Tesfaldet Gebreyesus and, Simone Ferlin

PDF

TL;DR

Linnaeus is an adaptable, reusable machine learning pipeline for log classification that addresses practical challenges like scalability, integration, and limited labeled data in large-scale software systems.

Contribution

The paper introduces Linnaeus, an end-to-end log classification pipeline emphasizing practicality, reusability, adaptability, and integration in industrial environments.

Findings

01

Demonstrates effective log classification with minimal labeled data

02

Shows high reusability and adaptability of the pipeline

03

Integrates ML solutions into large-scale software processes

Abstract

Logs are a common way to record detailed run-time information in software. As modern software systems evolve in scale and complexity, logs have become indispensable to understanding the internal states of the system. At the same time however, manually inspecting logs has become impractical. In recent times, there has been more emphasis on statistical and machine learning (ML) based methods for analyzing logs. While the results have shown promise, most of the literature focuses on algorithms and state-of-the-art (SOTA), while largely ignoring the practical aspects. In this paper we demonstrate our end-to-end log classification pipeline, Linnaeus. Besides showing the more traditional ML flow, we also demonstrate our solutions for adaptability and re-use, integration towards large scale software development processes, and how we cope with lack of labelled data. We hope Linnaeus can serve…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.