Online detection of failures generated by storage simulator
Kenenbek Arzymatov, Mikhail Hushchyn, Andrey Sapronov, Vladislav, Belavin, Leonid Gremyachikh, Maksim Karpov, Andrey Ustyuzhanin

TL;DR
This paper introduces a storage system simulator and an online failure detection method using change point detection and density ratio estimation to improve failure identification in large-scale data centers.
Contribution
It presents a novel Go-based storage simulator and adapts online change point detection algorithms for failure detection in simulated storage system data.
Findings
Created a flexible storage system simulator in Go
Modified online change point detection for failure identification
Demonstrated effective failure detection in simulated data
Abstract
Modern large-scale data-farms consist of hundreds of thousands of storage devices that span distributed infrastructure. Devices used in modern data centers (such as controllers, links, SSD- and HDD-disks) can fail due to hardware as well as software problems. Such failures or anomalies can be detected by monitoring the activity of components using machine learning techniques. In order to use these techniques, researchers need plenty of historical data of devices in normal and failure mode for training algorithms. In this work, we challenge two problems: 1) lack of storage data in the methods above by creating a simulator and 2) applying existing online algorithms that can faster detect a failure occurred in one of the components. We created a Go-based (golang) package for simulating the behavior of modern storage infrastructure. The software is based on the discrete-event modeling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Time Series Analysis and Forecasting · Data Stream Mining Techniques
