# Evaluation of Machine Learning-based Anomaly Detection Algorithms on an   Industrial Modbus/TCP Data Set

**Authors:** Simon Duque Anton, Suneetha Kanoor, Daniel Fraunholz, and Hans Dieter, Schotten

arXiv: 1905.11757 · 2019-05-29

## TL;DR

This paper evaluates machine learning algorithms for detecting malicious traffic in industrial Modbus/TCP data, highlighting the effectiveness of SVM and k-NN in a synthetic scenario, and comparing their performance.

## Contribution

It provides an empirical comparison of several ML-based anomaly detection algorithms on industrial Modbus/TCP data, emphasizing the suitability of supervised methods.

## Key findings

- SVM and k-NN perform well on synthetic data
- k-means clustering does not perform satisfactorily
- Supervised learning enables effective anomaly detection

## Abstract

In the context of the Industrial Internet of Things, communication technology, originally used in home and office environments, is introduced into industrial applications. Commercial off-the-shelf products, as well as unified and well-established communication protocols make this technology easy to integrate and use. Furthermore, productivity is increased in comparison to classic industrial control by making systems easier to manage, set up and configure. Unfortunately, most attack surfaces of home and office environments are introduced into industrial applications as well, which usually have very few security mechanisms in place. Over the last years, several technologies tackling that issue have been researched. In this work, machine learning-based anomaly detection algorithms are employed to find malicious traffic in a synthetically generated data set of Modbus/TCP communication of a fictitious industrial scenario. The applied algorithms are Support Vector Machine (SVM), Random Forest, k-nearest neighbour and k-means clustering. Due to the synthetic data set, supervised learning is possible. Support Vector Machine and k-nearest neighbour perform well with different data sets, while k-nearest neighbour and k-means clustering do not perform satisfactorily.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1905.11757/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/1905.11757/full.md

## References

53 references — full list in the complete paper: https://tomesphere.com/paper/1905.11757/full.md

---
Source: https://tomesphere.com/paper/1905.11757