# Multitask Learning for Network Traffic Classification

**Authors:** Shahbaz Rezaei, Xin Liu

arXiv: 1906.05248 · 2021-01-25

## TL;DR

This paper introduces a multi-task learning framework for network traffic classification that leverages easily obtainable flow features like bandwidth and duration to improve accuracy with less labeled data.

## Contribution

The paper proposes a novel multi-task learning approach that predicts bandwidth, duration, and traffic class simultaneously, reducing the need for extensive labeled data.

## Key findings

- High accuracy achieved with limited labeled traffic data.
- Effective use of easily obtainable flow features.
- Validated on ISCX and QUIC datasets.

## Abstract

Traffic classification has various applications in today's Internet, from resource allocation, billing and QoS purposes in ISPs to firewall and malware detection in clients. Classical machine learning algorithms and deep learning models have been widely used to solve the traffic classification task. However, training such models requires a large amount of labeled data. Labeling data is often the most difficult and time-consuming process in building a classifier. To solve this challenge, we reformulate the traffic classification into a multi-task learning framework where bandwidth requirement and duration of a flow are predicted along with the traffic class. The motivation of this approach is twofold: First, bandwidth requirement and duration are useful in many applications, including routing, resource allocation, and QoS provisioning. Second, these two values can be obtained from each flow easily without the need for human labeling or capturing flows in a controlled and isolated environment. We show that with a large amount of easily obtainable data samples for bandwidth and duration prediction tasks, and only a few data samples for the traffic classification task, one can achieve high accuracy. We conduct two experiment with ISCX and QUIC public datasets and show the efficacy of our approach.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1906.05248/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/1906.05248/full.md

## References

28 references — full list in the complete paper: https://tomesphere.com/paper/1906.05248/full.md

---
Source: https://tomesphere.com/paper/1906.05248