# Cardinality Estimation in a Virtualized Network Device Using Online   Machine Learning

**Authors:** Reuven Cohen, Yuval Nezri

arXiv: 1903.05728 · 2019-03-15

## TL;DR

This paper introduces an online machine learning framework for adaptive cardinality estimation in virtualized network devices, significantly improving accuracy over traditional sampling methods by adapting to flow size distribution changes.

## Contribution

The work presents a novel adaptive cardinality estimation framework using online machine learning, addressing the limitations of existing sampling algorithms in dynamic network environments.

## Key findings

- Significantly improved accuracy over existing sampling algorithms.
- Effective adaptation to changes in flow size distribution.
- Validated using real network traffic traces.

## Abstract

Cardinality estimation algorithms receive a stream of elements, with possible repetitions, and return the number of distinct elements in the stream. Such algorithms seek to minimize the required memory and CPU resource consumption at the price of inaccuracy in their output. In computer networks, cardinality estimation algorithms are mainly used for counting the number of distinct flows, and they are divided into two categories: sketching algorithms and sampling algorithms. Sketching algorithms require the processing of all packets, and they are therefore usually implemented by dedicated hardware. Sampling algorithms do not require processing of all packets, but they are known for their inaccuracy. In this work we identify one of the major drawbacks of sampling-based cardinality estimation algorithms: their inability to adapt to changes in flow size distribution. To address this problem, we propose a new sampling-based adaptive cardinality estimation framework, which uses online machine learning. We evaluate our framework using real traffic traces, and show significantly better accuracy compared to the best known sampling-based algorithms, for the same fraction of processed packets.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1903.05728/full.md

## Figures

32 figures with captions in the complete paper: https://tomesphere.com/paper/1903.05728/full.md

## References

28 references — full list in the complete paper: https://tomesphere.com/paper/1903.05728/full.md

---
Source: https://tomesphere.com/paper/1903.05728