Harnessing Generative Pre-Trained Transformer for Datacenter Packet   Trace Generation

Chen Griner

arXiv:2501.12033·cs.NI·January 22, 2025

Harnessing Generative Pre-Trained Transformer for Datacenter Packet Trace Generation

Chen Griner

PDF

Open Access

TL;DR

This paper introduces DTG-GPT, a generative transformer model that synthesizes realistic datacenter traffic traces, enabling better research and optimization by accurately mimicking complex traffic patterns across different network scales.

Contribution

The paper presents a novel GPT-based model for datacenter traffic generation that effectively reproduces intricate traffic patterns from limited data, a significant step beyond traditional mathematical models.

Findings

01

DTG-GPT can generate traffic traces that closely mimic real traffic patterns.

02

The model maintains fidelity across different network scales.

03

It demonstrates potential for future data sharing via trained models.

Abstract

Today, the rapid growth of applications reliant on datacenters calls for new advancements to meet the increasing traffic and computational demands. Traffic traces from datacenters are essential for further development and optimization of future datacenters. However, traces are rarely released to the public. Researchers often use simplified mathematical models that lack the depth needed to recreate intricate traffic patterns and, thus, miss optimization opportunities found in realistic traffic. In this preliminary work, we introduce DTG-GPT, a packet-level Datacenter Traffic Generator (DTG), based on the generative pre-trained transformer (GPT) architecture used by many state-of-the-art large language models. We train our model on a small set of available traffic traces from different domains and offer a simple methodology to evaluate the fidelity of the generated traces to their…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware-Defined Networks and 5G · Cloud Computing and Resource Management · Distributed and Parallel Computing Systems

MethodsAttention Is All You Need · Linear Layer · Multi-Head Attention · Discriminative Fine-Tuning · Layer Normalization · Cosine Annealing · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Adam · Softmax