Packet2Vec: Utilizing Word2Vec for Feature Extraction in Packet Data

Eric L. Goodman; Chase Zimmerman; Corey Hudson

arXiv:2004.14477·cs.LG·May 1, 2020·5 cites

Packet2Vec: Utilizing Word2Vec for Feature Extraction in Packet Data

Eric L. Goodman, Chase Zimmerman, Corey Hudson

PDF

Open Access 1 Repo

TL;DR

This paper introduces Packet2Vec, a novel method adapting Word2Vec for automatic feature extraction from raw network packet data to improve intrusion detection accuracy.

Contribution

Packet2Vec is the first approach to apply Word2Vec to network packet data for feature extraction in intrusion detection tasks.

Findings

01

Achieved high AUC scores between 0.988-0.996 for ROC.

02

Obtained AUC between 0.604-0.667 for Precision/Recall.

03

Demonstrated effectiveness of deep learning-based feature extraction.

Abstract

One of deep learning's attractive benefits is the ability to automatically extract relevant features for a target problem from largely raw data, instead of utilizing human engineered and error prone handcrafted features. While deep learning has shown success in fields such as image classification and natural language processing, its application for feature extraction on raw network packet data for intrusion detection is largely unexplored. In this paper we modify a Word2Vec approach, used for text processing, and apply it to packet data for automatic feature extraction. We call this approach Packet2Vec. For the classification task of benign versus malicious traffic on a 2009 DARPA network data set, we obtain an area under the curve (AUC) of the receiver operating characteristic (ROC) between 0.988-0.996 and an AUC of the Precision/Recall curve between 0.604-0.667.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sandialabs/packet2vec
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNetwork Security and Intrusion Detection · Internet Traffic Analysis and Secure E-voting · Network Packet Processing and Optimization