Detecting Unknown DGAs without Context Information

Arthur Drichel; Justus von Brandt; Ulrike Meyer

arXiv:2205.14940·cs.CR·May 31, 2022

Detecting Unknown DGAs without Context Information

Arthur Drichel, Justus von Brandt, Ulrike Meyer

PDF

TL;DR

This paper introduces a novel approach combining softmax classifiers and regexes to detect unknown DGAs, effectively identifying new malware families without prior context, while maintaining high accuracy for known DGAs.

Contribution

The study evaluates 59,690 classifiers and proposes a simple, effective method for detecting multiple unknown DGAs, enhancing malware detection capabilities without relying on contextual information.

Findings

01

Effective detection of unknown DGAs with high probability

02

Maintains state-of-the-art performance for known DGAs

03

Operates without context, preserving privacy

Abstract

New malware emerges at a rapid pace and often incorporates Domain Generation Algorithms (DGAs) to avoid blocking the malware's connection to the command and control (C2) server. Current state-of-the-art classifiers are able to separate benign from malicious domains (binary classification) and attribute them with high probability to the DGAs that generated them (multiclass classification). While binary classifiers can label domains of yet unknown DGAs as malicious, multiclass classifiers can only assign domains to DGAs that are known at the time of training, limiting the ability to uncover new malware families. In this work, we perform a comprehensive study on the detection of new DGAs, which includes an evaluation of 59,690 classifiers. We examine four different approaches in 15 different configurations and propose a simple yet effective approach based on the combination of a softmax…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSoftmax