Detecting Unknown DGAs without Context Information
Arthur Drichel, Justus von Brandt, Ulrike Meyer

TL;DR
This paper introduces a novel approach combining softmax classifiers and regexes to detect unknown DGAs, effectively identifying new malware families without prior context, while maintaining high accuracy for known DGAs.
Contribution
The study evaluates 59,690 classifiers and proposes a simple, effective method for detecting multiple unknown DGAs, enhancing malware detection capabilities without relying on contextual information.
Findings
Effective detection of unknown DGAs with high probability
Maintains state-of-the-art performance for known DGAs
Operates without context, preserving privacy
Abstract
New malware emerges at a rapid pace and often incorporates Domain Generation Algorithms (DGAs) to avoid blocking the malware's connection to the command and control (C2) server. Current state-of-the-art classifiers are able to separate benign from malicious domains (binary classification) and attribute them with high probability to the DGAs that generated them (multiclass classification). While binary classifiers can label domains of yet unknown DGAs as malicious, multiclass classifiers can only assign domains to DGAs that are known at the time of training, limiting the ability to uncover new malware families. In this work, we perform a comprehensive study on the detection of new DGAs, which includes an evaluation of 59,690 classifiers. We examine four different approaches in 15 different configurations and propose a simple yet effective approach based on the combination of a softmax…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSoftmax
