Detecting DGA domains with recurrent neural networks and side   information

Ryan R. Curtin; Andrew B. Gardner; Slawomir Grzonkowski; Alexey; Kleymenov; Alejandro Mosquera

arXiv:1810.02023·cs.CR·June 24, 2019

Detecting DGA domains with recurrent neural networks and side information

Ryan R. Curtin, Andrew B. Gardner, Slawomir Grzonkowski, Alexey, Kleymenov, Alejandro Mosquera

PDF

1 Repo

TL;DR

This paper introduces a novel recurrent neural network model combined with side information to detect DGA-generated malicious domains, especially those resembling English words, outperforming existing methods.

Contribution

The work presents a new RNN architecture and a difficulty measure called smashword score for improved detection of challenging DGA families.

Findings

01

Model effectively identifies domains from difficult DGA families.

02

Outperforms existing detection approaches.

03

Best performance on DGA families resembling English words.

Abstract

Modern malware typically makes use of a domain generation algorithm (DGA) to avoid command and control domains or IPs being seized or sinkholed. This means that an infected system may attempt to access many domains in an attempt to contact the command and control server. Therefore, the automatic detection of DGA domains is an important task, both for the sake of blocking malicious domains and identifying compromised hosts. However, many DGAs use English wordlists to generate plausibly clean-looking domain names; this makes automatic detection difficult. In this work, we devise a notion of difficulty for DGA families called the smashword score; this measures how much a DGA family looks like English words. We find that this measure accurately reflects how much a DGA family's domains look like they are made from natural English words. We then describe our new modeling approach, which is a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

alistairwgillespie/deep_dga_detection
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.