AST-Based Deep Learning for Detecting Malicious PowerShell

Gili Rusak; Abdullah Al-Dujaili; Una-May O'Reilly

arXiv:1810.09230·cs.SE·October 23, 2018

AST-Based Deep Learning for Detecting Malicious PowerShell

Gili Rusak, Abdullah Al-Dujaili, Una-May O'Reilly

PDF

TL;DR

This paper proposes a hybrid method combining static program analysis with deep learning to improve detection of malicious PowerShell scripts, focusing on AST-based embeddings for classification.

Contribution

It introduces a novel approach that integrates abstract syntax trees with deep learning for PowerShell malware detection, emphasizing node embedding techniques.

Findings

01

Preliminary results show effective node embeddings for PowerShell ASTs.

02

The approach improves classification of malicious script families.

03

Embedding-based features outperform traditional methods.

Abstract

With the celebrated success of deep learning, some attempts to develop effective methods for detecting malicious PowerShell programs employ neural nets in a traditional natural language processing setup while others employ convolutional neural nets to detect obfuscated malicious commands at a character level. While these representations may express salient PowerShell properties, our hypothesis is that tools from static program analysis will be more effective. We propose a hybrid approach combining traditional program analysis (in the form of abstract syntax trees) and deep learning. This poster presents preliminary results of a fundamental step in our approach: learning embeddings for nodes of PowerShell ASTs. We classify malicious scripts by family type and explore embedded program vector representations.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.