Classifying Sequences of Extreme Length with Constant Memory Applied to   Malware Detection

Edward Raff; William Fleshman; Richard Zak; Hyrum S. Anderson; Bobby; Filar; Mark McLean

arXiv:2012.09390·stat.ML·December 18, 2020

Classifying Sequences of Extreme Length with Constant Memory Applied to Malware Detection

Edward Raff, William Fleshman, Richard Zak, Hyrum S. Anderson, Bobby, Filar, Mark McLean

PDF

1 Repo 1 Video

TL;DR

This paper introduces a memory-efficient approach to sequence classification that handles extremely long inputs, enabling improved malware detection with larger datasets and more complex models.

Contribution

The authors develop a novel memory-invariant max pooling method and enhance MalConv with a global channel gating attention mechanism for processing 100 million time steps.

Findings

01

Memory usage is reduced by 116 times, enabling processing of longer sequences.

02

Training time is reduced by up to 25.8 times on original datasets.

03

The new architecture improves feature interaction learning across extremely long sequences.

Abstract

Recent works within machine learning have been tackling inputs of ever-increasing size, with cybersecurity presenting sequence classification problems of particularly extreme lengths. In the case of Windows executable malware detection, inputs may exceed $100$ MB, which corresponds to a time series with $T = 100, 000, 000$ steps. To date, the closest approach to handling such a task is MalConv, a convolutional neural network capable of processing up to $T = 2, 000, 000$ steps. The $O (T)$ memory of CNNs has prevented further application of CNNs to malware. In this work, we develop a new approach to temporal max pooling that makes the required memory invariant to the sequence length $T$ . This makes MalConv $116 \times$ more memory efficient, and up to $25.8 \times$ faster to train on its original dataset, while removing the input length restrictions to MalConv. We re-invest these gains…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

NeuromorphicComputationResearchProgram/MalConv2
pytorchOfficial

Videos

Classifying Sequences of Extreme Length with Constant Memory Applied to Malware Detection· underline

Taxonomy

MethodsMax Pooling