TL;DR
This paper introduces a memory-efficient approach to sequence classification that handles extremely long inputs, enabling improved malware detection with larger datasets and more complex models.
Contribution
The authors develop a novel memory-invariant max pooling method and enhance MalConv with a global channel gating attention mechanism for processing 100 million time steps.
Findings
Memory usage is reduced by 116 times, enabling processing of longer sequences.
Training time is reduced by up to 25.8 times on original datasets.
The new architecture improves feature interaction learning across extremely long sequences.
Abstract
Recent works within machine learning have been tackling inputs of ever-increasing size, with cybersecurity presenting sequence classification problems of particularly extreme lengths. In the case of Windows executable malware detection, inputs may exceed MB, which corresponds to a time series with steps. To date, the closest approach to handling such a task is MalConv, a convolutional neural network capable of processing up to steps. The memory of CNNs has prevented further application of CNNs to malware. In this work, we develop a new approach to temporal max pooling that makes the required memory invariant to the sequence length . This makes MalConv more memory efficient, and up to faster to train on its original dataset, while removing the input length restrictions to MalConv. We re-invest these gains…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
MethodsMax Pooling
