On Improving Deep Learning Trace Analysis with System Call Arguments
Quentin Fournier, Daniel Aloise, Seyed Vahid Azhari, and Fran\c{c}ois, Tetreault

TL;DR
This paper enhances deep learning analysis of kernel traces by incorporating event arguments, improving anomaly detection and language modeling performance across multiple neural network architectures.
Contribution
It introduces a general, task-agnostic approach to embedding event names and arguments in kernel trace analysis, validated on real-world datasets.
Findings
Up to 11.3% performance improvement on language modeling tasks
Effective integration of event arguments boosts neural network accuracy
Validated on datasets from web requests and pre-production servers
Abstract
Kernel traces are sequences of low-level events comprising a name and multiple arguments, including a timestamp, a process id, and a return value, depending on the event. Their analysis helps uncover intrusions, identify bugs, and find latency causes. However, their effectiveness is hindered by omitting the event arguments. To remedy this limitation, we introduce a general approach to learning a representation of the event names along with their arguments using both embedding and encoding. The proposed method is readily applicable to most neural networks and is task-agnostic. The benefit is quantified by conducting an ablation study on three groups of arguments: call-related, process-related, and time-related. Experiments were conducted on a novel web request dataset and validated on a second dataset collected on pre-production servers by Ciena, our partnering company. By leveraging…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Dropout · Attention Is All You Need · Label Smoothing · Adam · Residual Connection · Tanh Activation
