Cyber-Attack Technique Classification Using Two-Stage Trained Large Language Models
Weiqiu You, Youngja Park

TL;DR
This paper introduces a two-stage training approach with large language models to classify cyberattack techniques from unstructured text in threat reports, improving accuracy in low-resource scenarios.
Contribution
It proposes a novel auxiliary data utilization method and a two-stage training process for better cyberattack technique classification from natural language.
Findings
Macro-F1 improved by 5-9 percentage points
Maintains competitive Micro-F1 scores
Validated on TRAM dataset with positive results
Abstract
Understanding the attack patterns associated with a cyberattack is crucial for comprehending the attacker's behaviors and implementing the right mitigation measures. However, majority of the information regarding new attacks is typically presented in unstructured text, posing significant challenges for security analysts in collecting necessary information. In this paper, we present a sentence classification system that can identify the attack techniques described in natural language sentences from cyber threat intelligence (CTI) reports. We propose a new method for utilizing auxiliary data with the same labels to improve classification for the low-resource cyberattack classification task. The system first trains the model using the augmented training data and then trains more using only the primary data. We validate our model using the TRAM data1 and the MITRE ATT&CK framework.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Network Security and Intrusion Detection
