Exploring the Limits of Transfer Learning with Unified Model in the   Cybersecurity Domain

Kuntal Kumar Pal; Kazuaki Kashihara; Ujjwala Anantheswaran; Kirby C.; Kuznia; Siddhesh Jagtap; Chitta Baral

arXiv:2302.10346·cs.CL·February 22, 2023·5 cites

Exploring the Limits of Transfer Learning with Unified Model in the Cybersecurity Domain

Kuntal Kumar Pal, Kazuaki Kashihara, Ujjwala Anantheswaran, Kirby C., Kuznia, Siddhesh Jagtap, Chitta Baral

PDF

Open Access

TL;DR

This paper introduces UTS, a generative multi-task NLP model trained on diverse cybersecurity texts, which enhances threat detection and adapts efficiently to new tasks with limited data.

Contribution

The paper presents UTS, a unified multi-task generative model for cybersecurity NLP tasks, addressing data scarcity and enabling quick adaptation to unseen tasks.

Findings

01

UTS improves performance on multiple cybersecurity datasets.

02

UTS can be adapted to new tasks with few examples.

03

The model handles diverse text sources effectively.

Abstract

With the increase in cybersecurity vulnerabilities of software systems, the ways to exploit them are also increasing. Besides these, malware threats, irregular network interactions, and discussions about exploits in public forums are also on the rise. To identify these threats faster, to detect potentially relevant entities from any texts, and to be aware of software vulnerabilities, automated approaches are necessary. Application of natural language processing (NLP) techniques in the Cybersecurity domain can help in achieving this. However, there are challenges such as the diverse nature of texts involved in the cybersecurity domain, the unavailability of large-scale publicly available datasets, and the significant cost of hiring subject matter experts for annotations. One of the solutions is building multi-task models that can be trained jointly with limited data. In this work, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Malware Detection Techniques · Software Engineering Research · Cybercrime and Law Enforcement Studies

MethodsAttentive Walk-Aggregating Graph Neural Network