LSTM Recurrent Neural Networks for Cybersecurity Named Entity   Recognition

Houssem Gasmi (DISP); Jannik Laval (DISP); Abdelaziz Bouras (DISP)

arXiv:2409.10521·cs.IR·September 18, 2024·48 cites

LSTM Recurrent Neural Networks for Cybersecurity Named Entity Recognition

Houssem Gasmi (DISP), Jannik Laval (DISP), Abdelaziz Bouras (DISP)

PDF

Open Access

TL;DR

This paper presents an LSTM-CRF based approach for cybersecurity Named Entity Recognition that outperforms existing methods without requiring domain-specific feature engineering.

Contribution

It introduces a domain-independent LSTM-CRF model for cybersecurity NER that eliminates the need for manual feature engineering and domain expertise.

Findings

01

Outperforms state-of-the-art methods on cybersecurity NER tasks.

02

Does not require domain-specific feature engineering.

03

Effective on a reasonably sized annotated corpus.

Abstract

The automated and timely conversion of cybersecurity information from unstructured online sources, such as blogs and articles to more formal representations has become a necessity for many applications in the domain nowadays. Named Entity Recognition (NER) is one of the early phases towards this goal. It involves the detection of the relevant domain entities, such as product, version, attack name, etc. in technical documents. Although generally considered a simple task in the information extraction field, it is quite challenging in some domains like cybersecurity because of the complex structure of its entities. The state of the art methods require time-consuming and labor intensive feature engineering that describes the properties of the entities, their context, domain knowledge, and linguistic characteristics. The model demonstrated in this paper is domain independent and does not…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Network Security and Intrusion Detection · Natural Language Processing Techniques