LSTM Recurrent Neural Networks for Cybersecurity Named Entity Recognition
Houssem Gasmi (DISP), Jannik Laval (DISP), Abdelaziz Bouras (DISP)

TL;DR
This paper presents an LSTM-CRF based approach for cybersecurity Named Entity Recognition that outperforms existing methods without requiring domain-specific feature engineering.
Contribution
It introduces a domain-independent LSTM-CRF model for cybersecurity NER that eliminates the need for manual feature engineering and domain expertise.
Findings
Outperforms state-of-the-art methods on cybersecurity NER tasks.
Does not require domain-specific feature engineering.
Effective on a reasonably sized annotated corpus.
Abstract
The automated and timely conversion of cybersecurity information from unstructured online sources, such as blogs and articles to more formal representations has become a necessity for many applications in the domain nowadays. Named Entity Recognition (NER) is one of the early phases towards this goal. It involves the detection of the relevant domain entities, such as product, version, attack name, etc. in technical documents. Although generally considered a simple task in the information extraction field, it is quite challenging in some domains like cybersecurity because of the complex structure of its entities. The state of the art methods require time-consuming and labor intensive feature engineering that describes the properties of the entities, their context, domain knowledge, and linguistic characteristics. The model demonstrated in this paper is domain independent and does not…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Network Security and Intrusion Detection · Natural Language Processing Techniques
