IDPS Signature Classification with a Reject Option and the Incorporation of Expert Knowledge
Hidetoshi Kawaguchi, Yuichi Nakatani, Shogo Okada

TL;DR
This paper presents a machine learning model with a reject option for classifying IDPS signatures, incorporating expert knowledge and web-derived features to improve classification accuracy and reduce setup costs.
Contribution
It introduces a novel feature design combining symbolic, keyword, and web-based message features, and integrates a reject option to minimize misclassification in IDPS signature classification.
Findings
Combined SFs and WMFs outperform SFs and KFs.
The reject option reduces critical misclassification.
Web features improve classification of non-rule-matching signatures.
Abstract
As the importance of intrusion detection and prevention systems (IDPSs) increases, great costs are incurred to manage the signatures that are generated by malicious communication pattern files. Experts in network security need to classify signatures by importance for an IDPS to work. We propose and evaluate a machine learning signature classification model with a reject option (RO) to reduce the cost of setting up an IDPS. To train the proposed model, it is essential to design features that are effective for signature classification. Experts classify signatures with predefined if-then rules. An if-then rule returns a label of low, medium, high, or unknown importance based on keyword matching of the elements in the signature. Therefore, we first design two types of features, symbolic features (SFs) and keyword features (KFs), which are used in keyword matching for the if-then rules.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpam and Phishing Detection · Network Security and Intrusion Detection · Text and Document Classification Technologies
