Reproducing Random Forest Efficacy in Detecting Port Scanning
Jason M. Pittman

TL;DR
This paper reproduces six recent studies on using random forest algorithms for port scan detection, addressing inconsistencies and lack of source code to improve reliability and reproducibility in cybersecurity research.
Contribution
It systematically reproduces and evaluates six recent random forest port scan detection studies, standardizing datasets and providing source code to enhance reproducibility.
Findings
Reproduced results consistent with original studies
Identified variability in study outcomes and datasets
Provided open-source implementation for future research
Abstract
Port scanning is the process of attempting to connect to various network ports on a computing endpoint to determine which ports are open and which services are running on them. It is a common method used by hackers to identify vulnerabilities in a network or system. By determining which ports are open, an attacker can identify which services and applications are running on a device and potentially exploit any known vulnerabilities in those services. Consequently, it is important to detect port scanning because it is often the first step in a cyber attack. By identifying port scanning attempts, cybersecurity professionals can take proactive measures to protect the systems and networks before an attacker has a chance to exploit any vulnerabilities. Against this background, researchers have worked for over a decade to develop robust methods to detect port scanning. One such method revealed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Security and Intrusion Detection · Advanced Malware Detection Techniques · Digital and Cyber Forensics
