Distribution and volume based scoring for Isolation Forests
Hichem Dhouib, Alissa Wilms, Paul Boes

TL;DR
This paper enhances the Isolation Forest algorithm for anomaly detection by introducing a distribution-aware scoring method and a hyper-volume based scoring at the tree level, leading to improved detection performance.
Contribution
It proposes a novel information-theoretic generalization of the score function and an alternative hyper-volume based scoring method for Isolation Forests.
Findings
Significant improvement over standard Isolation Forest on some datasets.
Average performance boost across all datasets for one proposed variant.
Code is publicly available for reproducibility.
Abstract
We make two contributions to the Isolation Forest method for anomaly and outlier detection. The first contribution is an information-theoretically motivated generalisation of the score function that is used to aggregate the scores across random tree estimators. This generalisation allows one to take into account not just the ensemble average across trees but instead the whole distribution. The second contribution is an alternative scoring function at the level of the individual tree estimator, in which we replace the depth-based scoring of the Isolation Forest with one based on hyper-volumes associated to an isolation tree's leaf nodes. We motivate the use of both of these methods on generated data and also evaluate them on 34 datasets from the recent and exhaustive ``ADBench'' benchmark, finding significant improvement over the standard isolation forest for both variants on some…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Advanced Statistical Methods and Models · Network Security and Intrusion Detection
