A Mallows-like Criterion for Anomaly Detection with Random Forest   Implementation

Gaoxiang Zhao; Lu Wang; Xiaoqiang Wang

arXiv:2405.18932·stat.ML·May 30, 2024

A Mallows-like Criterion for Anomaly Detection with Random Forest Implementation

Gaoxiang Zhao, Lu Wang, Xiaoqiang Wang

PDF

Open Access

TL;DR

This paper introduces a novel model averaging criterion integrated into Random Forests, enhancing anomaly detection accuracy and robustness on diverse benchmark datasets, especially for imbalanced data scenarios.

Contribution

It proposes a new weighting criterion for model aggregation using focal loss, specifically tailored for anomaly detection within Random Forests, improving performance over traditional methods.

Findings

01

Outperforms traditional model averaging techniques.

02

Achieves higher accuracy in anomaly detection tasks.

03

Demonstrates robustness across various benchmark datasets.

Abstract

The effectiveness of anomaly signal detection can be significantly undermined by the inherent uncertainty of relying on one specified model. Under the framework of model average methods, this paper proposes a novel criterion to select the weights on aggregation of multiple models, wherein the focal loss function accounts for the classification of extremely imbalanced data. This strategy is further integrated into Random Forest algorithm by replacing the conventional voting method. We have evaluated the proposed method on benchmark datasets across various domains, including network intrusion. The findings indicate that our proposed method not only surpasses the model averaging with typical loss functions but also outstrips common anomaly detection algorithms in terms of accuracy and robustness.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Artificial Immune Systems Applications · Network Security and Intrusion Detection

MethodsFocal Loss