Environment-biased Feature Ranking for Novelty Detection Robustness

Stefan Smeu; Elena Burceanu; Emanuela Haller; Andrei Liviu Nicolicioiu

arXiv:2309.12301·cs.LG·October 11, 2023

Environment-biased Feature Ranking for Novelty Detection Robustness

Stefan Smeu, Elena Burceanu, Emanuela Haller, Andrei Liviu Nicolicioiu

PDF

Open Access

TL;DR

This paper introduces a method for robust novelty detection that identifies and removes environment-biased features to improve detection accuracy across different environmental conditions.

Contribution

It proposes a feature ranking technique based on environment variance to enhance novelty detection robustness, addressing spurious correlations.

Findings

01

Improves detection performance by up to 6%

02

Effectively removes environment-biased features

03

Validates on real and synthetic benchmarks

Abstract

We tackle the problem of robust novelty detection, where we aim to detect novelties in terms of semantic content while being invariant to changes in other, irrelevant factors. Specifically, we operate in a setup with multiple environments, where we determine the set of features that are associated more with the environments, rather than to the content relevant for the task. Thus, we propose a method that starts with a pretrained embedding and a multi-env setup and manages to rank the features based on their environment-focus. First, we compute a per-feature score based on the feature distribution variance between envs. Next, we show that by dropping the highly scored ones, we manage to remove spurious correlations and improve the overall performance by up to 6%, both in covariance and sub-population shift cases, both for a real and a synthetic benchmark, that we introduce for this task.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Data-Driven Disease Surveillance · Influenza Virus Research Studies