Dissolving Is Amplifying: Towards Fine-Grained Anomaly Detection

Jian Shi; Pengyi Zhang; Ni Zhang; Hakim Ghazzai; Peter Wonka

arXiv:2302.14696·cs.CV·July 9, 2024·1 cites

Dissolving Is Amplifying: Towards Fine-Grained Anomaly Detection

Jian Shi, Pengyi Zhang, Ni Zhang, Hakim Ghazzai, Peter Wonka

PDF

Open Access 1 Repo 4 Reviews

TL;DR

This paper introduces DIA, a novel framework that uses dissolving transformations and contrastive learning to enhance fine-grained anomaly detection in medical images, achieving state-of-the-art results.

Contribution

The paper proposes a new dissolving is amplifying framework that improves fine-grained anomaly detection by removing and emphasizing subtle features in medical images.

Findings

01

Achieves around 18.40% AUC boost over baseline methods.

02

Outperforms existing benchmark methods in fine-grained anomaly detection.

03

Provides a self-supervised approach for semantic representation learning.

Abstract

Medical imaging often contains critical fine-grained features, such as tumors or hemorrhages, crucial for diagnosis yet potentially too subtle for detection with conventional methods. In this paper, we introduce \textit{DIA}, dissolving is amplifying. DIA is a fine-grained anomaly detection framework for medical images. First, we introduce \textit{dissolving transformations}. We employ diffusion with a generative diffusion model as a dedicated feature-aware denoiser. Applying diffusion to medical images in a certain manner can remove or diminish fine-grained discriminative features. Second, we introduce an \textit{amplifying framework} based on contrastive learning to learn a semantically meaningful representation of medical images in a self-supervised manner, with a focus on fine-grained features. The amplifying framework contrasts additional pairs of images with and without dissolving…

Peer Reviews

Decision·ICLR 2024 Conference Withdrawn Submission

Reviewer 01Rating 5· marginally below the acceptance thresholdConfidence 4

Strengths

The problem setting of learning fine-grained subtleness for anomaly detection is of significant clinical potential for medical imaging applications. This paper is well-written with overall sufficient clarity. The insight that diffusion models tend to remove fine-grained discriminative features is interesting. The idea of taking the input images for further reverse diffusions is a bit surpring at first sight but it is reasonable if we assume diffusion models pull reversed samples towards the

Weaknesses

A very critical sanity check is needed to validate the dissolving transforms: Would simple gaussian smoothings and/or additive noises on the image space or shallow-layer feature space lead to similar “dissolving” effect as with diffusion models? The general idea is still to remove fine-grained subtleness by smoothing or altering the content. But if simple gaussian smoothing/additive noises can reach similar effects, there would be no need to bother training a diffusion model. This is the major r

Reviewer 02Rating 5· marginally below the acceptance thresholdConfidence 4

Strengths

The proposed idea of employing the reverse process of a generative diffusion model to remove fine-grained features from the original images and generate negative pairs for contrastive learning is novel and sound.

Weaknesses

- The theoretical basis for the dissolving transformation through the reverse diffusion process to remove fine-grained features is not well explained, while visual validation is partly provided in Figure 1. - The authors specifically target anomaly detection for medical images throughout the manuscript, but it is not clear whether and why the proposed method is better suited for medical images, but not for other domains. - The previous methods compared in the experimental section primarily fo

Reviewer 03Rating 3· reject, not good enoughConfidence 5

Strengths

(1) The combination of diffusion models and contrastive learning is quite interesting. Instead of using the diffusion model for image reconstruction, adopting a diffusion model for noise injection is interesting. (2) The paper is easy to follow. (3) Experiments are conducted on six medical image datasets from different imaging modalities and organs.

Weaknesses

(1) The paper employs a diffusion model to introduce noise into the data. What advantages does this approach offer compared to the straightforward addition of random noise to images? Is there any experimentation or ablation study that demonstrates the benefits of using the diffusion model for noise injection? (2) There is a need for a more in-depth discussion of the proposed method. What are the underlying reasons for the effectiveness of the model? A deeper exploration of the working principle

Reviewer 04Rating 5· marginally below the acceptance thresholdConfidence 4

Strengths

1. This paper focuses on a specific application with reasonable motivations and insights. 2. Experiments show the effectiveness of the proposed method.

Weaknesses

1. This paper is somewhat incremental and lacks novelty. This paper proposes to use diffusion model as a data augmentation tool to provide fine-grained data. The following method is highly similar with CSI, an existing method. To offer a clearer perspective, the authors should provide a detailed comparison with CSI, highlighting the key distinctions between these two approaches. Does the contribution come from the proposed augmentation? 2. The concept presented in this paper has been explored ex

Code & Models

Repositories

shijianjian/dia
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · COVID-19 diagnosis using AI · Anomaly Detection Techniques and Applications

MethodsContrastive Learning · Diffusion