Joint Enhancement and Classification using Coupled Diffusion Models of Signals and Logits
Gilad Nurko, Roi Benita, Yehoshua Dissen, Tomohiro Nakatani, Marc Delcroix, Shoko Araki, Joseph Keshet

TL;DR
This paper introduces a novel coupled diffusion model framework that jointly enhances signals and classifies in noisy environments, leveraging mutual guidance between signal and logits without retraining classifiers.
Contribution
It presents a domain-agnostic, coupled diffusion approach that integrates signal enhancement and classification, improving robustness without classifier retraining.
Findings
Outperforms traditional sequential enhancement methods in noisy conditions.
Improves classification accuracy in image and speech tasks under noise.
Enables mutual guidance between signal and logits without retraining classifiers.
Abstract
Robust classification in noisy environments remains a fundamental challenge in machine learning. Standard approaches typically treat signal enhancement and classification as separate, sequential stages: first enhancing the signal and then applying a classifier. This approach fails to leverage the semantic information in the classifier's output during denoising. In this work, we propose a general, domain-agnostic framework that integrates two interacting diffusion models: one operating on the input signal and the other on the classifier's output logits, without requiring any retraining or fine-tuning of the classifier. This coupled formulation enables mutual guidance, where the enhancing signal refines the class estimation and, conversely, the evolving class logits guide the signal reconstruction towards discriminative regions of the manifold. We introduce three strategies to effectively…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Generative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning
