Joint Enhancement and Classification using Coupled Diffusion Models of Signals and Logits

Gilad Nurko; Roi Benita; Yehoshua Dissen; Tomohiro Nakatani; Marc Delcroix; Shoko Araki; Joseph Keshet

arXiv:2602.15405·cs.LG·May 19, 2026

Joint Enhancement and Classification using Coupled Diffusion Models of Signals and Logits

Gilad Nurko, Roi Benita, Yehoshua Dissen, Tomohiro Nakatani, Marc Delcroix, Shoko Araki, Joseph Keshet

PDF

TL;DR

This paper introduces a novel coupled diffusion model framework that jointly enhances signals and classifies in noisy environments, leveraging mutual guidance between signal and logits without retraining classifiers.

Contribution

It presents a domain-agnostic, coupled diffusion approach that integrates signal enhancement and classification, improving robustness without classifier retraining.

Findings

01

Outperforms traditional sequential enhancement methods in noisy conditions.

02

Improves classification accuracy in image and speech tasks under noise.

03

Enables mutual guidance between signal and logits without retraining classifiers.

Abstract

Robust classification in noisy environments remains a fundamental challenge in machine learning. Standard approaches typically treat signal enhancement and classification as separate, sequential stages: first enhancing the signal and then applying a classifier. This approach fails to leverage the semantic information in the classifier's output during denoising. In this work, we propose a general, domain-agnostic framework that integrates two interacting diffusion models: one operating on the input signal and the other on the classifier's output logits, without requiring any retraining or fine-tuning of the classifier. This coupled formulation enables mutual guidance, where the enhancing signal refines the class estimation and, conversely, the evolving class logits guide the signal reconstruction towards discriminative regions of the manifold. We introduce three strategies to effectively…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Generative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning