DenoiseRep: Denoising Model for Representation Learning

Zhengrui Xu; Guan'an Wang; Xiaowen Huang; Jitao Sang

arXiv:2406.08773·cs.CV·November 6, 2024

DenoiseRep: Denoising Model for Representation Learning

Zhengrui Xu, Guan'an Wang, Xiaowen Huang, Jitao Sang

PDF

Open Access 1 Repo 1 Video

TL;DR

DenoiseRep introduces a novel denoising model that enhances feature discrimination in representation learning by jointly extracting and denoising features, improving performance across various vision tasks and architectures.

Contribution

The paper proposes DenoiseRep, a unified framework that combines feature extraction and denoising, with a theoretical proof of its equivalence and computation-free denoising, applicable to multiple architectures.

Findings

01

Impressive improvements on vision tasks like re-identification, classification, detection, and segmentation.

02

Effective on CNN and Transformer architectures, including ResNet, ViT, Swin, and Vmamda.

03

Demonstrates stability and robustness across diverse datasets and tasks.

Abstract

The denoising model has been proven a powerful generative model but has little exploration of discriminative tasks. Representation learning is important in discriminative tasks, which is defined as "learning representations (or features) of the data that make it easier to extract useful information when building classifiers or other predictors". In this paper, we propose a novel Denoising Model for Representation Learning (DenoiseRep) to improve feature discrimination with joint feature extraction and denoising. DenoiseRep views each embedding layer in a backbone as a denoising layer, processing the cascaded embedding layers as if we are recursively denoise features step-by-step. This unifies the frameworks of feature extraction and denoising, where the former progressively embeds features from low-level to high-level, and the latter recursively denoises features step-by-step. After…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wangguanan/denoiserep
pytorchOfficial

Videos

DenoiseRep: Denoising Model for Representation Learning· slideslive

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Face recognition and analysis · Gait Recognition and Analysis

MethodsAttention Is All You Need · Linear Layer · Dense Connections · Label Smoothing · Layer Normalization · Residual Connection · Byte Pair Encoding · Absolute Position Encodings · Multi-Head Attention · Softmax