RAUM-Net: Regional Attention and Uncertainty-aware Mamba Network

Mingquan Liu

arXiv:2506.21905·cs.CV·June 30, 2025

RAUM-Net: Regional Attention and Uncertainty-aware Mamba Network

Mingquan Liu

PDF

Open Access

TL;DR

RAUM-Net is a semi-supervised approach for fine-grained visual categorization that combines regional attention, Bayesian uncertainty, and Mamba-based feature modeling to improve robustness with limited labeled data.

Contribution

It introduces a novel semi-supervised method integrating regional attention, Bayesian uncertainty, and Mamba networks for enhanced FGVC performance.

Findings

01

Strong performance on FGVC benchmarks with occlusions.

02

Robustness when labeled data is limited.

03

Effective pseudo label selection via Bayesian inference.

Abstract

Fine Grained Visual Categorization (FGVC) remains a challenging task in computer vision due to subtle inter class differences and fragile feature representations. Existing methods struggle in fine grained scenarios, especially when labeled data is scarce. We propose a semi supervised method combining Mamba based feature modeling, region attention, and Bayesian uncertainty. Our approach enhances local to global feature modeling while focusing on key areas during learning. Bayesian inference selects high quality pseudo labels for stability. Experiments show strong performance on FGVC benchmarks with occlusions, demonstrating robustness when labeled data is limited. Code is available at https://github.com/wxqnl/RAUM Net.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques