Multi-QuAD: Multi-Level Quality-Adaptive Dynamic Network for Reliable Multimodal Classification
Shu Shen, C.L.Philip Chen, and Tong Zhang

TL;DR
Multi-QuAD introduces a dynamic, quality-aware multimodal classification framework that adaptively adjusts network depth and parameters based on sample quality, significantly improving reliability and performance.
Contribution
It proposes a novel quality estimation method and dynamic network mechanisms, including GCND and LGP, for reliable multimodal classification with variable data quality.
Findings
Outperforms state-of-the-art methods on four datasets.
Demonstrates strong adaptability to diverse data quality.
Achieves higher classification accuracy and reliability.
Abstract
Multimodal machine learning has achieved remarkable progress in many scenarios, but its reliability is undermined by varying sample quality. This paper finds that existing reliable multimodal classification methods not only fail to provide robust estimation of data quality, but also lack dynamic networks for sample-specific depth and parameters to achieve reliable inference. To this end, a novel framework for multimodal reliable classification termed \textit{Multi-level Quality-Adaptive Dynamic multimodal network} (Multi-QuAD) is proposed. Multi-QuAD first adopts a novel approach based on noise-free prototypes and a classifier-free design to reliably estimate the quality of each sample at both modality and feature levels. It then achieves sample-specific network depth via the \textbf{\textit{Global Confidence Normalized Depth (GCND)}} mechanism. By normalizing depth across modalities…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications
