Adapting Point Cloud Analysis via Multimodal Bayesian Distribution Learning

Xingyu Zhu; Liang Yi; Shuo Wang; Wenbo Zhu; Yonglinag Wu; Beier Zhu; Hanwang Zhang

arXiv:2603.22070·cs.CV·March 26, 2026

Adapting Point Cloud Analysis via Multimodal Bayesian Distribution Learning

Xingyu Zhu, Liang Yi, Shuo Wang, Wenbo Zhu, Yonglinag Wu, Beier Zhu, Hanwang Zhang

PDF

Open Access

TL;DR

This paper introduces BayesMM, a Bayesian framework for test-time point cloud analysis that models multimodal data as Gaussian distributions, improving robustness under domain shifts without additional training.

Contribution

BayesMM is the first to model multimodal point cloud data with Bayesian distribution learning for online adaptation, addressing cache-based limitations and fusion instability.

Findings

01

Achieves over 4% average improvement on point cloud benchmarks.

02

Maintains robustness under distributional shifts.

03

Automatically adjusts modality contributions via Bayesian model averaging.

Abstract

Multimodal 3D vision-language models show strong generalization across diverse 3D tasks, but their performance still degrades notably under domain shifts. This has motivated recent studies on test-time adaptation (TTA), which enables models to adapt online using test-time data. Among existing TTA methods, cache-based mechanisms are widely adopted for leveraging previously observed samples in online prediction refinement. However, they store only limited historical information, leading to progressive information loss as the test stream evolves. In addition, their prediction logits are fused heuristically, making adaptation unstable. To address these limitations, we propose BayesMM, a Multimodal Bayesian Distribution Learning framework for test-time point cloud analysis. BayesMM models textual priors and streaming visual features of each class as Gaussian distributions: textual parameters…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · Domain Adaptation and Few-Shot Learning · Face recognition and analysis