MM-PCQA: Multi-Modal Learning for No-reference Point Cloud Quality Assessment
Zicheng Zhang, Wei Sun, Xiongkuo Min, Quan Zhou, Jun He, Qiyuan Wang,, and Guangtao Zhai

TL;DR
This paper introduces a multi-modal no-reference point cloud quality assessment method that combines local geometry analysis and texture features from 2D projections, outperforming existing techniques.
Contribution
It proposes a novel multi-modal NR-PCQA approach using sub-models and image projections with cross-modal attention for improved quality assessment.
Findings
Outperforms state-of-the-art NR-PCQA methods
Effectively combines geometry and texture information
Demonstrates superior accuracy on benchmark datasets
Abstract
The visual quality of point clouds has been greatly emphasized since the ever-increasing 3D vision applications are expected to provide cost-effective and high-quality experiences for users. Looking back on the development of point cloud quality assessment (PCQA) methods, the visual quality is usually evaluated by utilizing single-modal information, i.e., either extracted from the 2D projections or 3D point cloud. The 2D projections contain rich texture and semantic information but are highly dependent on viewpoints, while the 3D point clouds are more sensitive to geometry distortions and invariant to viewpoints. Therefore, to leverage the advantages of both point cloud and projected image modalities, we propose a novel no-reference point cloud quality assessment (NR-PCQA) metric in a multi-modal fashion. In specific, we split the point clouds into sub-models to represent local geometry…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · 3D Surveying and Cultural Heritage · Infrared Thermography in Medicine
