Smart Feature is What You Need
Zhaoxin Hu, Keyan Ren

TL;DR
This paper introduces MMA, a novel point cloud feature network that enhances weakly-supervised 3D object detection by leveraging shape and object inference, significantly improving performance and robustness.
Contribution
The paper proposes MMA, a new feature representation network that combines adjacency and disparity attention to improve weakly-supervised 3D detection accuracy.
Findings
MMA improves weakly-supervised detection to approach fully-supervised performance.
MMA effectively mitigates label jitter, turning it into a data enhancement source.
Extensive experiments validate MMA's effectiveness in indoor scenarios.
Abstract
Lack of shape guidance and label jitter caused by information deficiency of weak label are the main problems in 3D weakly-supervised object detection. Current weakly-supervised models often use heuristics or assumptions methods to infer information from weak labels without taking advantage of the inherent clues of weakly-supervised and fully-supervised methods, thus it is difficult to explore a method that combines data utilization efficiency and model accuracy. In an attempt to address these issues, we propose a novel plug-and-in point cloud feature representation network called Multi-scale Mixed Attention (MMA). MMA utilizes adjacency attention within neighborhoods and disparity attention at different density scales to build a feature representation network. The smart feature representation obtained from MMA has shape tendency and object existence area inference, which can constrain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies
MethodsSoftmax · Attention Is All You Need
