Cocoon: Robust Multi-Modal Perception with Uncertainty-Aware Sensor Fusion
Minkyoung Cho, Yulong Cao, Jiachen Sun, Qingzhao Zhang, Marco Pavone,, Jeong Joon Park, Heng Yang, Z. Morley Mao

TL;DR
Cocoon is a novel multi-modal perception framework that uses uncertainty-aware sensor fusion at the object and feature levels, improving 3D detection accuracy in normal and challenging conditions.
Contribution
It introduces an uncertainty quantification method for heterogeneous sensor data, with a feature aligner and surrogate ground truth to enable adaptive fusion.
Findings
Outperforms existing methods in normal and challenging conditions
Effective uncertainty metric validated across datasets
Enhances robustness in long-tail scenarios
Abstract
An important paradigm in 3D object detection is the use of multiple modalities to enhance accuracy in both normal and challenging conditions, particularly for long-tail scenarios. To address this, recent studies have explored two directions of adaptive approaches: MoE-based adaptive fusion, which struggles with uncertainties arising from distinct object configurations, and late fusion for output-level adaptive fusion, which relies on separate detection pipelines and limits comprehensive understanding. In this work, we introduce Cocoon, an object- and feature-level uncertainty-aware fusion framework. The key innovation lies in uncertainty quantification for heterogeneous representations, enabling fair comparison across modalities through the introduction of a feature aligner and a learnable surrogate ground truth, termed feature impression. We also define a training objective to ensure…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Industrial Vision Systems and Defect Detection · Fault Detection and Control Systems
