FocalComm: Hard Instance-Aware Multi-Agent Perception
Dereje Shenkut, Vijayakumar Bhagavatula

TL;DR
FocalComm is a collaborative perception framework that enhances safety-critical object detection in autonomous driving by focusing on hard instances and efficiently exchanging salient features among agents.
Contribution
It introduces a novel hard instance mining module and a query-based feature fusion technique, improving detection of small, safety-critical objects over existing methods.
Findings
Outperforms state-of-the-art methods on V2X-Real and DAIR-V2X datasets.
Significantly improves pedestrian detection accuracy.
Effective in both vehicle-centric and infrastructure-centric setups.
Abstract
Multi-agent collaborative perception (CP) is a promising paradigm for improving autonomous driving safety, particularly for vulnerable road users like pedestrians, via robust 3D perception. However, existing CP approaches often optimize for vehicle detection performance metrics, underperforming on smaller, safety-critical objects such as pedestrians, where detection failures can be catastrophic. Furthermore, previous CP methods rely on full feature exchange rather than communicating only salient features that help reduce false negatives. To this end, we present FocalComm, a novel collaborative perception framework that focuses on exchanging hard-instance-oriented features among connected collaborative agents. FocalComm consists of two key novel designs: (1) a learnable progressive hard instance mining (HIM) module to extract hard instance-oriented features per agent, and (2) a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Autonomous Vehicle Technology and Safety · Adversarial Robustness in Machine Learning
