TL;DR
HeroCrystal is a privacy-preserving multi-camera object detection framework that synthesizes target domain data, employs federated learning, and fuses heterogeneous models to improve detection accuracy while respecting privacy constraints.
Contribution
It introduces a diffusion-based synthetic data generation, federated model fusion for heterogeneous architectures, and an algorithm for integrating inconsistent categories across clients.
Findings
Outperforms existing privacy-preserving detection methods by +2.1% mAP.
Achieves a new state-of-the-art mAP of 33.4% on cross-domain benchmarks.
Effectively handles data privacy, class imbalance, and architecture heterogeneity.
Abstract
We propose HeroCrystal, a novel privacy-preserving framework for multi-camera domain-adaptive object detection, addressing challenges such as data privacy, class imbalance, and heterogeneous architectures. Our framework consists of three key stages. In the Generated Stage, we introduce a one-shot, target-aware diffusion-based generation module that learns visual style from a single target-domain image while leveraging prompt-based control to synthesize specific object instances. Unlike conventional style transfer-based methods that require large target datasets and ignore semantic-level discrepancies, our approach enables privacy-preserving augmentation to reduce ethical concerns, and introduces controllable rare object generation to mitigate long-tailed category degradation. In the Federated Stage, we employ probabilistic Faster R-CNN on the client side to improve localization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
