DCHM: Depth-Consistent Human Modeling for Multiview Detection

Jiahao Ma; Tianyu Wang; Miaomiao Liu; David Ahmedt-Aristizabal; Chuong Nguyen

arXiv:2507.14505·cs.CV·July 22, 2025

DCHM: Depth-Consistent Human Modeling for Multiview Detection

Jiahao Ma, Tianyu Wang, Miaomiao Liu, David Ahmedt-Aristizabal, Chuong Nguyen

PDF

TL;DR

DCHM introduces a novel depth-consistent human modeling framework that enhances multiview pedestrian detection accuracy by producing precise, noise-reduced 3D human representations without relying on human-labeled annotations.

Contribution

The paper presents DCHM, a new method for multiview human modeling that achieves depth consistency and accurate pedestrian localization in complex scenarios without human annotations.

Findings

01

Reduces noise in human modeling compared to previous methods.

02

Outperforms state-of-the-art baselines in multiview pedestrian detection.

03

First to reconstruct pedestrians and perform multiview segmentation in challenging environments.

Abstract

Multiview pedestrian detection typically involves two stages: human modeling and pedestrian localization. Human modeling represents pedestrians in 3D space by fusing multiview information, making its quality crucial for detection accuracy. However, existing methods often introduce noise and have low precision. While some approaches reduce noise by fitting on costly multiview 3D annotations, they often struggle to generalize across diverse scenes. To eliminate reliance on human-labeled annotations and accurately model humans, we propose Depth-Consistent Human Modeling (DCHM), a framework designed for consistent depth estimation and multiview fusion in global coordinates. Specifically, our proposed pipeline with superpixel-wise Gaussian Splatting achieves multiview depth consistency in sparse-view, large-scaled, and crowded scenarios, producing precise point clouds for pedestrian…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.