Part Segmentation of Human Meshes via Multi-View Human Parsing

James Dickens; Kamyar Hamad

arXiv:2507.18655·cs.CV·September 3, 2025

Part Segmentation of Human Meshes via Multi-View Human Parsing

James Dickens, Kamyar Hamad

PDF

TL;DR

This paper introduces a geometric method for semantic segmentation of human meshes using multi-view parsing and point cloud deep learning, achieving accurate per-vertex labeling without texture data.

Contribution

It develops a novel pseudo-ground truth labeling pipeline and a memory-efficient sampling strategy for effective human mesh segmentation.

Findings

01

High accuracy in per-vertex human mesh segmentation.

02

Effective pseudo-labeling pipeline for large datasets.

03

Memory-efficient sampling improves computational performance.

Abstract

Recent advances in point cloud deep learning have led to models that achieve high per-part labeling accuracy on large-scale point clouds, using only the raw geometry of unordered point sets. In parallel, the field of human parsing focuses on predicting body part and clothing/accessory labels from images. This work aims to bridge these two domains by enabling per-vertex semantic segmentation of large-scale human meshes. To achieve this, a pseudo-ground truth labeling pipeline is developed for the Thuman2.1 dataset: meshes are first aligned to a canonical pose, segmented from multiple viewpoints, and the resulting point-level labels are then backprojected onto the original mesh to produce per-point pseudo ground truth annotations. Subsequently, a novel, memory-efficient sampling strategy is introduced, a windowed iterative farthest point sampling (FPS) with space-filling curve-based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.