SEAL-pose: Enhancing 3D Human Pose Estimation via a Learned Loss for Structural Consistency
Yeonsung Kim, Junggeun Do, Seunguk Do, Sangmin Kim, Jaesik Park, Jay-Yoon Lee

TL;DR
SEAL-pose introduces a learnable, data-driven loss function that enhances 3D human pose estimation by capturing complex structural dependencies, outperforming traditional methods with manual priors.
Contribution
It proposes a novel learnable loss-net that learns structural dependencies directly from data, improving 3D human pose estimation without manual priors.
Findings
Reduces per-joint errors across benchmarks
Outperforms models with explicit structural constraints
Effective in cross-dataset and in-the-wild scenarios
Abstract
3D human pose estimation (HPE) is characterized by intricate local and global dependencies among joints. Conventional supervised losses are limited in capturing these correlations because they treat each joint independently. Previous studies have attempted to promote structural consistency through manually designed priors or rule-based constraints; however, these approaches typically require manual specification and are often non-differentiable, limiting their use as end-to-end training objectives. We propose SEAL-pose, a data-driven framework in which a learnable loss-net trains a pose-net by evaluating structural plausibility. Rather than relying on hand-crafted priors, our joint-graph-based design enables the loss-net to learn complex structural dependencies directly from data. Extensive experiments on three 3D HPE benchmarks with eight backbones show that SEAL-pose reduces per-joint…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Human Motion and Animation · Robot Manipulation and Learning
