Edge Weight Prediction For Category-Agnostic Pose Estimation
Or Hirschorn, Shai Avidan

TL;DR
EdgeCape introduces a novel approach for category-agnostic pose estimation by predicting edge weights in pose graphs and incorporating structural priors, significantly improving keypoint localization across diverse categories.
Contribution
The paper proposes EdgeCape, a framework that predicts edge weights in pose graphs and integrates Markovian Structural Bias to enhance global spatial dependency modeling.
Findings
Achieves state-of-the-art results on MP-100 benchmark in 1-shot setting.
Outperforms similar methods in 5-shot pose estimation.
Significantly improves keypoint localization accuracy.
Abstract
Category-Agnostic Pose Estimation (CAPE) localizes keypoints across diverse object categories with a single model, using one or a few annotated support images. Recent works have shown that using a pose graph (i.e., treating keypoints as nodes in a graph rather than isolated points) helps handle occlusions and break symmetry. However, these methods assume a static pose graph with equal-weight edges, leading to suboptimal results. We introduce EdgeCape, a novel framework that overcomes these limitations by predicting the graph's edge weights which optimizes localization. To further leverage structural priors, we propose integrating Markovian Structural Bias, which modulates the self-attention interaction between nodes based on the number of hops between them. We show that this improves the model's ability to capture global spatial dependencies. Evaluated on the MP-100 benchmark, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning
