Edge Weight Prediction For Category-Agnostic Pose Estimation

Or Hirschorn; Shai Avidan

arXiv:2411.16665·cs.CV·February 3, 2026

Edge Weight Prediction For Category-Agnostic Pose Estimation

Or Hirschorn, Shai Avidan

PDF

Open Access 1 Repo

TL;DR

EdgeCape introduces a novel approach for category-agnostic pose estimation by predicting edge weights in pose graphs and incorporating structural priors, significantly improving keypoint localization across diverse categories.

Contribution

The paper proposes EdgeCape, a framework that predicts edge weights in pose graphs and integrates Markovian Structural Bias to enhance global spatial dependency modeling.

Findings

01

Achieves state-of-the-art results on MP-100 benchmark in 1-shot setting.

02

Outperforms similar methods in 5-shot pose estimation.

03

Significantly improves keypoint localization accuracy.

Abstract

Category-Agnostic Pose Estimation (CAPE) localizes keypoints across diverse object categories with a single model, using one or a few annotated support images. Recent works have shown that using a pose graph (i.e., treating keypoints as nodes in a graph rather than isolated points) helps handle occlusions and break symmetry. However, these methods assume a static pose graph with equal-weight edges, leading to suboptimal results. We introduce EdgeCape, a novel framework that overcomes these limitations by predicting the graph's edge weights which optimizes localization. To further leverage structural priors, we propose integrating Markovian Structural Bias, which modulates the self-attention interaction between nodes based on the number of hops between them. We show that this improves the model's ability to capture global spatial dependencies. Evaluated on the MP-100 benchmark, which…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

orhir/EdgeCape
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning