LMPNet for Weakly-supervised Keypoint Discovery

Pei Guo; Ryan Farrell

arXiv:2507.02308·cs.CV·July 4, 2025

LMPNet for Weakly-supervised Keypoint Discovery

Pei Guo, Ryan Farrell

PDF

TL;DR

LMPNet introduces a weakly-supervised approach for semantic object keypoint discovery using a novel leaky max pooling layer to encourage meaningful filter activations, achieving accuracy comparable to supervised methods.

Contribution

The paper proposes LMPNet, a novel weakly-supervised method that transforms intermediate filters into keypoint detectors without requiring detailed annotations.

Findings

01

LMPNet automatically discovers semantic keypoints.

02

It achieves prediction accuracy comparable to supervised models.

03

The method is robust to object pose variations.

Abstract

In this work, we explore the task of semantic object keypoint discovery weakly-supervised by only category labels. This is achieved by transforming discriminatively-trained intermediate layer filters into keypoint detectors. We begin by identifying three preferred characteristics of keypoint detectors: (i) spatially sparse activations, (ii) consistency and (iii) diversity. Instead of relying on hand-crafted loss terms, a novel computationally-efficient leaky max pooling (LMP) layer is proposed to explicitly encourage final conv-layer filters to learn "non-repeatable local patterns" that are well aligned with object keypoints. Informed by visualizations, a simple yet effective selection strategy is proposed to ensure consistent filter activations and attention mask-out is then applied to force the network to distribute its attention to the whole object instead of just the most…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsMax Pooling