LGM-Pose: A Lightweight Global Modeling Network for Real-time Human Pose Estimation

Biao Guo; Cong Zhou; Fangmin Guo; Xiaonan Luo; Guibo Luo; Feng Zhang

arXiv:2506.04561·cs.CV·November 14, 2025

LGM-Pose: A Lightweight Global Modeling Network for Real-time Human Pose Estimation

Biao Guo, Cong Zhou, Fangmin Guo, Xiaonan Luo, Guibo Luo, Feng Zhang

PDF

Open Access

TL;DR

LGM-Pose introduces a lightweight, single-branch global modeling network for real-time human pose estimation, effectively capturing global context while reducing parameters and increasing speed.

Contribution

The paper proposes a novel lightweight global modeling network with innovative modules that improve global feature extraction and multi-scale fusion in pose estimation.

Findings

01

Reduces model parameters compared to existing methods.

02

Achieves superior accuracy on COCO and MPII datasets.

03

Provides faster inference speeds.

Abstract

Most of the current top-down multi-person pose estimation lightweight methods are based on multi-branch parallel pure CNN network architecture, which often struggle to capture the global context required for detecting semantically complex keypoints and are hindered by high latency due to their intricate and redundant structures. In this article, an approximate single-branch lightweight global modeling network (LGM-Pose) is proposed to address these challenges. In the network, a lightweight MobileViM Block is designed with a proposed Lightweight Attentional Representation Module (LARM), which integrates information within and between patches using the Non-Parametric Transformation Operation(NPT-Op) to extract global information. Additionally, a novel Shuffle-Integrated Fusion Module (SFusion) is introduced to effectively integrate multi-scale information, mitigating performance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Robot Manipulation and Learning · Human Motion and Animation