# Reasoning action-centric temporal relations at rich feature hierarchies for action recognition

**Authors:** Manshu Liang, Wenbin Wu, Zhuolei Chen, Tengfei Han, Yuan Zheng, Hikmat Ullah Khan, Hikmat Ullah Khan, Hikmat Ullah Khan

PMC · DOI: 10.1371/journal.pone.0327302 · PLOS One · 2025-07-24

## TL;DR

This paper introduces a new method for action recognition by focusing on the temporal relations between action-related regions in videos.

## Contribution

The novel Action-centric Temporal-relational Reasoning (ATR) block enables hierarchical and efficient temporal reasoning for action recognition.

## Key findings

- The ATR block improves performance on multiple action recognition benchmarks.
- The method introduces minor computational overhead while enhancing hierarchical reasoning.
- The approach shows consistent improvements over strong baseline models.

## Abstract

Reasoning temporal relations among action-related objects plays an important role in action recognition. However, previous approaches only focus the reasoning on high-level semantics and inevitably involve the background in reasoning. In this work, we propose to formulate the temporal relational reasoning in an action-centric and hierarchical style, with a novel Action-centric Temporal-relational Reasoning (ATR) block. Specifically, ATR comprises two components: an Action-related Region Locator (ARL) to locate the action-related regions via estimating the actionness, and an Efficient Action-centric Reasoner (EAR) to efficiently reason the temporal relations between the located regions so as to realize the action-centric reasoning. Thanks to its flexible and efficient designs, our ATR can be directly integrated into existing action recognition models at different depths, empowering the hierarchical reasoning on the action-centric temporal relations at the cost of minor computational overhead. We extensively evaluate our ATR block on three action recognition benchmarks, Something-Something V1, V2, and Kinetics, with the backbones of TSN, TSM, and SlowOnly. The consistent and notable improvements over the strong baselines sufficiently corroborate the effectiveness of ATR, along with the action-centric and hierarchical formulation for temporal relational reasoning. Our proposed approach provides potential practical significance for real-world scenarios.

## Full-text entities

- **Chemicals:** TSM (-)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12288993/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12288993/full.md

## References

57 references — full list in the complete paper: https://tomesphere.com/paper/PMC12288993/full.md

---
Source: https://tomesphere.com/paper/PMC12288993