Persistent-Transient Duality: A Multi-mechanism Approach for Modeling Human-Object Interaction
Hung Tran, Vuong Le, Svetha Venkatesh, Truyen Tran

TL;DR
This paper introduces a dual-mechanism neural network model that captures both continuous global plans and intermittent local actions in human-object interactions, improving motion forecasting accuracy.
Contribution
It proposes a novel Persistent-Transient Duality framework with dedicated neural modules for dynamic switching, addressing limitations of previous morphing structure models.
Findings
Outperforms existing models on two rich datasets
Effectively captures mode-switching in human motion
Demonstrates superior motion forecasting accuracy
Abstract
Humans are highly adaptable, swiftly switching between different modes to progressively handle different tasks, situations and contexts. In Human-object interaction (HOI) activities, these modes can be attributed to two mechanisms: (1) the large-scale consistent plan for the whole activity and (2) the small-scale children interactive actions that start and end along the timeline. While neuroscience and cognitive science have confirmed this multi-mechanism nature of human behavior, machine modeling approaches for human motion are trailing behind. While attempted to use gradually morphing structures (e.g., graph attention networks) to model the dynamic HOI patterns, they miss the expeditious and discrete mode-switching nature of the human motion. To bridge that gap, this work proposes to model two concurrent mechanisms that jointly control human motion: the Persistent process that runs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Persistent-Transient Duality: A Multi-Mechanism Approach for Modeling Human-Object Interaction· youtube
Taxonomy
TopicsEEG and Brain-Computer Interfaces · Functional Brain Connectivity Studies · Human Pose and Action Recognition
