PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences

Hehe Fan; Xin Yu; Yuhang Ding; Yi Yang; Mohan Kankanhalli

arXiv:2205.13713·cs.CV·May 30, 2022·69 cites

PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences

Hehe Fan, Xin Yu, Yuhang Ding, Yi Yang, Mohan Kankanhalli

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces PSTNet, a novel deep learning architecture that employs point spatio-temporal convolution to effectively model and analyze irregular point cloud sequences for tasks like 3D action recognition and semantic segmentation.

Contribution

The paper proposes a new PST convolution that disentangles spatial and temporal features in point cloud sequences, enabling hierarchical feature extraction with improved modeling of 3D dynamics.

Findings

01

PSTNet outperforms existing methods on 3D action recognition datasets.

02

PSTNet achieves superior results in 4D semantic segmentation tasks.

03

The proposed method effectively captures local spatial structures and temporal dynamics.

Abstract

Point cloud sequences are irregular and unordered in the spatial dimension while exhibiting regularities and order in the temporal dimension. Therefore, existing grid based convolutions for conventional video processing cannot be directly applied to spatio-temporal modeling of raw point cloud sequences. In this paper, we propose a point spatio-temporal (PST) convolution to achieve informative representations of point cloud sequences. The proposed PST convolution first disentangles space and time in point cloud sequences. Then, a spatial convolution is employed to capture the local structure of points in the 3D space, and a temporal convolution is used to model the dynamics of the spatial regions along the time dimension. Furthermore, we incorporate the proposed PST convolution into a deep network, namely PSTNet, to extract features of point cloud sequences in a hierarchical manner.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hehefan/Point-Spatio-Temporal-Convolution
pytorchOfficial

Videos

PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences· slideslive

Taxonomy

TopicsHuman Pose and Action Recognition · 3D Shape Modeling and Analysis · Human Motion and Animation

MethodsConvolution