RFNet-4D++: Joint Object Reconstruction and Flow Estimation from 4D   Point Clouds with Cross-Attention Spatio-Temporal Features

Tuan-Anh Vu; Duc Thanh Nguyen; Binh-Son Hua; Quang-Hieu Pham and; Sai-Kit Yeung

arXiv:2203.16482·cs.CV·October 18, 2023·1 cites

RFNet-4D++: Joint Object Reconstruction and Flow Estimation from 4D Point Clouds with Cross-Attention Spatio-Temporal Features

Tuan-Anh Vu, Duc Thanh Nguyen, Binh-Son Hua, Quang-Hieu Pham and, Sai-Kit Yeung

PDF

Open Access 1 Repo

TL;DR

RFNet-4D++ is a novel neural network architecture that jointly reconstructs 3D objects and estimates their motion flows from 4D point cloud sequences, leveraging cross-attention spatio-temporal features for improved accuracy and efficiency.

Contribution

The paper introduces RFNet-4D++, a new joint learning framework for object reconstruction and flow estimation from 4D point clouds, utilizing cross-attention mechanisms and an unsupervised flow learning module.

Findings

01

Achieves state-of-the-art results in object reconstruction and flow estimation.

02

Runs significantly faster than existing methods during training and inference.

03

Validated on benchmark datasets with extensive experiments.

Abstract

Object reconstruction from 3D point clouds has been a long-standing research problem in computer vision and computer graphics, and achieved impressive progress. However, reconstruction from time-varying point clouds (a.k.a. 4D point clouds) is generally overlooked. In this paper, we propose a new network architecture, namely RFNet-4D++, that jointly reconstructs objects and their motion flows from 4D point clouds. The key insight is simultaneously performing both tasks via learning of spatial and temporal features from a sequence of point clouds can leverage individual tasks, leading to improved overall performance. To prove this ability, we design a temporal vector field learning module using an unsupervised learning approach for flow estimation task, leveraged by supervised learning of spatial structures for object reconstruction. Extensive experiments and analyses on benchmark…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hkust-vgd/rfnet-4d
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Advanced Neural Network Applications · Advanced Vision and Imaging