RFNet-4D++: Joint Object Reconstruction and Flow Estimation from 4D Point Clouds with Cross-Attention Spatio-Temporal Features
Tuan-Anh Vu, Duc Thanh Nguyen, Binh-Son Hua, Quang-Hieu Pham and, Sai-Kit Yeung

TL;DR
RFNet-4D++ is a novel neural network architecture that jointly reconstructs 3D objects and estimates their motion flows from 4D point cloud sequences, leveraging cross-attention spatio-temporal features for improved accuracy and efficiency.
Contribution
The paper introduces RFNet-4D++, a new joint learning framework for object reconstruction and flow estimation from 4D point clouds, utilizing cross-attention mechanisms and an unsupervised flow learning module.
Findings
Achieves state-of-the-art results in object reconstruction and flow estimation.
Runs significantly faster than existing methods during training and inference.
Validated on benchmark datasets with extensive experiments.
Abstract
Object reconstruction from 3D point clouds has been a long-standing research problem in computer vision and computer graphics, and achieved impressive progress. However, reconstruction from time-varying point clouds (a.k.a. 4D point clouds) is generally overlooked. In this paper, we propose a new network architecture, namely RFNet-4D++, that jointly reconstructs objects and their motion flows from 4D point clouds. The key insight is simultaneously performing both tasks via learning of spatial and temporal features from a sequence of point clouds can leverage individual tasks, leading to improved overall performance. To prove this ability, we design a temporal vector field learning module using an unsupervised learning approach for flow estimation task, leveraged by supervised learning of spatial structures for object reconstruction. Extensive experiments and analyses on benchmark…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Advanced Neural Network Applications · Advanced Vision and Imaging
