OcclusionFusion: Occlusion-aware Motion Estimation for Real-time Dynamic 3D Reconstruction
Wenbin Lin, Chengwei Zheng, Jun-Hai Yong, Feng Xu

TL;DR
OcclusionFusion introduces an occlusion-aware motion estimation method using LSTM and graph neural networks to improve real-time 3D reconstruction accuracy under occlusions.
Contribution
The paper presents a novel approach combining LSTM and graph neural networks for occlusion-aware motion estimation in real-time 3D reconstruction.
Findings
Outperforms existing single-view methods significantly.
Reduces motion errors in long, challenging sequences.
Enhances robustness in occluded regions.
Abstract
RGBD-based real-time dynamic 3D reconstruction suffers from inaccurate inter-frame motion estimation as errors may accumulate with online tracking. This problem is even more severe for single-view-based systems due to strong occlusions. Based on these observations, we propose OcclusionFusion, a novel method to calculate occlusion-aware 3D motion to guide the reconstruction. In our technique, the motion of visible regions is first estimated and combined with temporal information to infer the motion of the occluded regions through an LSTM-involved graph neural network. Furthermore, our method computes the confidence of the estimated motion by modeling the network output with a probabilistic model, which alleviates untrustworthy motions and enables robust tracking. Experimental results on public datasets and our own recorded data show that our technique outperforms existing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Human Pose and Action Recognition · Computer Graphics and Visualization Techniques
