Learning non-rigid surface reconstruction from spatio-temporal image patches
Matteo Pedone, Abdelrahman Mostafa, Janne heikkil\"a

TL;DR
This paper introduces a neural network-based method for dense, non-rigid surface reconstruction from video, leveraging local spatio-temporal patches and synthetic training data to improve accuracy over traditional methods.
Contribution
The novel approach combines local patch-based depth estimation with synthetic training data for improved non-rigid surface reconstruction from video.
Findings
Lower reconstruction error compared to non-rigid structure from motion.
Effective on both synthetic and real Kinect data.
Utilizes a parametric model for local patches to simplify complexity.
Abstract
We present a method to reconstruct a dense spatio-temporal depth map of a non-rigidly deformable object directly from a video sequence. The estimation of depth is performed locally on spatio-temporal patches of the video, and then the full depth video of the entire shape is recovered by combining them together. Since the geometric complexity of a local spatio-temporal patch of a deforming non-rigid object is often simple enough to be faithfully represented with a parametric model, we artificially generate a database of small deforming rectangular meshes rendered with different material properties and light conditions, along with their corresponding depth videos, and use such data to train a convolutional neural network. We tested our method on both synthetic and Kinect data and experimentally observed that the reconstruction error is significantly lower than the one obtained using other…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Optical measurement and interference techniques · Robotics and Sensor-Based Localization
