A New Spatio-Temporal Loss Function for 3D Motion Reconstruction and Extended Temporal Metrics for Motion Evaluation
Mansour Tchenegnon, Sylvie Gibet, Thibaut Le Naour

TL;DR
This paper introduces a novel spatio-temporal Laplacian loss function for improving 3D human motion reconstruction from videos, emphasizing better temporal consistency and comprehensive evaluation metrics.
Contribution
It presents a new Laplacian loss based on graph representation and a fully convolutional temporal network for enhanced motion reconstruction.
Findings
Improved temporal consistency in motion estimation.
Enhanced performance on Human3.6M benchmarks.
Comprehensive evaluation with motion descriptors.
Abstract
We propose a new loss function that we call Laplacian loss, based on spatio-temporal Laplacian representation of the motion as a graph. This loss function is intended to be used in training models for motion reconstruction through 3D human pose estimation from videos. It compares the differential coordinates of the joints obtained from the graph representation of the ground truth against the one of the estimation. We design a fully convolutional temporal network for motion reconstruction to achieve better temporal consistency of estimation. We use this generic model to study the impact of our proposed loss function on the benchmarks provided by Human3.6M. We also make use of various motion descriptors such as velocity, acceleration to make a thorough evaluation of the temporal consistency while comparing the results to some of the state-of-the-art solutions.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Advanced Vision and Imaging · Human Motion and Animation
