Unsupervised Skin Feature Tracking with Deep Neural Networks
Jose Chang, Torbj\"orn E.M. Nordling

TL;DR
This paper introduces an unsupervised deep learning pipeline for skin feature tracking that achieves high accuracy without extensive labeled data, outperforming traditional and modern supervised methods in challenging motion scenarios.
Contribution
The paper presents a novel unsupervised autoencoder-based method for skin feature tracking, reducing data needs and improving accuracy over existing supervised and traditional techniques.
Findings
Mean error between 0.6 and 3.3 pixels
Outperforms SIFT, SURF, Lucas Kanade, PIPs++, and CoTracker
Effective in high-motion conditions
Abstract
Facial feature tracking is essential in imaging ballistocardiography for accurate heart rate estimation and enables motor degradation quantification in Parkinson's disease through skin feature tracking. While deep convolutional neural networks have shown remarkable accuracy in tracking tasks, they typically require extensive labeled data for supervised training. Our proposed pipeline employs a convolutional stacked autoencoder to match image crops with a reference crop containing the target feature, learning deep feature encodings specific to the object category in an unsupervised manner, thus reducing data requirements. To overcome edge effects making the performance dependent on crop size, we introduced a Gaussian weight on the residual errors of the pixels when calculating the loss function. Training the autoencoder on facial images and validating its performance on manually labeled…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods
