Towards Head Motion Compensation Using Multi-Scale Convolutional Neural   Networks

Omer Rajput; Nils Gessert; Martin Gromniak; Lars Matth\"aus; Alexander; Schlaefer

arXiv:1807.03651·cs.CV·July 11, 2018·1 cites

Towards Head Motion Compensation Using Multi-Scale Convolutional Neural Networks

Omer Rajput, Nils Gessert, Martin Gromniak, Lars Matth\"aus, Alexander, Schlaefer

PDF

Open Access

TL;DR

This paper explores markerless head pose estimation using RGBD data for medical applications, introducing a novel multi-scale CNN architecture and a systematic data collection method with ground-truth labels.

Contribution

It proposes a new multi-scale CNN architecture for improved head pose regression and a systematic data acquisition strategy with ground-truth labels for training.

Findings

01

Multi-scale CNN improves pose estimation accuracy.

02

Systematic data collection enhances training quality.

03

Comparison with model-based tracking shows advantages.

Abstract

Head pose estimation and tracking is useful in variety of medical applications. With the advent of RGBD cameras like Kinect, it has become feasible to do markerless tracking by estimating the head pose directly from the point clouds. One specific medical application is robot assisted transcranial magnetic stimulation (TMS) where any patient motion is compensated with the help of a robot. For increased patient comfort, it is important to track the head without markers. In this regard, we address the head pose estimation problem using two different approaches. In the first approach, we build upon the more traditional approach of model based head tracking, where a head model is morphed according to the particular head to be tracked and the morphed model is used to track the head in the point cloud streams. In the second approach, we propose a new multi-scale convolutional neural network…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Advanced Vision and Imaging · Human Pose and Action Recognition