Predicting Head Movement in Panoramic Video: A Deep Reinforcement Learning Approach

Yuhang Song; Mai Xu; Jianyi Wang; Minglang Qiao; Liangyu Huo; Zulin Wang

arXiv:1710.10755 · cs.CV · December 2, 2019

TL;DR

This paper introduces a deep reinforcement learning method to predict head movements in panoramic video, leveraging a new database and demonstrating effectiveness in both offline and online scenarios.

Contribution

It presents a novel DRL-based approach for head movement prediction in panoramic video, including offline and online models, supported by a new HM database and validation experiments.

Findings

01. High consistency of HM data across subjects
02. DRL effectively predicts HM positions
03. Offline model enhances online prediction performance

Abstract

Panoramic video provides an immersive and interactive experience by enabling humans to control the field of view (FoV) through head movement (HM). Thus, HM plays a key role in modeling human attention on panoramic video. This paper establishes a database collecting subjects' HM in panoramic video sequences. From this database, we find that the HM data are highly consistent across subjects. Furthermore, we find that deep reinforcement learning (DRL) can be applied to predict HM positions, via maximizing the reward of imitating human HM scanpaths through the agent's actions. Based on our findings, we propose a DRL-based HM prediction (DHP) approach with offline and online versions, called offline-DHP and online-DHP. In offline-DHP, multiple DRL workflows are run to determine potential HM positions at each panoramic frame. Then, a heat map of the potential HM positions, named the HM map, is…
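To make the idea of an HM map concrete, here is a minimal sketch of how a set of candidate HM positions for one frame could be accumulated into a heat map on an equirectangular grid. It is an illustration only, not the authors' implementation: the abstract is cut off before it says how the HM map is produced, so the function name `hm_map`, the grid size, the Gaussian kernel, and the `sigma_deg` value are all assumptions made for this example.

```python
import numpy as np


def hm_map(positions_deg, height=90, width=180, sigma_deg=7.0):
    """Accumulate (longitude, latitude) positions, in degrees, into a heat map.

    Hypothetical helper: grid size, Gaussian kernel, and sigma are
    illustrative assumptions, not values taken from the paper.
    Longitude is in [-180, 180), latitude in [-90, 90].
    """
    # Cell-center coordinates of an equirectangular grid, in radians.
    lon_c = np.deg2rad((np.arange(width) + 0.5) * 360.0 / width - 180.0)
    lat_c = np.deg2rad(90.0 - (np.arange(height) + 0.5) * 180.0 / height)
    lon_grid, lat_grid = np.meshgrid(lon_c, lat_c)

    sigma = np.deg2rad(sigma_deg)
    heat = np.zeros((height, width), dtype=np.float64)

    for lon_deg, lat_deg in positions_deg:
        lon, lat = np.deg2rad(lon_deg), np.deg2rad(lat_deg)
        # Great-circle distance from this position to every cell center,
        # so the kernel wraps correctly across the +/-180 degree seam.
        cos_d = (np.sin(lat) * np.sin(lat_grid)
                 + np.cos(lat) * np.cos(lat_grid) * np.cos(lon - lon_grid))
        dist = np.arccos(np.clip(cos_d, -1.0, 1.0))
        heat += np.exp(-(dist ** 2) / (2.0 * sigma ** 2))

    # Scale to [0, 1]; an empty input yields an all-zero map.
    peak = heat.max()
    return heat / peak if peak > 0 else heat


if __name__ == "__main__":
    # Three made-up candidate positions for a single frame.
    frame_map = hm_map([(1.0, 1.0), (5.0, -3.0), (179.0, 0.0)])
    print(frame_map.shape, float(frame_map.max()))
```

Measuring distance on the sphere rather than in pixel coordinates is a deliberate choice in this sketch: a position near longitude 180° also contributes to cells just past -180°, which a plain 2D Gaussian blur on the flattened frame would miss.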

Code & Models

Repositories

YuhangSong/DHP (tf, official)
