Learning Camera Movement Control from Real-World Drone Videos

Yunzhong Hou; Liang Zheng; Philip Torr

arXiv:2412.09620·cs.CV·December 13, 2024

Learning Camera Movement Control from Real-World Drone Videos

Yunzhong Hou, Liang Zheng, Philip Torr

PDF

Open Access 1 Models 1 Datasets

TL;DR

This paper presents a scalable, data-driven approach to automate drone camera movements for filming, using real-world videos and a transformer-based model to learn complex navigation behaviors.

Contribution

We introduce DVGFormer, a transformer architecture trained on real-world drone trajectories to predict camera movements without heuristics or simulation.

Findings

01

Effectively navigates obstacles and maintains dynamic camera angles.

02

Learns complex camera behaviors like orbiting and obstacle avoidance.

03

Performs well across diverse real and synthetic scenes.

Abstract

This study seeks to automate camera movement control for filming existing subjects into attractive videos, contrasting with the creation of non-existent content by directly generating the pixels. We select drone videos as our test case due to their rich and challenging motion patterns, distinctive viewing angles, and precise controls. Existing AI videography methods struggle with limited appearance diversity in simulation training, high costs of recording expert operations, and difficulties in designing heuristic-based goals to cover all scenarios. To avoid these issues, we propose a scalable method that involves collecting real-world training data to improve diversity, extracting camera trajectories automatically to minimize annotation costs, and training an effective architecture that does not rely on heuristics. Specifically, we collect 99k high-quality trajectories by running 3D…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
yunzhong-hou/DVGFormer
model

Datasets

yunzhong-hou/DroneMotion-99k
dataset· 510 dl
510 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Video Surveillance and Tracking Methods · Robotic Path Planning Algorithms