ReRoPE: Repurposing RoPE for Relative Camera Control

Chunyang Li; Yuanbo Yang; Jiahao Shao; Hongyu Zhou; Katja Schwarz; Yiyi Liao

arXiv:2602.08068·cs.CV·February 10, 2026

ReRoPE: Repurposing RoPE for Relative Camera Control

Chunyang Li, Yuanbo Yang, Jiahao Shao, Hongyu Zhou, Katja Schwarz, Yiyi Liao

PDF

Open Access

TL;DR

ReRoPE introduces a plug-and-play method to incorporate relative camera pose control into pre-trained video diffusion models by leveraging underutilized spectral components of Rotary Positional Embeddings, enhancing control without retraining.

Contribution

It proposes ReRoPE, a novel approach that injects relative camera information into existing models by exploiting spectral bandwidth, enabling controllable video generation efficiently.

Findings

01

ReRoPE improves camera control accuracy in video generation.

02

The method maintains high visual fidelity and pre-trained priors.

03

ReRoPE is training-efficient and adaptable to existing models.

Abstract

Video generation with controllable camera viewpoints is essential for applications such as interactive content creation, gaming, and simulation. Existing methods typically adapt pre-trained video models using camera poses relative to a fixed reference, e.g., the first frame. However, these encodings lack shift-invariance, often leading to poor generalization and accumulated drift. While relative camera pose embeddings defined between arbitrary view pairs offer a more robust alternative, integrating them into pre-trained video diffusion models without prohibitive training costs or architectural changes remains challenging. We introduce ReRoPE, a plug-and-play framework that incorporates relative camera information into pre-trained video diffusion models without compromising their generation capability. Our approach is based on the insight that Rotary Positional Embeddings (RoPE) in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Vision and Imaging · Human Pose and Action Recognition