MemCam: Memory-Augmented Camera Control for Consistent Video Generation

Xinhang Gao; Junlin Guan; Shuhan Luo; Wenzhuo Li; Guanghuan Tan; Jiacheng Wang

arXiv:2603.26193·cs.CV·March 30, 2026

MemCam: Memory-Augmented Camera Control for Consistent Video Generation

Xinhang Gao, Junlin Guan, Shuhan Luo, Wenzhuo Li, Guanghuan Tan, Jiacheng Wang

PDF

1 Models

TL;DR

MemCam introduces a memory-augmented approach for interactive video generation that maintains scene consistency over long sequences by leveraging external memory and context compression.

Contribution

The paper proposes MemCam, a novel memory-augmented method with context compression and dynamic retrieval to improve scene consistency in long, camera-controlled videos.

Findings

01

MemCam outperforms baselines in scene consistency for long videos.

02

It effectively maintains scene coherence during large camera rotations.

03

The approach reduces computational overhead while enriching contextual information.

Abstract

Interactive video generation has significant potential for scene simulation and video creation. However, existing methods often struggle with maintaining scene consistency during long video generation under dynamic camera control due to limited contextual information. To address this challenge, we propose MemCam, a memory-augmented interactive video generation approach that treats previously generated frames as external memory and leverages them as contextual conditioning to achieve controllable camera viewpoints with high scene consistency. To enable longer and more relevant context, we design a context compression module that encodes memory frames into compact representations and employs co-visibility-based selection to dynamically retrieve the most relevant historical frames, thereby reducing computational overhead while enriching contextual information. Experiments on interactive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
newhorizon2005/MemCam
model· ♡ 1
♡ 1

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.