MotionDeltaCNN: Sparse CNN Inference of Frame Differences in Moving   Camera Videos

Mathias Parger; Chengcheng Tang; Thomas Neff; Christopher D. Twigg,; Cem Keskin; Robert Wang; Markus Steinberger

arXiv:2210.09887·cs.CV·August 16, 2023

MotionDeltaCNN: Sparse CNN Inference of Frame Differences in Moving Camera Videos

Mathias Parger, Chengcheng Tang, Thomas Neff, Christopher D. Twigg,, Cem Keskin, Robert Wang, Markus Steinberger

PDF

Open Access

TL;DR

MotionDeltaCNN is a novel sparse CNN inference framework that efficiently processes moving camera videos by fusing new and old regions without increasing memory, outperforming previous methods significantly.

Contribution

It introduces spherical buffers and padded convolutions to enable efficient fusion of image regions in moving camera videos, extending DeltaCNN's capabilities.

Findings

01

Outperforms DeltaCNN by up to 90% on moving camera videos

02

Supports seamless fusion of new and old regions without extra memory overhead

03

Effectively handles camera motion without knowing extrinsics

Abstract

Convolutional neural network inference on video input is computationally expensive and requires high memory bandwidth. Recently, DeltaCNN managed to reduce the cost by only processing pixels with significant updates over the previous frame. However, DeltaCNN relies on static camera input. Moving cameras add new challenges in how to fuse newly unveiled image regions with already processed regions efficiently to minimize the update rate - without increasing memory overhead and without knowing the camera extrinsics of future frames. In this work, we propose MotionDeltaCNN, a sparse CNN inference framework that supports moving cameras. We introduce spherical buffers and padded convolutions to enable seamless fusion of newly unveiled regions and previously processed regions -- without increasing memory footprint. Our evaluation shows that we outperform DeltaCNN by up to 90% for moving camera…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Generative Adversarial Networks and Image Synthesis · Advanced Vision and Imaging