Leveraging Motion Estimation for Efficient Bayer-Domain Computer Vision
Haichao Wang, Xinyue Xi, Jiangtao Wen, Yuxing Han

TL;DR
This paper introduces MEVC, a novel framework that accelerates video vision tasks directly in the Bayer domain by integrating motion estimation into convolutional layers, reducing computation by over 70% with minimal accuracy loss.
Contribution
The paper presents the first effective method to reuse motion estimation for accelerating video vision directly from raw Bayer sensor data, eliminating the need for traditional ISP processing.
Findings
Achieves over 70% reduction in FLOPs across multiple benchmarks.
Supports raw Bayer input for various vision tasks.
Maintains high accuracy with minimal degradation.
Abstract
Existing computer vision processing pipeline acquires visual information using an image sensor that captures pixel information in the Bayer pattern. The raw sensor data are then processed using an image signal processor (ISP) that first converts Bayer pixel data to RGB on a pixel by pixel basis, followed by video convolutional network (VCN) processing on a frame by frame basis. Both ISP and VCN are computationally expensive with high power consumption and latency. In this paper, we propose a novel framework that eliminates the ISP and leverages motion estimation to accelerate video vision tasks directly in the Bayer domain. We introduce Motion Estimation-based Video Convolution (MEVC), which integrates sliding-window motion estimation into each convolutional layer, enabling prediction and residual-based refinement that reduces redundant computations across frames. This design bridges…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing Techniques and Applications · Advanced Vision and Imaging · Image and Video Stabilization
