Morph: Flexible Acceleration for 3D CNN-based Video Understanding
Kartik Hegde, Rohit Agrawal, Yulun Yao, Christopher W. Fletcher

TL;DR
Morph is a flexible hardware accelerator designed for 3D CNNs in video recognition, achieving significant energy efficiency and performance improvements through adaptive support for various tiling strategies.
Contribution
The paper introduces Morph, a novel adaptive accelerator architecture for 3D CNNs, with a co-designed software infrastructure for optimized configuration, addressing the unique challenges of video recognition acceleration.
Findings
Up to 3.4x energy reduction compared to baseline
Up to 5.1x performance/watt improvement
15.9x energy reduction over Eyeriss
Abstract
The past several years have seen both an explosion in the use of Convolutional Neural Networks (CNNs) and the design of accelerators to make CNN inference practical. In the architecture community, the lion share of effort has targeted CNN inference for image recognition. The closely related problem of video recognition has received far less attention as an accelerator target. This is surprising, as video recognition is more computationally intensive than image recognition, and video traffic is predicted to be the majority of internet traffic in the coming years. This paper fills the gap between algorithmic and hardware advances for video recognition by providing a design space exploration and flexible architecture for accelerating 3D Convolutional Neural Networks (3D CNNs) - the core kernel in modern video understanding. When compared to (2D) CNNs used for image recognition,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
