MMBench: Benchmarking End-to-End Multi-modal DNNs and Understanding Their Hardware-Software Implications
Cheng Xu, Xiaofeng Hou, Jiacheng Liu, Chao Li, Tianhao, Huang, Xiaozhi Zhu, Mo Niu, Lingyu Sun, Peng Tang, Tongqiao Xu, and Kwang-Ting Cheng, Minyi Guo

TL;DR
MMBench is a comprehensive benchmark suite designed to evaluate multi-modal DNNs, revealing their unique execution characteristics and guiding future hardware and software optimization for cloud and edge systems.
Contribution
The paper introduces MMBench, the first comprehensive benchmark suite for multi-modal DNNs, and provides an in-depth analysis of their system and hardware implications.
Findings
Multi-modal DNNs exhibit multi-stage execution and frequent synchronization.
They show high heterogeneity compared to uni-modal DNNs.
The benchmark extends to edge devices for broader applicability.
Abstract
The explosive growth of various types of big data and advances in AI technologies have catalyzed a new type of workloads called multi-modal DNNs. Multi-modal DNNs are capable of interpreting and reasoning about information from multiple modalities, making them more applicable to real-world AI scenarios. In recent research, multi-modal DNNs have outperformed the best uni-modal DNN in a wide range of distributed computing applications from traditional multimedia systems to emerging autonomous edge systems. However, despite their importance and superiority, very limited research attention has been devoted to understand the characteristics of multi-modal DNNs and their implications on current computing software/hardware platforms. Existing benchmarks either target uni-modal DNNs or only focus on the algorithm characteristics of multi-modal DNNs. There lacks representative benchmark suites…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFerroelectric and Negative Capacitance Devices · Advanced Computing and Algorithms · Visual Attention and Saliency Detection
