DeepCache: Principled Cache for Mobile Deep Vision

Mengwei Xu; Mengze Zhu; Yunxin Liu; Felix Xiaozhu Lin; and Xuanzhe Liu

arXiv:1712.01670·cs.CV·March 31, 2020

DeepCache: Principled Cache for Mobile Deep Vision

Mengwei Xu, Mengze Zhu, Yunxin Liu, Felix Xiaozhu Lin, and Xuanzhe Liu

PDF

1 Repo

TL;DR

DeepCache is a cache system designed for mobile deep vision that exploits temporal locality in video streams to improve inference efficiency, reduce energy consumption, and operate seamlessly with existing models.

Contribution

It introduces a principled cache design that leverages video and model structure without requiring model modifications or manual tuning.

Findings

01

Average inference time reduced by 18%

02

Maximum inference time reduction of 47%

03

System energy consumption decreased by 20%

Abstract

We present DeepCache, a principled cache design for deep learning inference in continuous mobile vision. DeepCache benefits model execution efficiency by exploiting temporal locality in input video streams. It addresses a key challenge raised by mobile vision: the cache must operate under video scene variation, while trading off among cacheability, overhead, and loss in model accuracy. At the input of a model, DeepCache discovers video temporal locality by exploiting the video's internal structure, for which it borrows proven heuristics from video compression; into the model, DeepCache propagates regions of reusable results by exploiting the model's internal structure. Notably, DeepCache eschews applying video heuristics to model internals which are not pixels but high-dimensional, difficult-to-interpret data. Our implementation of DeepCache works with unmodified deep learning models,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xumengwei/DeepCache
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.