GPTPU: Accelerating Applications using Edge Tensor Processing Units

Kuan-Chieh Hsu; Hung-Wei Tseng

arXiv:2107.05473 · cs.DC · July 14, 2021

TL;DR

GPTPU is an open framework that enables general-purpose computing on Edge Tensor Processing Units, significantly accelerating applications and reducing energy consumption compared to traditional CPUs.

Contribution

This paper introduces GPTPU, an open-source framework that lets general-purpose applications use the tensor operations of neural network (NN) accelerators, which commercial hardware otherwise exposes only through AI/ML-specific interfaces.

Findings

01 Achieves 2.46x speedup over high-end CPUs

02 Reduces energy consumption by 40%

03 Identifies new use cases for tensor algorithms

Abstract

Neural network (NN) accelerators have been integrated into a wide spectrum of computer systems to accommodate the rapidly growing demands for artificial intelligence (AI) and machine learning (ML) applications. NN accelerators share the idea of providing native hardware support for operations on multidimensional tensor data. Therefore, NN accelerators are theoretically tensor processors that can improve system performance for any problem that uses tensors as inputs/outputs. Unfortunately, commercially available NN accelerators only expose computation capabilities through AI/ML-specific interfaces. Furthermore, NN accelerators reveal very few hardware design details, so applications cannot easily leverage the tensor operations NN accelerators provide. This paper introduces General-Purpose Computing on Edge Tensor Processing Units (GPTPU), an open-source, open-architecture framework…

Code & Models

Repositories

escalab/GPTPU
tf

Taxonomy

Topics: Parallel Computing and Optimization Techniques · Tensor decomposition and applications · Advanced Neural Network Applications