Automatic Tuning of Tensorflow's CPU Backend using Gradient-Free   Optimization Algorithms

Derssie Mebratu; Niranjan Hasabnis; Pietro Mercati; Gaurit Sharma,; Shamima Najnin

arXiv:2109.06266·cs.LG·September 15, 2021

Automatic Tuning of Tensorflow's CPU Backend using Gradient-Free Optimization Algorithms

Derssie Mebratu, Niranjan Hasabnis, Pietro Mercati, Gaurit Sharma,, Shamima Najnin

PDF

TL;DR

This paper explores automatic tuning of TensorFlow's CPU backend parameters using gradient-free optimization algorithms, comparing Bayesian optimization, genetic algorithms, and Nelder-Mead simplex to improve deep learning performance.

Contribution

It provides a systematic comparison of three black-box optimization algorithms for tuning TensorFlow's CPU backend, highlighting Bayesian optimization's overall effectiveness.

Findings

01

Bayesian optimization generally outperforms other algorithms.

02

Nelder-Mead simplex is effective in some cases.

03

Manual tuning is complex and sub-optimal.

Abstract

Modern deep learning (DL) applications are built using DL libraries and frameworks such as TensorFlow and PyTorch. These frameworks have complex parameters and tuning them to obtain good training and inference performance is challenging for typical users, such as DL developers and data scientists. Manual tuning requires deep knowledge of the user-controllable parameters of DL frameworks as well as the underlying hardware. It is a slow and tedious process, and it typically delivers sub-optimal solutions. In this paper, we treat the problem of tuning parameters of DL frameworks to improve training and inference performance as a black-box optimization problem. We then investigate applicability and effectiveness of Bayesian optimization (BO), genetic algorithm (GA), and Nelder-Mead simplex (NMS) to tune the parameters of TensorFlow's CPU backend. While prior work has already investigated…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.