Relay: A New IR for Machine Learning Frameworks

Jared Roesch; Steven Lyubomirsky; Logan Weber; Josh Pollock; Marisa; Kirisame; Tianqi Chen; Zachary Tatlock

arXiv:1810.00952·cs.PL·October 3, 2018

Relay: A New IR for Machine Learning Frameworks

Jared Roesch, Steven Lyubomirsky, Logan Weber, Josh Pollock, Marisa, Kirisame, Tianqi Chen, Zachary Tatlock

PDF

TL;DR

Relay introduces a high-level, purely-functional, statically-typed intermediate representation designed to improve efficiency, expressiveness, and portability in machine learning frameworks across diverse hardware devices.

Contribution

It presents a novel IR called Relay that balances efficiency, expressiveness, and portability, integrated into the NNVM compiler framework used by MxNet.

Findings

01

Relay is part of the open source NNVM compiler framework.

02

Designed as a purely-functional, statically-typed language.

03

Aims to optimize compilation for heterogeneous hardware.

Abstract

Machine learning powers diverse services in industry including search, translation, recommendation systems, and security. The scale and importance of these models require that they be efficient, expressive, and portable across an array of heterogeneous hardware devices. These constraints are often at odds; in order to better accommodate them we propose a new high-level intermediate representation (IR) called Relay. Relay is being designed as a purely-functional, statically-typed language with the goal of balancing efficient compilation, expressiveness, and portability. We discuss the goals of Relay and highlight its important design constraints. Our prototype is part of the open source NNVM compiler framework, which powers Amazon's deep learning framework MxNet.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.