Improving the Expressiveness of Deep Learning Frameworks with Recursion

Eunji Jeong; Joo Seong Jeong; Soojeong Kim; Gyeong-In Yu; Byung-Gon; Chun

arXiv:1809.00832·cs.LG·September 5, 2018

Improving the Expressiveness of Deep Learning Frameworks with Recursion

Eunji Jeong, Joo Seong Jeong, Soojeong Kim, Gyeong-In Yu, Byung-Gon, Chun

PDF

TL;DR

This paper introduces recursion into deep learning frameworks, enabling more efficient and natural execution of recursive neural networks by exploiting their hierarchical structure for improved performance.

Contribution

It adds recursive execution capabilities and APIs to existing frameworks like TensorFlow, enhancing their ability to represent and efficiently run recursive neural networks.

Findings

01

Recursive implementation outperforms iterative methods in training and inference times.

02

The approach better captures the recursive structure of neural networks.

03

Resource utilization is improved with recursive execution.

Abstract

Recursive neural networks have widely been used by researchers to handle applications with recursively or hierarchically structured data. However, embedded control flow deep learning frameworks such as TensorFlow, Theano, Caffe2, and MXNet fail to efficiently represent and execute such neural networks, due to lack of support for recursion. In this paper, we add recursion to the programming model of existing frameworks by complementing their design with recursive execution of dataflow graphs as well as additional APIs for recursive definitions. Unlike iterative implementations, which can only understand the topological index of each node in recursive data structures, our recursive implementation is able to exploit the recursive relationships between nodes for efficient execution based on parallel computation. We present an implementation on TensorFlow and evaluation results with various…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.