92c/MFlops/s, Ultra-Large-Scale Neural-Network Training on a PIII   Cluster

Douglas Aberdeen; Jonathan Baxter; Robert Edwards

arXiv:1911.05181·cs.LG·November 14, 2019

92c/MFlops/s, Ultra-Large-Scale Neural-Network Training on a PIII Cluster

Douglas Aberdeen, Jonathan Baxter, Robert Edwards

PDF

TL;DR

This paper demonstrates the distributed training of ultra-large neural networks on a cluster of Pentium III processors, achieving high performance and cost efficiency in recognizing Japanese characters from a large dataset.

Contribution

It introduces a technique for distributed training of ultra-large neural networks on commodity hardware, with a practical experiment showing high performance and cost-effectiveness.

Findings

01

Achieved 163.3 GFlops/s performance in training

02

Trained a 1.73 million parameter network on 9 million patterns

03

Cost-effective training with 92.4c/MFlops/s price/performance ratio

Abstract

Artificial neural networks with millions of adjustable parameters and a similar number of training examples are a potential solution for difficult, large-scale pattern recognition problems in areas such as speech and face recognition, classification of large volumes of web data, and finance. The bottleneck is that neural network training involves iterative gradient descent and is extremely computationally intensive. In this paper we present a technique for distributed training of Ultra Large Scale Neural Networks (ULSNN) on Bunyip, a Linux-based cluster of 196 Pentium III processors. To illustrate ULSNN training we describe an experiment in which a neural network with 1.73 million adjustable parameters was trained to recognize machine-printed Japanese characters from a database containing 9 million training patterns. The training runs with a average performance of 163.3 GFlops/s (single…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.