Binary Neural Network in Robotic Manipulation: Flexible Object Manipulation for Humanoid Robot Using Partially Binarized Auto-Encoder on FPGA

Satoshi Ohara; Tetsuya Ogata; Hiromitsu Awano

arXiv:2107.00209·cs.RO·July 2, 2021

Open Access

TL;DR

This paper presents a partially binarized auto-encoder neural network for flexible object manipulation in humanoid robots, optimized for FPGA implementation to achieve high speed and low power consumption.

Contribution

It introduces a partially binarized auto-encoder, in which only the encoder is binarized, that reduces model size while maintaining inference accuracy, enabling efficient FPGA deployment for robotic manipulation.

Findings

1. Achieves 41.1 fps on FPGA at 3.1 W power consumption
2. Outperforms CPU and GPU systems by 10x and 3.7x, respectively, in speed
3. Maintains inference accuracy with model compression
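
The throughput and power figures above can be combined into per-frame quantities. The following is a small arithmetic sketch, assuming the 3.1 W figure is the average power drawn while the system runs at 41.1 fps; the variable names are illustrative and not taken from the paper.

```python
# Figures quoted in the findings above.
FPS = 41.1       # frames per second on the FPGA
POWER_W = 3.1    # power consumption in watts

# Assumption: 3.1 W is the average power drawn while running at 41.1 fps.
frame_period_ms = 1000.0 / FPS                # time between successive frames
energy_per_frame_mj = 1000.0 * POWER_W / FPS  # energy spent per frame
frames_per_joule = FPS / POWER_W              # throughput per unit of energy

print(f"frame period: {frame_period_ms:.1f} ms")
print(f"energy per frame: {energy_per_frame_mj:.1f} mJ")
print(f"frames per joule: {frames_per_joule:.1f}")
```

Under that assumption, the figures work out to roughly 24 ms per frame, about 75 mJ per frame, and about 13 frames per joule.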

Abstract

A neural-network-based flexible object manipulation system for a humanoid robot, implemented on an FPGA, is proposed. Manipulation of flexible objects by robots is attracting ever-increasing attention because such tasks are basic and essential activities in daily life, yet it has been put into practice only recently with the help of deep neural networks. However, such systems have relied on GPU accelerators, which cannot be fitted into the space-limited robot body. Although field-programmable gate arrays (FPGAs) are known to be energy efficient and suitable for embedded systems, the model size must be drastically reduced because FPGAs have limited on-chip memory. To this end, we propose a "partially" binarized deep convolutional auto-encoder technique, in which only the encoder part is binarized to compress the model size without degrading inference accuracy. The model implemented on…
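
To illustrate why binarizing only the encoder shrinks the model, here is a minimal sketch. It assumes sign-based binarization (a common choice for binary neural networks; the abstract does not state the exact scheme) and 32-bit floating-point storage for non-binarized weights. The parameter counts and the helper names `binarize` and `model_bits` are hypothetical and chosen only for illustration.

```python
# Illustrative sketch only: the layer sizes are hypothetical, not from the paper.

def binarize(weights):
    """Map each real-valued weight to +1 or -1 by its sign (0 maps to +1)."""
    return [1 if w >= 0 else -1 for w in weights]

def model_bits(encoder_params, decoder_params, binarize_encoder):
    """Storage in bits: 1 bit per binarized weight, 32 bits per float32 weight."""
    encoder_bits = encoder_params * (1 if binarize_encoder else 32)
    decoder_bits = decoder_params * 32  # the decoder stays full precision
    return encoder_bits + decoder_bits

# Hypothetical parameter counts for the encoder and the decoder.
ENCODER_PARAMS = 100_000
DECODER_PARAMS = 100_000

full_precision_bits = model_bits(ENCODER_PARAMS, DECODER_PARAMS, binarize_encoder=False)
partially_binarized_bits = model_bits(ENCODER_PARAMS, DECODER_PARAMS, binarize_encoder=True)
compression_ratio = full_precision_bits / partially_binarized_bits

print(binarize([0.7, -0.2, 0.0, -1.3]))
print(f"full precision: {full_precision_bits} bits")
print(f"partially binarized: {partially_binarized_bits} bits")
print(f"compression ratio: {compression_ratio:.2f}x")
```

With these made-up sizes the encoder's storage drops by a factor of 32 while the decoder is unchanged, so the overall saving depends on how the parameters are split between the two halves.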

Taxonomy

Topics: Neuroscience and Neural Engineering · CCD and CMOS Imaging Sensors · Robot Manipulation and Learning