Binary Neural Network in Robotic Manipulation: Flexible Object Manipulation for Humanoid Robot Using Partially Binarized Auto-Encoder on FPGA
Satoshi Ohara, Tetsuya Ogata, Hiromitsu Awano

TL;DR
This paper presents a partially binarized auto-encoder neural network for flexible object manipulation in humanoid robots, optimized for FPGA implementation to achieve high speed and low power consumption.
Contribution
It introduces a novel partially binarized auto-encoder model that reduces model size while maintaining inference accuracy, enabling efficient FPGA deployment for robotic manipulation.
Findings
Achieves 41.1 fps on FPGA with 3.1 W power consumption
Outperforms CPU and GPU systems by 10x and 3.7x in speed
Maintains inference accuracy with model compression
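The throughput and power figures above can be combined into a per-frame latency and energy estimate. The following is a minimal sketch of that arithmetic, assuming the 3.1 W figure is the average power drawn while running at 41.1 fps; the variable names are illustrative and not taken from the paper.

```python
# Reported figures from the findings above.
FPS = 41.1       # frames per second on the FPGA
POWER_W = 3.1    # power consumption in watts (assumed to be the average)

# Time spent on one frame, in milliseconds.
latency_ms = 1000.0 / FPS

# Energy spent on one frame, in joules (watts are joules per second).
energy_per_frame_j = POWER_W / FPS

print(f"latency per frame: {latency_ms:.1f} ms")
print(f"energy per frame:  {energy_per_frame_j * 1000:.1f} mJ")
```

Under that assumption, each frame takes roughly 24 ms and costs roughly 75 mJ.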
Abstract
A neural-network-based flexible object manipulation system for a humanoid robot, implemented on an FPGA, is proposed. Although manipulating flexible objects with robots attracts ever-increasing attention, since such tasks are basic and essential activities in daily life, it has been put into practice only recently with the help of deep neural networks. However, such systems have relied on GPU accelerators, which cannot be fitted into the space-limited robotic body. Although field-programmable gate arrays (FPGAs) are known to be energy efficient and suitable for embedded systems, the model size must be drastically reduced because FPGAs have limited on-chip memory. To this end, we propose a "partially" binarized deep convolutional auto-encoder technique, in which only the encoder part is binarized to compress the model size without degrading the inference accuracy. The model implemented on…
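To illustrate why binarizing only the encoder already shrinks the model, here is a minimal sketch of weight binarization and the resulting storage cost. It is not the authors' implementation: the sign-based binarizer, the bit-packing layout, the 32-bit baseline, and the parameter counts are all assumptions chosen only to show the arithmetic.

```python
# Illustrative sketch only: the binarizer, packing layout, and parameter
# counts below are assumptions, not details taken from the paper.

def binarize(weights):
    """Map each real-valued weight to +1 or -1 (sign, with 0 mapped to +1)."""
    return [1 if w >= 0 else -1 for w in weights]

def pack_bits(binary_weights):
    """Pack +1/-1 weights into bytes, 8 weights per byte (+1 sets the bit)."""
    packed = bytearray((len(binary_weights) + 7) // 8)
    for i, b in enumerate(binary_weights):
        if b == 1:
            packed[i // 8] |= 1 << (i % 8)
    return bytes(packed)

def model_bytes(encoder_params, decoder_params, binarize_encoder):
    """Weight storage in bytes: 1 bit per binarized weight, 32 bits otherwise."""
    encoder_bits = encoder_params * (1 if binarize_encoder else 32)
    decoder_bits = decoder_params * 32  # decoder stays full precision
    return (encoder_bits + decoder_bits) // 8

# Hypothetical parameter counts for a symmetric auto-encoder.
ENCODER_PARAMS = 1_000_000
DECODER_PARAMS = 1_000_000

full = model_bytes(ENCODER_PARAMS, DECODER_PARAMS, binarize_encoder=False)
partial = model_bytes(ENCODER_PARAMS, DECODER_PARAMS, binarize_encoder=True)

print(binarize([0.3, -1.2, 0.0]))   # three example weights
print(f"full precision:    {full} bytes")
print(f"encoder binarized: {partial} bytes")
```

With these hypothetical sizes, the encoder's share of the weight storage drops by a factor of 32 while the decoder is left untouched, which is the trade-off the abstract describes: compression comes from the encoder, and the full-precision decoder is kept to protect output quality.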
Taxonomy
Topics: Neuroscience and Neural Engineering · CCD and CMOS Imaging Sensors · Robot Manipulation and Learning
