The Binary Quantized Neural Network for Dense Prediction via Specially   Designed Upsampling and Attention

Xingyu Ding; Lianlei Shan; Guiqin Zhao; Meiqi Wu; Wenzhang; Zhou; Wei Li

arXiv:2405.17776·cs.LG·May 29, 2024·3 cites

The Binary Quantized Neural Network for Dense Prediction via Specially Designed Upsampling and Attention

Xingyu Ding, Lianlei Shan, Guiqin Zhao, Meiqi Wu, Wenzhang, Zhou, Wei Li

PDF

Open Access

TL;DR

This paper introduces a binary neural network approach for dense prediction tasks, utilizing specially designed upsampling and attention strategies to improve accuracy and efficiency in tasks like segmentation.

Contribution

It presents a novel multi-branch upsampling method and an optimized attention mechanism that significantly reduce computation while maintaining high accuracy in binary neural networks for dense prediction.

Findings

01

Achieved high accuracy with binary neural networks on dense prediction tasks.

02

Reduced attention computation complexity by a factor of 100.

03

Validated effectiveness on Cityscapes, KITTI, and ECSSD datasets.

Abstract

Deep learning-based information processing consumes long time and requires huge computing resources, especially for dense prediction tasks which require an output for each pixel, like semantic segmentation and salient object detection. There are mainly two challenges for quantization of dense prediction tasks. Firstly, directly applying the upsampling operation that dense prediction tasks require is extremely crude and causes unacceptable accuracy reduction. Secondly, the complex structure of dense prediction networks means it is difficult to maintain a fast speed as well as a high accuracy when performing quantization. In this paper, we propose an effective upsampling method and an efficient attention computation strategy to transfer the success of the binary neural networks (BNN) from single prediction tasks to dense prediction tasks. Firstly, we design a simple and robust…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings