Efficient and Flexible Method for Reducing Moderate-size Deep Neural   Networks with Condensation

Tianyi Chen; Zhi-Qin John Xu

arXiv:2405.01041·cs.LG·July 2, 2024

Efficient and Flexible Method for Reducing Moderate-size Deep Neural Networks with Condensation

Tianyi Chen, Zhi-Qin John Xu

PDF

Open Access

TL;DR

This paper introduces a condensation-based reduction algorithm for moderate-size neural networks, effectively shrinking their size while preserving accuracy, thus enhancing inference speed for scientific and image classification tasks.

Contribution

The paper proposes a novel condensation reduction algorithm applicable to fully connected and convolutional networks, demonstrating significant size reduction with maintained performance.

Findings

01

Reduced network size to 41.7% in combustion tasks

02

Shrank CIFAR10 network to 11.5% of original size

03

Maintained prediction accuracy after reduction

Abstract

Neural networks have been extensively applied to a variety of tasks, achieving astounding results. Applying neural networks in the scientific field is an important research direction that is gaining increasing attention. In scientific applications, the scale of neural networks is generally moderate-size, mainly to ensure the speed of inference during application. Additionally, comparing neural networks to traditional algorithms in scientific applications is inevitable. These applications often require rapid computations, making the reduction of neural network sizes increasingly important. Existing work has found that the powerful capabilities of neural networks are primarily due to their non-linearity. Theoretical work has discovered that under strong non-linearity, neurons in the same layer tend to behave similarly, a phenomenon known as condensation. Condensation offers an opportunity…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings