Designing Network Design Strategies Through Gradient Path Analysis

Chien-Yao Wang; Hong-Yuan Mark Liao; I-Hau Yeh

arXiv:2211.04800·cs.CV·November 10, 2022·180 cites

Designing Network Design Strategies Through Gradient Path Analysis

Chien-Yao Wang, Hong-Yuan Mark Liao, I-Hau Yeh

PDF

Open Access

TL;DR

This paper introduces a novel network design approach based on analyzing gradient paths during backpropagation, aiming to improve model expressiveness and learning ability.

Contribution

It proposes gradient path analysis as a new strategy for designing network architectures, focusing on backpropagation paths rather than data paths.

Findings

01

Gradient path design strategies are theoretically justified.

02

Experimental results show improved network expressiveness.

03

Strategies are effective at layer, stage, and network levels.

Abstract

Designing a high-efficiency and high-quality expressive network architecture has always been the most important research topic in the field of deep learning. Most of today's network design strategies focus on how to integrate features extracted from different layers, and how to design computing units to effectively extract these features, thereby enhancing the expressiveness of the network. This paper proposes a new network design strategy, i.e., to design the network architecture based on gradient path analysis. On the whole, most of today's mainstream network design strategies are based on feed forward path, that is, the network architecture is designed based on the data path. In this paper, we hope to enhance the expressive ability of the trained model by improving the network learning ability. Due to the mechanism driving the network parameter learning is the backward propagation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and ELM · Face and Expression Recognition · Neural Networks and Applications