Multi-stage feature decorrelation constraints for improving CNN classification performance
Qiuyu Zhu, Hao Wang, Xuewen Zu, Chengfei Liu

TL;DR
This paper introduces a multi-stage feature decorrelation loss (MFD Loss) for CNNs, which reduces feature redundancy across layers, leading to improved classification accuracy demonstrated on multiple datasets.
Contribution
The paper proposes a novel MFD Loss that constrains feature correlations at all stages of CNNs, enhancing feature effectiveness and classification performance.
Findings
MFD Loss improves CNN classification accuracy across datasets.
Constraining multi-stage features reduces information redundancy.
MFD Loss is compatible with various loss functions and CNN architectures.
Abstract
For the convolutional neural network (CNN) used for pattern classification, the training loss function is usually applied to the final output of the network, except for some regularization constraints on the network parameters. However, with the increasing of the number of network layers, the influence of the loss function on the network front layers gradually decreases, and the network parameters tend to fall into local optimization. At the same time, it is found that the trained network has significant information redundancy at all stages of features, which reduces the effectiveness of feature mapping at all stages and is not conducive to the change of the subsequent parameters of the network in the direction of optimality. Therefore, it is possible to obtain a more optimized solution of the network and further improve the classification accuracy of the network by designing a loss…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Advanced Neural Network Applications · Machine Learning and ELM
MethodsSoftmax
