Coloring Deep CNN Layers with Activation Hue Loss
Louis-Fran\c{c}ois Bouchard, Mohsen Ben Lazreg, Matthew Toews

TL;DR
This paper introduces the activation hue, a novel angular regularization method for deep CNNs, which improves classification performance by encouraging class-specific activation angles during training.
Contribution
It proposes the activation hue concept and a regularization loss based on angular activation labels, enhancing CNN training effectiveness.
Findings
Activation hue captures class-informative activation angles.
Regularization with hue loss improves classification accuracy.
Method is effective across various datasets including ImageNet.
Abstract
This paper proposes a novel hue-like angular parameter to model the structure of deep convolutional neural network (CNN) activation space, referred to as the {\em activation hue}, for the purpose of regularizing models for more effective learning. The activation hue generalizes the notion of color hue angle in standard 3-channel RGB intensity space to -channel activation space. A series of observations based on nearest neighbor indexing of activation vectors with pre-trained networks indicate that class-informative activations are concentrated about an angle in both the image plane and in multi-channel activation space. A regularization term in the form of hue-like angular labels is proposed to complement standard one-hot loss. Training from scratch using combined one-hot + activation hue loss improves classification performance modestly for a wide variety…
Peer Reviews
Decision·Submitted to ICLR 2024
1. The presentation is clear and easy to follow. 2. The method is well-motivated and novel. The analysis is interesting and insightful. 3. The evaluation and visualization is extensive and interesting.
1. The experiments are conducted with traditional architectures. How about applying the proposed method for training more advanced architecture, such as Vision Transformer? 2. There are already some well-known techniques based on the similarity of activation vectors, such as label smooth. I think the related methods should be compared or discussed. 3. The extra regularization term seems to introduce extra computation costs. I think the comparison of computation should be presented.
1. The introduction of the activation hue is an innovative way to regularize the structure of CNN's activation space, which may lead to improved model performance for CNN architecture. The generalization of the notion of color hue to N-channel activation space is an interesting concept that could have broad applications in the field. 2. The combined use of one-hot loss and activation hue loss has been shown to modestly improve classification performance across a variety of classification tasks w
1. The paper does not provide evaluation results by properly scaling the employed models, e.g., applying the approach to ResNet-50 or a larger one. Thus, it is difficult to assess the extent of improvement brought by the proposed method. 2. The proposed activation hue's properties should be discussed along with experiments. Will it improve the CNN network converge, or make it robust to some perturbations? 3. More results about employing the given method to downstream tasks, e.g., detection, an
See above
See above
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Neural Networks and Applications · Adversarial Robustness in Machine Learning
