Mixture separability loss in a deep convolutional network for image classification
Trung Dung Do, Cheng-Bin Jin, Hakil Kim, Van Huan Nguyen

TL;DR
This paper introduces a novel mixture separability loss (MSL) function for deep convolutional networks that enhances training by maintaining weight updates even after initial convergence, leading to improved image classification performance.
Contribution
The paper proposes a new loss function, MSL, which combines between-class and within-class components and can be applied to intermediate layers to improve training effectiveness.
Findings
MSL improves training depth and convergence.
Networks with MSL outperform baseline models on multiple datasets.
MSL effectively utilizes intermediate feature maps for better classification.
Abstract
In machine learning, the cost function is crucial because it measures how good or bad a system is. In image classification, well-known networks only consider modifying the network structures and applying cross-entropy loss at the end of the network. However, using only cross-entropy loss causes a network to stop updating weights when all training images are correctly classified. This is the problem of the early saturation. This paper proposes a novel cost function, called mixture separability loss (MSL), which updates the weights of the network even when most of the training images are accurately predicted. MSL consists of between-class and within-class loss. Between-class loss maximizes the differences between inter-class images, whereas within-class loss minimizes the similarities between intra-class images. We designed the proposed loss function to attach to different convolutional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Remote-Sensing Image Classification
