Convex Loss Functions for Support Vector Machines (SVMs) and Neural Networks
Filippo Portera

TL;DR
This paper introduces a new convex loss function for SVMs and neural networks, demonstrating improved generalization performance and experimental validation on small datasets for classification and regression tasks.
Contribution
The paper proposes a novel convex loss function that incorporates pattern correlations, with derivations and experimental validation for SVMs and neural networks.
Findings
Up to 2.0% F1 score improvement in classification
1.0% reduction in MSE for regression
Consistent or better generalization measures compared to standard losses
Abstract
We propose a new convex loss for Support Vector Machines, both for the binary classification and for the regression models. Therefore, we show the mathematical derivation of the dual problems and we experiment with them on several small datasets. The minimal dimension of those datasets is due to the difficult scalability of the SVM method to bigger instances. This preliminary study should prove that using pattern correlations inside the loss function could enhance the generalisation performances. Our method consistently achieved comparable or superior performance, with improvements of up to 2.0% in F1 scores for classification tasks and 1.0% reduction in Mean Squared Error (MSE) for regression tasks across various datasets, compared to standard losses. Coherently, results show that generalisation measures are never worse than the standard losses and several times they are better. In our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Stochastic Gradient Optimization Techniques · Neural Networks and Applications
