An Architecture Combining Convolutional Neural Network (CNN) and Support Vector Machine (SVM) for Image Classification
Abien Fred Agarap

TL;DR
This paper explores combining CNNs with SVM classifiers for image classification, comparing their performance to traditional CNN-Softmax models on MNIST and Fashion-MNIST datasets, showing competitive accuracy levels.
Contribution
It presents an empirical study of CNN-SVM architecture, demonstrating its effectiveness and comparing it to CNN-Softmax on standard datasets.
Findings
CNN-SVM achieved ~99.04% accuracy on MNIST
CNN-Softmax achieved ~99.23% accuracy on MNIST
On Fashion-MNIST, CNN-SVM reached ~90.72% accuracy
Abstract
Convolutional neural networks (CNNs) are similar to "ordinary" neural networks in the sense that they are made up of hidden layers consisting of neurons with "learnable" parameters. These neurons receive inputs, performs a dot product, and then follows it with a non-linearity. The whole network expresses the mapping between raw image pixels and their class scores. Conventionally, the Softmax function is the classifier used at the last layer of this network. However, there have been studies (Alalshekmubarak and Smith, 2013; Agarap, 2017; Tang, 2013) conducted to challenge this norm. The cited studies introduce the usage of linear support vector machine (SVM) in an artificial neural network architecture. This project is yet another take on the subject, and is inspired by (Tang, 2013). Empirical data has shown that the CNN-SVM model was able to achieve a test accuracy of ~99.04% using the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Face and Expression Recognition · Image Retrieval and Classification Techniques
MethodsSoftmax
