Vehicle classification using ResNets, localisation and spatially-weighted pooling
Rohan Watkins, Nick Pears, Suresh Manandhar

TL;DR
This paper explores the effectiveness of ResNet architectures with spatially-weighted pooling and localization for fine-grained vehicle classification, achieving high accuracy without pre-training.
Contribution
It introduces the combination of ResNet, spatially-weighted pooling, and localization for improved vehicle classification accuracy.
Findings
Spatially Weighted Pooling increases accuracy by 1.5%
Localization improves accuracy by 3.4%
Combined method achieves 96.35% accuracy on the dataset
Abstract
We investigate whether ResNet architectures can outperform more traditional Convolutional Neural Networks on the task of fine-grained vehicle classification. We train and test ResNet-18, ResNet-34 and ResNet-50 on the Comprehensive Cars dataset without pre-training on other datasets. We then modify the networks to use Spatially Weighted Pooling. Finally, we add a localisation step before the classification process, using a network based on ResNet-50. We find that using Spatially Weighted Pooling and localisation both improve classification accuracy of ResNet50. Spatially Weighted Pooling increases accuracy by 1.5 percent points and localisation increases accuracy by 3.4 percent points. Using both increases accuracy by 3.7 percent points giving a top-1 accuracy of 96.351\% on the Comprehensive Cars dataset. Our method achieves higher accuracy than a range of methods including those that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification
