Evaluating Deep Learning Models for African Wildlife Image Classification: From DenseNet to Vision Transformers

Lukman Jibril Aliyu; Umar Sani Muhammad; Bilqisu Ismail; Nasiru Muhammad; Almustapha A Wakili; Seid Muhie Yimam; Shamsuddeen Hassan Muhammad; Mustapha Abdullahi

arXiv:2507.21364 · cs.CV · July 30, 2025

TL;DR

This study compares four deep learning models (DenseNet-201, ResNet-152, EfficientNet-B4, and the Vision Transformer ViT-H/14) for classifying African wildlife images, highlighting accuracy, resource needs, and deployment considerations for conservation efforts.

Contribution

The paper provides a comparative analysis of CNNs and a Vision Transformer for wildlife classification, emphasizing practical deployment in conservation contexts.

Findings

01. DenseNet-201 achieved the best accuracy among the convolutional networks (67%).

02. ViT-H/14 achieved the highest overall accuracy (99%), but at significantly higher computational cost.

03. A lightweight DenseNet model was deployed for real-time use.

Abstract

Wildlife populations in Africa face severe threats, with vertebrate numbers declining by over 65% in the past five decades. In response, image classification using deep learning has emerged as a promising tool for biodiversity monitoring and conservation. This paper presents a comparative study of deep learning models for automatically classifying African wildlife images, focusing on transfer learning with frozen feature extractors. Using a public dataset of four species (buffalo, elephant, rhinoceros, and zebra), we evaluate the performance of DenseNet-201, ResNet-152, EfficientNet-B4, and the Vision Transformer ViT-H/14. DenseNet-201 achieved the best performance among the convolutional networks (67% accuracy), while ViT-H/14 achieved the highest overall accuracy (99%), but at a significantly higher computational cost, raising deployment concerns. Our experiments highlight the trade-offs…
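To make "transfer learning with frozen feature extractors" concrete, here is a minimal, dependency-free Python sketch of the general idea, not the authors' pipeline. Everything in it is an assumption for illustration: a fixed random projection (`make_frozen_extractor`) stands in for a pretrained backbone such as DenseNet-201, the 8-dimensional inputs are synthetic, and the helper names and hyperparameters are hypothetical. The only part that learns is a small linear softmax head over the four classes named in the abstract.

```python
import math
import random

CLASSES = ["buffalo", "elephant", "rhinoceros", "zebra"]  # species named in the abstract


def make_frozen_extractor(in_dim, feat_dim, seed=0):
    """Build a fixed feature map; a toy stand-in for a pretrained backbone."""
    rng = random.Random(seed)
    weights = [[rng.gauss(0.0, 1.0 / math.sqrt(in_dim)) for _ in range(in_dim)]
               for _ in range(feat_dim)]

    def extract(x):
        # The weights are captured once and never updated: this is the "frozen" part.
        return [math.tanh(sum(w * v for w, v in zip(row, x))) for row in weights]

    return extract


def softmax(scores):
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def head_scores(W, b, f):
    return [sum(w * v for w, v in zip(row, f)) + bias for row, bias in zip(W, b)]


def train_head(features, labels, n_classes, epochs=200, lr=0.2):
    """Fit a linear softmax head by full-batch gradient descent on cross-entropy."""
    dim, n = len(features[0]), len(features)
    W = [[0.0] * dim for _ in range(n_classes)]
    b = [0.0] * n_classes
    for _ in range(epochs):
        grad_W = [[0.0] * dim for _ in range(n_classes)]
        grad_b = [0.0] * n_classes
        for f, y in zip(features, labels):
            probs = softmax(head_scores(W, b, f))
            for k in range(n_classes):
                err = probs[k] - (1.0 if k == y else 0.0)
                grad_b[k] += err
                for j in range(dim):
                    grad_W[k][j] += err * f[j]
        for k in range(n_classes):
            b[k] -= lr * grad_b[k] / n
            for j in range(dim):
                W[k][j] -= lr * grad_W[k][j] / n
    return W, b


def predict(W, b, f):
    scores = head_scores(W, b, f)
    return max(range(len(scores)), key=scores.__getitem__)


# Synthetic "images": 8-dimensional vectors clustered by class (made-up data).
rng = random.Random(1)
inputs, labels = [], []
for label in range(len(CLASSES)):
    for _ in range(10):
        x = [rng.gauss(0.0, 0.3) for _ in range(8)]
        x[2 * label] += 3.0
        x[2 * label + 1] += 3.0
        inputs.append(x)
        labels.append(label)

extract = make_frozen_extractor(in_dim=8, feat_dim=16)
features = [extract(x) for x in inputs]  # computed once, since the extractor never changes
W, b = train_head(features, labels, n_classes=len(CLASSES))
accuracy = sum(predict(W, b, f) == y for f, y in zip(features, labels)) / len(labels)
print(f"training accuracy on toy data: {accuracy:.2f}")
```

Because the extractor never changes, its features can be computed once and reused across every training epoch, which is a large part of why this setup is cheap to train. In a framework such as PyTorch, the same idea is usually expressed by setting `requires_grad` to `False` on the backbone's parameters and optimizing only the new classification layer. The accuracy printed here is measured on the toy training data and says nothing about the paper's reported results.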
