Advanced Vision Transformers and Open-Set Learning for Robust Mosquito Classification: A Novel Approach to Entomological Studies
Ahmed Akib Jawad Karim, Muhammad Zawad Mahmud, Riasat Khan

TL;DR
This paper introduces a novel framework combining vision transformers and open-set learning for accurate and robust mosquito species classification, improving surveillance and control efforts in public health.
Contribution
It presents an innovative integration of Transformer models with open-set learning techniques for enhanced mosquito classification accuracy and generalizability.
Findings
Swin Transformer achieves 99.80% accuracy and 0.998 F1 score.
MobileViT attains 98.90% accuracy with fewer parameters.
Open-set learning enables detection of unseen insect classes.
Abstract
Mosquito-related diseases pose a significant threat to global public health, necessitating efficient and accurate mosquito classification for effective surveillance and control. This work presents an innovative approach to mosquito classification by leveraging state-of-the-art vision transformers and open-set learning techniques. A novel framework has been introduced that integrates Transformer-based deep learning models with comprehensive data augmentation and preprocessing methods, enabling robust and precise identification of ten mosquito species. The Swin Transformer model achieves the best performance for traditional closed-set learning with 99.80% accuracy and 0.998 F1 score. The lightweight MobileViT technique attains an almost similar accuracy of 98.90% with significantly reduced parameters and model complexities. Next, the applied deep learning models' adaptability and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSmart Agriculture and AI
MethodsDepthwise Convolution · Average Pooling · Linear Layer · Max Pooling · Layer Normalization · Multi-Head Attention · Stochastic Depth · Attention Is All You Need · Pointwise Convolution · Depthwise Separable Convolution
