Vision Transformers for Weakly-Supervised Microorganism Enumeration
Javier Ure\~na Santiago, Thomas Str\"ohle, Antonio, Rodr\'iguez-S\'anchez, Ruth Breu

TL;DR
This paper compares vision transformers and traditional models for counting microorganisms in images, showing ViTs perform competitively and opening new research avenues in microbial image analysis.
Contribution
It provides a comparative analysis of ViTs versus ResNets for microorganism counting, highlighting ViTs' potential in weakly-supervised microbial enumeration.
Findings
ResNets outperform ViTs overall
ViTs show competent results across datasets
ViTs open new research directions in microbial counting
Abstract
Microorganism enumeration is an essential task in many applications, such as assessing contamination levels or ensuring health standards when evaluating surface cleanliness. However, it's traditionally performed by human-supervised methods that often require manual counting, making it tedious and time-consuming. Previous research suggests automating this task using computer vision and machine learning methods, primarily through instance segmentation or density estimation techniques. This study conducts a comparative analysis of vision transformers (ViTs) for weakly-supervised counting in microorganism enumeration, contrasting them with traditional architectures such as ResNet and investigating ViT-based models such as TransCrowd. We trained different versions of ViTs as the architectural backbone for feature extraction using four microbiology datasets to determine potential new…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCell Image Analysis Techniques · Image Processing Techniques and Applications · Image Retrieval and Classification Techniques
MethodsMax Pooling · Convolution · Average Pooling · Global Average Pooling · Kaiming Initialization
