CapsuleNet: A Deep Learning Model To Classify GI Diseases Using EfficientNet-b7
Aniket Das, Ayushman Singh, Nishant, Sharad Prakash

TL;DR
This paper introduces CapsuleNet, a deep learning model utilizing EfficientNet-b7 for classifying GI diseases from capsule endoscopy images, demonstrating improved accuracy and efficiency over baseline models.
Contribution
The study presents a novel CapsuleNet model that combines EfficientNet-b7 with specialized layers for GI disease classification, addressing data imbalance and optimizing inference speed.
Findings
Achieved 84.5% micro accuracy on validation data
Outperformed VGG16 baseline in most classes
Demonstrated effectiveness on imbalanced dataset
Abstract
Gastrointestinal (GI) diseases represent a significant global health concern, with Capsule Endoscopy (CE) offering a non-invasive method for diagnosis by capturing a large number of GI tract images. However, the sheer volume of video frames necessitates automated analysis to reduce the workload on doctors and increase the diagnostic accuracy. In this paper, we present CapsuleNet, a deep learning model developed for the Capsule Vision 2024 Challenge, aimed at classifying 10 distinct GI abnormalities. Using a highly imbalanced dataset, we implemented various data augmentation strategies, reducing the data imbalance to a manageable level. Our model leverages a pretrained EfficientNet-b7 backbone, tuned with additional layers for classification and optimized with PReLU activation functions. The model demonstrated superior performance on validation data, achieving a micro accuracy of 84.5%…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · COVID-19 diagnosis using AI · Digital Imaging for Blood Diseases
MethodsParameterized ReLU
