PS-DeVCEM: Pathology-sensitive deep learning model for video capsule endoscopy based on weakly labeled data
A. Mohammed, I. Farup, M. Pedersen, S. Yildirim, and {\O} Hovde

TL;DR
This paper introduces PS-DeVCEM, a deep learning model for detecting and classifying colon diseases in video capsule endoscopy, trained with weak labels and capable of temporal localization without detailed annotations.
Contribution
The paper presents a novel weakly supervised deep learning approach with attention mechanisms and self-supervision for colon disease detection in VCE videos, along with a new annotated dataset.
Findings
Achieved 61.6% precision and 55.1% F1-score, outperforming state-of-the-art methods.
Enabled temporal localization of pathologies without frame-level annotations.
Collected and annotated the largest VCE dataset with weak labels.
Abstract
We propose a novel pathology-sensitive deep learning model (PS-DeVCEM) for frame-level anomaly detection and multi-label classification of different colon diseases in video capsule endoscopy (VCE) data. Our proposed model is capable of coping with the key challenge of colon apparent heterogeneity caused by several types of diseases. Our model is driven by attention-based deep multiple instance learning and is trained end-to-end on weakly labeled data using video labels instead of detailed frame-by-frame annotation. The spatial and temporal features are obtained through ResNet50 and residual Long short-term memory (residual LSTM) blocks, respectively. Additionally, the learned temporal attention module provides the importance of each frame to the final label prediction. Moreover, we developed a self-supervision method to maximize the distance between classes of pathologies. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
