Lightweight Relational Embedding in Task-Interpolated Few-Shot Networks for Enhanced Gastrointestinal Disease Classification
Xinliu Zhong, Leo Hwa Liang, Angela S. Koh, Yeo Si Yong

TL;DR
This paper introduces a lightweight relational embedding approach within a task-interpolated few-shot network to improve gastrointestinal disease classification from endoscopic images, addressing challenges of image quality and similarity.
Contribution
It presents a novel deep learning architecture combining task interpolation, relational embedding, and bi-level routing attention for enhanced few-shot endoscopic image classification.
Findings
Achieved 90.1% accuracy on Kvasir dataset
Outperformed existing state-of-the-art methods
Demonstrated robustness across diverse datasets
Abstract
Traditional diagnostic methods like colonoscopy are invasive yet critical tools necessary for accurately diagnosing colorectal cancer (CRC). Detection of CRC at early stages is crucial for increasing patient survival rates. However, colonoscopy is dependent on obtaining adequate and high-quality endoscopic images. Prolonged invasive procedures are inherently risky for patients, while suboptimal or insufficient images hamper diagnostic accuracy. These images, typically derived from video frames, often exhibit similar patterns, posing challenges in discrimination. To overcome these challenges, we propose a novel Deep Learning network built on a Few-Shot Learning architecture, which includes a tailored feature extractor, task interpolation, relational embedding, and a bi-level routing attention mechanism. The Few-Shot Learning paradigm enables our model to rapidly adapt to unseen…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSoftmax · Attention Is All You Need · Routing Attention
