CB-HVTNet: A channel-boosted hybrid vision transformer network for   lymphocyte assessment in histopathological images

Momina Liaqat Ali; Zunaira Rauf; Asifullah Khan; Anabia Sohail; Rafi; Ullah; Jeonghwan Gwak

arXiv:2305.09211·eess.IV·October 30, 2023·2 cites

CB-HVTNet: A channel-boosted hybrid vision transformer network for lymphocyte assessment in histopathological images

Momina Liaqat Ali, Zunaira Rauf, Asifullah Khan, Anabia Sohail, Rafi, Ullah, Jeonghwan Gwak

PDF

Open Access

TL;DR

CB HVT is a novel hybrid vision transformer network that combines CNNs and transformers with channel boosting to improve lymphocyte detection in histopathological images, outperforming existing models.

Contribution

The paper introduces a channel boosting mechanism using transfer learning and a hybrid CNN-transformer architecture for enhanced lymphocyte analysis.

Findings

01

Outperforms state-of-the-art detection models

02

Demonstrates strong generalization on public datasets

03

Effectively handles overlapping regions and artifacts

Abstract

Transformers, due to their ability to learn long range dependencies, have overcome the shortcomings of convolutional neural networks (CNNs) for global perspective learning. Therefore, they have gained the focus of researchers for several vision related tasks including medical diagnosis. However, their multi-head attention module only captures global level feature representations, which is insufficient for medical images. To address this issue, we propose a Channel Boosted Hybrid Vision Transformer (CB HVT) that uses transfer learning to generate boosted channels and employs both transformers and CNNs to analyse lymphocytes in histopathological images. The proposed CB HVT comprises five modules, including a channel generation module, channel exploitation module, channel merging module, region-aware module, and a detection and segmentation head, which work together to effectively identify…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Imaging for Blood Diseases · AI in cancer detection · COVID-19 diagnosis using AI

MethodsAttention Is All You Need · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Adam · Softmax · Layer Normalization · Byte Pair Encoding · Dropout · Linear Layer · Label Smoothing