CB-HVTNet: A channel-boosted hybrid vision transformer network for lymphocyte assessment in histopathological images
Momina Liaqat Ali, Zunaira Rauf, Asifullah Khan, Anabia Sohail, Rafi, Ullah, Jeonghwan Gwak

TL;DR
CB HVT is a novel hybrid vision transformer network that combines CNNs and transformers with channel boosting to improve lymphocyte detection in histopathological images, outperforming existing models.
Contribution
The paper introduces a channel boosting mechanism using transfer learning and a hybrid CNN-transformer architecture for enhanced lymphocyte analysis.
Findings
Outperforms state-of-the-art detection models
Demonstrates strong generalization on public datasets
Effectively handles overlapping regions and artifacts
Abstract
Transformers, due to their ability to learn long range dependencies, have overcome the shortcomings of convolutional neural networks (CNNs) for global perspective learning. Therefore, they have gained the focus of researchers for several vision related tasks including medical diagnosis. However, their multi-head attention module only captures global level feature representations, which is insufficient for medical images. To address this issue, we propose a Channel Boosted Hybrid Vision Transformer (CB HVT) that uses transfer learning to generate boosted channels and employs both transformers and CNNs to analyse lymphocytes in histopathological images. The proposed CB HVT comprises five modules, including a channel generation module, channel exploitation module, channel merging module, region-aware module, and a detection and segmentation head, which work together to effectively identify…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Imaging for Blood Diseases · AI in cancer detection · COVID-19 diagnosis using AI
MethodsAttention Is All You Need · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Adam · Softmax · Layer Normalization · Byte Pair Encoding · Dropout · Linear Layer · Label Smoothing
