CT Liver Segmentation via PVT-based Encoding and Refined Decoding
Debesh Jha, Nikhil Kumar Tomar, Koushik Biswas, Gorkem Durak, Alpay, Medetalibeyoglu, Matthew Antalek, Yury Velichko, Daniela Ladner, Amir, Borhani, Ulas Bagci

TL;DR
This paper introduces PVTFormer, a novel deep learning model based on a pretrained pyramid vision transformer, designed for accurate liver segmentation from CT scans, achieving state-of-the-art results on the LiTS 2017 benchmark.
Contribution
The paper presents PVTFormer, combining PVT v2 with residual upsampling and hierarchical decoding, a novel architecture for improved liver segmentation accuracy.
Findings
Achieved a dice coefficient of 86.78% on LiTS 2017
Attained a mean Intersection over Union (mIoU) of 78.46%
Obtained a Hausdorff Distance of 3.50, indicating precise segmentation
Abstract
Accurate liver segmentation from CT scans is essential for effective diagnosis and treatment planning. Computer-aided diagnosis systems promise to improve the precision of liver disease diagnosis, disease progression, and treatment planning. In response to the need, we propose a novel deep learning approach, \textit{\textbf{PVTFormer}}, that is built upon a pretrained pyramid vision transformer (PVT v2) combined with advanced residual upsampling and decoder block. By integrating a refined feature channel approach with a hierarchical decoding strategy, PVTFormer generates high quality segmentation masks by enhancing semantic features. Rigorous evaluation of the proposed method on Liver Tumor Segmentation Benchmark (LiTS) 2017 demonstrates that our proposed architecture not only achieves a high dice coefficient of 86.78\%, mIoU of 78.46\%, but also obtains a low HD of 3.50. The results…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 diagnosis using AI · Brain Tumor Detection and Classification · AI in cancer detection
MethodsMulti-Head Attention · Attention Is All You Need · Layer Normalization · Linear Layer · Softmax · Residual Connection · Dense Connections · Vision Transformer
