Image Segmentation with transformers: An Overview, Challenges and Future

Deepjyoti Chetia; Debasish Dutta; Sanjib Kr Kalita

arXiv:2501.09372·cs.CV·January 17, 2025

Image Segmentation with transformers: An Overview, Challenges and Future

Deepjyoti Chetia, Debasish Dutta, Sanjib Kr Kalita

PDF

Open Access

TL;DR

This paper reviews how transformer architectures are revolutionizing image segmentation by addressing CNN limitations, highlighting current challenges, solutions, and future research directions in the field.

Contribution

It provides a comprehensive overview of transformer-based segmentation models, analyzing their advantages, challenges, and potential future developments.

Findings

01

Transformer models improve segmentation accuracy over CNNs.

02

Current challenges include model efficiency and data requirements.

03

Future trends involve lightweight architectures and better data efficiency.

Abstract

Image segmentation, a key task in computer vision, has traditionally relied on convolutional neural networks (CNNs), yet these models struggle with capturing complex spatial dependencies, objects with varying scales, need for manually crafted architecture components and contextual information. This paper explores the shortcomings of CNN-based models and the shift towards transformer architectures -to overcome those limitations. This work reviews state-of-the-art transformer-based segmentation models, addressing segmentation-specific challenges and their solutions. The paper discusses current challenges in transformer-based segmentation and outlines promising future trends, such as lightweight architectures and enhanced data efficiency. This survey serves as a guide for understanding the impact of transformers in advancing segmentation capabilities and overcoming the limitations of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBrain Tumor Detection and Classification · Currency Recognition and Detection · Neural Networks and Applications