HyCTAS: Multi-Objective Hybrid Convolution-Transformer Architecture Search for Real-Time Image Segmentation
Hongyuan Yu, Cheng Wan, Xiyang Dai, Mengchen Liu, Dongdong Chen, Bin Xiao, Yan Huang, Yuan Lu, and Liang Wang

TL;DR
HyCTAS introduces a multi-objective architecture search method to efficiently design hybrid convolution-transformer models for real-time image segmentation, balancing accuracy and latency without extensive manual tuning.
Contribution
The paper proposes a novel multi-target supernet search algorithm that effectively integrates multi-head self-attention into high-resolution CNNs for segmentation.
Findings
Achieves competitive real-time segmentation accuracy on Cityscapes, ADE20K, and COCO.
Finds architectures on the Pareto front balancing latency and mIoU.
Demonstrates effectiveness without ImageNet pretraining.
Abstract
Real-time image segmentation demands architectures that preserve fine spatial detail while capturing global context under tight latency and memory budgets. Image segmentation is one of the most fundamental problems in computer vision and has drawn a lot of attention due to its vast applications in image understanding and autonomous driving. However, designing effective and efficient segmentation neural architectures is a labor-intensive process that may require numerous trials by human experts. In this paper, we address the challenge of integrating multi-head self-attention into high-resolution representation CNNs efficiently by leveraging architecture search. Manually replacing convolution layers with multi-head self-attention is non-trivial due to the costly overhead in memory to maintain high resolution. By contrast, we develop a multi-target multi-branch supernet method, which not…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Image Segmentation Techniques · Image Processing Techniques and Applications · Image Retrieval and Classification Techniques
MethodsConvolution
