Multi-scale Feature Enhancement in Multi-task Learning for Medical Image Analysis
Phuoc-Nguyen Bui, Duc-Tai Le, Junghyun Bum, Hyunseung Choo

TL;DR
This paper introduces a multi-task learning model for medical image analysis that combines a novel ResFormer encoder with multi-scale feature enhancement, significantly improving segmentation and classification accuracy.
Contribution
It proposes a UNet-based multi-task model with a ResFormer encoder and dilated feature enhancement modules to better balance local and global features for medical imaging tasks.
Findings
Outperforms state-of-the-art methods in segmentation accuracy.
Achieves higher classification accuracy across multiple datasets.
Effectively captures multi-scale features for lesion detection.
Abstract
Traditional deep learning methods in medical imaging often focus solely on segmentation or classification, limiting their ability to leverage shared information. Multi-task learning (MTL) addresses this by combining both tasks through shared representations but often struggles to balance local spatial features for segmentation and global semantic features for classification, leading to suboptimal performance. In this paper, we propose a simple yet effective UNet-based MTL model, where features extracted by the encoder are used to predict classification labels, while the decoder produces the segmentation mask. The model introduces an advanced encoder incorporating a novel ResFormer block that integrates local context from convolutional feature extraction with long-range dependencies modeled by the Transformer. This design captures broader contextual relationships and fine-grained…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Brain Tumor Detection and Classification
MethodsAttention Is All You Need · Absolute Position Encodings · Residual Connection · Adam · Softmax · Label Smoothing · Dropout · Dense Connections · Layer Normalization · Linear Layer
