MTMed3D: A Multi-Task Transformer-Based Model for 3D Medical Imaging
Fan Li, Arun Iyengar, Lanyu Xu

TL;DR
MTMed3D introduces a multi-task Transformer-based model for 3D medical imaging that jointly performs detection, segmentation, and classification, improving efficiency and maintaining high performance across tasks.
Contribution
It is the first to leverage Transformers for simultaneous multi-task learning in 3D medical imaging, combining detection, segmentation, and classification.
Findings
Achieves better detection results than prior methods.
Reduces computational costs and speeds up inference.
Maintains performance comparable to single-task models.
Abstract
In the field of medical imaging, AI-assisted techniques such as object detection, segmentation, and classification are widely employed to alleviate the workload of physicians and doctors. However, single-task models are predominantly used, overlooking the shared information across tasks. This oversight leads to inefficiencies in real-life applications. In this work, we propose MTMed3D, a novel end-to-end Multi-task Transformer-based model to address the limitations of single-task models by jointly performing 3D detection, segmentation, and classification in medical imaging. Our model uses a Transformer as the shared encoder to generate multi-scale features, followed by CNN-based task-specific decoders. The proposed framework was evaluated on the BraTS 2018 and 2019 datasets, achieving promising results across all three tasks, especially in detection, where our method achieves better…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Artificial Intelligence in Healthcare and Education · Domain Adaptation and Few-Shot Learning
