A CNN-Transformer for Classification of Longitudinal 3D MRI Images -- A Case Study on Hepatocellular Carcinoma Prediction
Jakob Nolte, Maureen M. J. Guichelaar, Donald E. Bouman, Stephanie M., van den Berg, Maryam Amir Haeri

TL;DR
This paper introduces HCCNet, a novel CNN-Transformer model that effectively analyzes longitudinal 3D MRI data for improved hepatocellular carcinoma prediction, addressing challenges like limited data and irregular screening intervals.
Contribution
HCCNet combines a 3D ConvNeXt CNN with a Transformer encoder and employs a two-stage pre-training process tailored for longitudinal MRI data, enhancing disease progression modeling.
Findings
HCCNet outperforms baseline models in predictive accuracy.
The model effectively handles irregular screening intervals.
Pre-training improves model robustness and generalization.
Abstract
Longitudinal MRI analysis is crucial for predicting disease outcomes, particularly in chronic conditions like hepatocellular carcinoma (HCC), where early detection can significantly influence treatment strategies and patient prognosis. Yet, due to challenges like limited data availability, subtle parenchymal changes, and the irregular timing of medical screenings, current approaches have so far focused on cross-sectional imaging data. To address this, we propose HCCNet, a novel model architecture that integrates a 3D adaptation of the ConvNeXt CNN architecture with a Transformer encoder, capturing both the intricate spatial features of 3D MRIs and the complex temporal dependencies across different time points. HCCNet utilizes a two-stage pre-training process tailored for longitudinal MRI data. The CNN backbone is pre-trained using a self-supervised learning framework adapted for 3D…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBrain Tumor Detection and Classification · Radiomics and Machine Learning in Medical Imaging · Medical Imaging and Analysis
MethodsAttention Is All You Need · Adam · Softmax · Absolute Position Encodings · Residual Connection · Dropout · Byte Pair Encoding · Linear Layer · ConvNeXt · Multi-Head Attention
