FocusMAE: Gallbladder Cancer Detection from Ultrasound Videos with Focused Masked Autoencoders
Soumen Basu, Mayuna Gupta, Chetan Madan, Pankaj Gupta, Chetan Arora

TL;DR
This paper introduces FocusMAE, a novel video-based approach using focused masked autoencoders for improved gallbladder cancer detection from ultrasound videos, achieving state-of-the-art accuracy and demonstrating generality on CT data.
Contribution
It proposes FocusMAE, a new masking strategy for autoencoders that emphasizes high-information regions in ultrasound videos for better disease representation, along with the creation of the largest US video dataset for GBC detection.
Findings
Achieved 96.4% accuracy on GBC detection, surpassing previous methods.
Demonstrated the effectiveness of FocusMAE on a public CT-based Covid dataset.
First study to utilize US video data for GBC detection.
Abstract
In recent years, automated Gallbladder Cancer (GBC) detection has gained the attention of researchers. Current state-of-the-art (SOTA) methodologies relying on ultrasound sonography (US) images exhibit limited generalization, emphasizing the need for transformative approaches. We observe that individual US frames may lack sufficient information to capture disease manifestation. This study advocates for a paradigm shift towards video-based GBC detection, leveraging the inherent advantages of spatiotemporal representations. Employing the Masked Autoencoder (MAE) for representation learning, we address shortcomings in conventional image-based methods. We propose a novel design called FocusMAE to systematically bias the selection of masking tokens from high-information regions, fostering a more refined representation of malignancy. Additionally, we contribute the most extensive US video…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging
