Distributional Drift Detection in Medical Imaging with Sketching and Fine-Tuned Transformer
Yusen Wu, Phuong Nguyen, Rose Yesha, Yelena Yesha

TL;DR
This paper introduces a novel approach combining data-sketching and fine-tuned Vision Transformers to detect distributional drift in medical imaging, achieving high accuracy and sensitivity in real-time anomaly detection and feature comparison.
Contribution
It presents a new scalable method for distributional drift detection in medical images using data-sketching and fine-tuned transformers, improving accuracy and sensitivity over existing techniques.
Findings
Achieved 99.11% accuracy in mammography classification.
Enhanced dataset similarity detection from 50% to 99.1%.
Demonstrated high sensitivity to minimal noise levels.
Abstract
Distributional drift detection is important in medical applications as it helps ensure the accuracy and reliability of models by identifying changes in the underlying data distribution that could affect the prediction results of machine learning models. However, current methods have limitations in detecting drift, for example, the inclusion of abnormal datasets can lead to unfair comparisons. This paper presents an accurate and sensitive approach to detect distributional drift in CT-scan medical images by leveraging data-sketching and fine-tuning techniques. We developed a robust baseline library model for real-time anomaly detection, allowing for efficient comparison of incoming images and identification of anomalies. Additionally, we fine-tuned a pre-trained Vision Transformer model to extract relevant features, using mammography as a case study, significantly enhancing model accuracy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Image and Signal Denoising Methods · Time Series Analysis and Forecasting
MethodsDropout · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Absolute Position Encodings · Label Smoothing · Adam · Transformer · Softmax · Linear Layer · Residual Connection
