Vision Foundry: A System for Training Foundational Vision AI Models

Mahmut S. Gokmen; Mitchell A. Klusty; Evan W. Damron; W. Vaiden Logan; Aaron D. Mullen; Caroline N. Leach; Emily B. Collier; Samuel E. Armstrong; V.K. Cody Bumgardner

arXiv:2512.11837·q-bio.QM·December 16, 2025

Vision Foundry: A System for Training Foundational Vision AI Models

Mahmut S. Gokmen, Mitchell A. Klusty, Evan W. Damron, W. Vaiden Logan, Aaron D. Mullen, Caroline N. Leach, Emily B. Collier, Samuel E. Armstrong, V.K. Cody Bumgardner

PDF

Open Access

TL;DR

Vision Foundry is a user-friendly, HIPAA-compliant platform that enables clinical researchers to train and deploy foundational vision AI models using self-supervised learning, significantly reducing technical barriers and annotation needs.

Contribution

The paper introduces Vision Foundry, a code-free platform that integrates advanced SSL strategies for easy training and deployment of vision models in medical imaging.

Findings

01

Models trained with Vision Foundry outperform baselines in segmentation and regression tasks.

02

The platform demonstrates robust zero-shot generalization across different imaging protocols.

03

It enables domain experts to develop clinical AI tools with minimal annotation effort.

Abstract

Self-supervised learning (SSL) leverages vast unannotated medical datasets, yet steep technical barriers limit adoption by clinical researchers. We introduce Vision Foundry, a code-free, HIPAA-compliant platform that democratizes pre-training, adaptation, and deployment of foundational vision models. The system integrates the DINO-MX framework, abstracting distributed infrastructure complexities while implementing specialized strategies like Magnification-Aware Distillation (MAD) and Parameter-Efficient Fine-Tuning (PEFT). We validate the platform across domains, including neuropathology segmentation, lung cellularity estimation, and coronary calcium scoring. Our experiments demonstrate that models trained via Vision Foundry significantly outperform generic baselines in segmentation fidelity and regression accuracy, while exhibiting robust zero-shot generalization across imaging…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI · Medical Image Segmentation Techniques