TL;DR
The paper introduces USTri, a three-stage ultrasound analysis pipeline that generalizes across organs, tasks, and devices, enabling end-to-end clinical workflow automation with high accuracy.
Contribution
It proposes a novel tri-stage framework combining a universal generalist, dataset-specific fine-tuning, and workflow-mimicking inference, advancing ultrasound AI capabilities.
Findings
Achieves state-of-the-art performance on FMC_UIA dataset across multiple tasks.
Produces clinically structured, interpretable reports with high accuracy.
Demonstrates robustness to device and protocol variability.
Abstract
Clinical ultrasound analysis demands models that generalize across heterogeneous organs, views, and devices, while supporting interpretable workflow-level analysis. Existing methods often rely on task-wise adaptation, and joint learning may be unstable due to cross-task interference, making it hard to deliver workflow-level outputs in practice. To address these challenges, we present USTri, a tri-stage ultrasound intelligence pipeline for unified multi-organ, multi-task analysis. Stage I trains a universal generalist USGen on different domains to learn broad, transferable priors that are robust to device and protocol variability. To better handle domain shifts and reach task-aligned performance while preserving ultrasound shared knowledge, Stage II builds USpec by keeping USGen frozen and finetuning dataset-specific heads. Stage III introduces USAgent, which mimics clinician workflows…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
