Scaling In-Context Segmentation with Hierarchical Supervision

T. Camaret Ndir; Marco Reisert; Robin T. Schirrmeister

arXiv:2604.12752·cs.CV·April 23, 2026

Scaling In-Context Segmentation with Hierarchical Supervision

T. Camaret Ndir, Marco Reisert, Robin T. Schirrmeister

PDF

1 Repo

TL;DR

PatchICL introduces a hierarchical, selective attention framework for medical image segmentation that reduces computation and improves out-of-domain performance by focusing on informative regions.

Contribution

It presents a novel hierarchical approach combining patch selection and multi-level supervision, enhancing efficiency and generalization in in-context segmentation.

Findings

01

Reduces compute by 44% at 512×512 resolution compared to baseline.

02

Outperforms baseline on 6 of 13 out-of-domain modalities.

03

Achieves competitive accuracy on in-domain CT segmentation.

Abstract

In-context learning (ICL) enables medical image segmentation models to adapt to new anatomical structures from limited examples, reducing the clinical annotation burden. However, standard ICL methods typically rely on dense, global cross-attention, which scales poorly with image resolution. While recent approaches have introduced localized attention mechanisms, they often lack explicit supervision on the selection process, leading to redundant computation in non-informative regions. We propose PatchICL, a hierarchical framework that combines selective image patching with multi-level supervision. Our approach learns to actively identify and attend only to the most informative anatomical regions. Compared to UniverSeg, a strong global-attention baseline, PatchICL achieves competitive in-domain CT segmentation accuracy while reducing compute by 44\% at $512 \times 512$ resolution. On 35…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tidiane-camaret/ic_segmentation
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.