AM-SAM: Automated Prompting and Mask Calibration for Segment Anything Model

Yuchen Li; Li Zhang; Youwei Liang; Pengtao Xie

arXiv:2410.09714·cs.CV·January 21, 2026

AM-SAM: Automated Prompting and Mask Calibration for Segment Anything Model

Yuchen Li, Li Zhang, Youwei Liang, Pengtao Xie

PDF

TL;DR

AM-SAM introduces an automated prompting and mask calibration approach for the Segment Anything Model, significantly reducing human effort and improving segmentation accuracy through bi-level optimization and advanced feature representation techniques.

Contribution

It presents a novel bi-level optimization framework that automates prompt generation and enhances mask decoding without retraining SAM extensively.

Findings

01

Achieves higher dice scores on body segmentation datasets.

02

Matches or exceeds performance of human prompts.

03

Demonstrates faster convergence with automated prompts.

Abstract

Segment Anything Model (SAM) has gained significant recognition in the field of semantic segmentation due to its versatile capabilities and impressive performance. Despite its success, SAM faces two primary limitations: (1) it relies heavily on meticulous human-provided prompts like key points, bounding boxes or text messages, which is labor-intensive; (2) the mask decoder's feature representation is sometimes inaccurate, as it solely employs dot product operations at the end of mask decoder, which inadequately captures the necessary correlations for precise segmentation. Current solutions to these problems such as fine-tuning SAM often require retraining a large number of parameters, which needs huge amount of time and computing resources. To address these limitations, we propose an automated prompting and mask calibration method called AM-SAM based on a bi-level optimization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSegment Anything Model · Sparse Evolutionary Training