AM-SAM: Automated Prompting and Mask Calibration for Segment Anything Model
Yuchen Li, Li Zhang, Youwei Liang, Pengtao Xie

TL;DR
AM-SAM introduces an automated prompting and mask calibration approach for the Segment Anything Model, significantly reducing human effort and improving segmentation accuracy through bi-level optimization and advanced feature representation techniques.
Contribution
It presents a novel bi-level optimization framework that automates prompt generation and enhances mask decoding without retraining SAM extensively.
Findings
Achieves higher dice scores on body segmentation datasets.
Matches or exceeds performance of human prompts.
Demonstrates faster convergence with automated prompts.
Abstract
Segment Anything Model (SAM) has gained significant recognition in the field of semantic segmentation due to its versatile capabilities and impressive performance. Despite its success, SAM faces two primary limitations: (1) it relies heavily on meticulous human-provided prompts like key points, bounding boxes or text messages, which is labor-intensive; (2) the mask decoder's feature representation is sometimes inaccurate, as it solely employs dot product operations at the end of mask decoder, which inadequately captures the necessary correlations for precise segmentation. Current solutions to these problems such as fine-tuning SAM often require retraining a large number of parameters, which needs huge amount of time and computing resources. To address these limitations, we propose an automated prompting and mask calibration method called AM-SAM based on a bi-level optimization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSegment Anything Model · Sparse Evolutionary Training
