TL;DR
AoP-SAM automates prompt generation for the Segment Anything Model, significantly improving efficiency and accuracy in image segmentation without manual prompts or fine-tuning.
Contribution
This paper introduces AoP-SAM, a novel automatic prompt generation method that enhances SAM's usability and efficiency for real-world segmentation tasks.
Findings
Substantial improvement in prompt generation efficiency
Enhanced mask accuracy across multiple datasets
Reduced computational overhead in segmentation process
Abstract
The Segment Anything Model (SAM) is a powerful foundation model for image segmentation, showing robust zero-shot generalization through prompt engineering. However, relying on manual prompts is impractical for real-world applications, particularly in scenarios where rapid prompt provision and resource efficiency are crucial. In this paper, we propose the Automation of Prompts for SAM (AoP-SAM), a novel approach that learns to generate essential prompts in optimal locations automatically. AoP-SAM enhances SAM's efficiency and usability by eliminating manual input, making it better suited for real-world tasks. Our approach employs a lightweight yet efficient Prompt Predictor model that detects key entities across images and identifies the optimal regions for placing prompt candidates. This method leverages SAM's image embeddings, preserving its zero-shot generalization capabilities…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
MethodsSegment Anything Model
