GEM: Boost Simple Network for Glass Surface Segmentation via Segment Anything Model and Data Synthesis
Jing Hao, Moyun Liu, Kuo Feng Hung

TL;DR
This paper introduces GEM, a simple yet effective glass surface segmentation model leveraging foundation models and synthetic data, achieving state-of-the-art results in glass detection accuracy.
Contribution
The paper proposes GEM, a novel segmentation framework combining SAM and synthetic data generation, to improve glass surface detection accuracy.
Findings
GEM achieves +2.1% IoU improvement over previous methods.
Synthetic data scale positively impacts transfer learning.
The model effectively segments transparent glass surfaces.
Abstract
Detecting glass regions is a challenging task due to the ambiguity of their transparency and reflection properties. These transparent glasses share the visual appearance of both transmitted arbitrary background scenes and reflected objects, thus having no fixed patterns.Recent visual foundation models, which are trained on vast amounts of data, have manifested stunning performance in terms of image perception and image generation. To segment glass surfaces with higher accuracy, we make full use of two visual foundation models: Segment Anything (SAM) and Stable Diffusion.Specifically, we devise a simple glass surface segmentor named GEM, which only consists of a SAM backbone, a simple feature pyramid, a discerning query selection module, and a mask decoder. The discerning query selection can adaptively identify glass surface features, assigning them as initialized queries in the mask…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIndustrial Vision Systems and Defect Detection · Big Data and Business Intelligence
MethodsSparse Evolutionary Training · Diffusion · Segment Anything Model
