Power of Boundary and Reflection: Semantic Transparent Object Segmentation using Pyramid Vision Transformer with Transparent Cues

Tuan-Anh Vu; Hai Nguyen-Truong; Ziqiang Zheng; Binh-Son Hua; Qing Guo; Ivor Tsang; Sai-Kit Yeung

arXiv:2512.07034·cs.CV·December 9, 2025

Power of Boundary and Reflection: Semantic Transparent Object Segmentation using Pyramid Vision Transformer with Transparent Cues

Tuan-Anh Vu, Hai Nguyen-Truong, Ziqiang Zheng, Binh-Son Hua, Qing Guo, Ivor Tsang, Sai-Kit Yeung

PDF

Open Access

TL;DR

This paper introduces TransCues, a pyramid transformer-based framework that enhances transparent object segmentation by incorporating boundary and reflection cues, significantly outperforming existing methods across multiple datasets.

Contribution

The paper proposes a novel boundary and reflection feature enhancement approach within a pyramid transformer architecture for transparent object segmentation.

Findings

01

Achieved +4.2% mIoU on Trans10K-v2

02

Achieved +5.6% mIoU on MSD

03

Achieved +10.1% mIoU on RGBD-Mirror

Abstract

Glass is a prevalent material among solid objects in everyday life, yet segmentation methods struggle to distinguish it from opaque materials due to its transparency and reflection. While it is known that human perception relies on boundary and reflective-object features to distinguish glass objects, the existing literature has not yet sufficiently captured both properties when handling transparent objects. Hence, we propose incorporating both of these powerful visual cues via the Boundary Feature Enhancement and Reflection Feature Enhancement modules in a mutually beneficial way. Our proposed framework, TransCues, is a pyramidal transformer encoder-decoder architecture to segment transparent objects. We empirically show that these two modules can be used together effectively, improving overall performance across various benchmark datasets, including glass object semantic segmentation,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis · Image Enhancement Techniques