Cognition Transferring and Decoupling for Text-supervised Egocentric Semantic Segmentation
Zhaofeng Shi, Heqian Qiu, Lanxiao Wang, Fanman Meng, Qingbo Wu and, Hongliang Li

TL;DR
This paper introduces a novel approach for text-supervised egocentric semantic segmentation, leveraging cognition transferring and decoupling to improve recognition of dense wearer-object relations in egocentric images.
Contribution
The paper proposes a Cognition Transferring and Decoupling Network (CTDN) with modules for cognition transfer and foreground-background decoupling, addressing relation insensitivity in egocentric views.
Findings
Outperforms recent methods on four TESS benchmarks.
Effectively transfers cognition from large-scale pre-trained models.
Improves foreground-background discrimination in egocentric images.
Abstract
In this paper, we explore a novel Text-supervised Egocentic Semantic Segmentation (TESS) task that aims to assign pixel-level categories to egocentric images weakly supervised by texts from image-level labels. In this task with prospective potential, the egocentric scenes contain dense wearer-object relations and inter-object interference. However, most recent third-view methods leverage the frozen Contrastive Language-Image Pre-training (CLIP) model, which is pre-trained on the semantic-oriented third-view data and lapses in the egocentric view due to the ``relation insensitive" problem. Hence, we propose a Cognition Transferring and Decoupling Network (CTDN) that first learns the egocentric wearer-object relations via correlating the image and text. Besides, a Cognition Transferring Module (CTM) is developed to distill the cognitive knowledge from the large-scale pre-trained model to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
