Dictionary-based Framework for Interpretable and Consistent Object Parsing
Tiezheng Zhang, Qihang Yu, Alan Yuille, Ju He

TL;DR
CoCal introduces a dictionary-based, interpretable object parsing framework that leverages contrastive learning and logical constraints to improve segmentation accuracy and consistency, achieving state-of-the-art results on key benchmarks.
Contribution
The paper proposes CoCal, a novel hierarchical dictionary-based mask transformer with contrastive components and logical constraints for improved object parsing.
Findings
Achieves state-of-the-art performance on PartImageNet and Pascal-Part-108.
Outperforms previous methods by 2.08% and 0.70% in part mIoU.
Enhances object-level segmentation metrics.
Abstract
In this work, we present CoCal, an interpretable and consistent object parsing framework based on dictionary-based mask transformer. Designed around Contrastive Components and Logical Constraints, CoCal rethinks existing cluster-based mask transformer architectures used in segmentation; Specifically, CoCal utilizes a set of dictionary components, with each component being explicitly linked to a specific semantic class. To advance this concept, CoCal introduces a hierarchical formulation of dictionary components that aligns with the semantic hierarchy. This is achieved through the integration of both within-level contrastive components and cross-level logical constraints. Concretely, CoCal employs a component-wise contrastive algorithm at each semantic level, enabling the contrasting of dictionary components within the same class against those from different classes. Furthermore, CoCal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Advanced Neural Network Applications · Multimodal Machine Learning Applications
MethodsContrastive Learning · Sparse Evolutionary Training
