Learning Universal Shape Dictionary for Realtime Instance Segmentation
Tutian Tang, Wenqiang Xu, Ruolin Ye, Lixin Yang, Cewu Lu

TL;DR
This paper introduces USD-Seg, an explicit shape representation method for instance segmentation that uses a learned shape dictionary, achieving real-time performance and high accuracy on the COCO dataset.
Contribution
It proposes a universal shape dictionary for explicit shape modeling in instance segmentation, enabling fast and explainable shape predictions with minimal overhead.
Findings
Achieves 35.8 AP at 65 fps with YOLOv4 on COCO.
Attains 34.1 AP at 12 fps with FCOS on COCO.
Introduces the Efficiency of AP (AP_E) metric for computational assessment.
Abstract
We present a novel explicit shape representation for instance segmentation. Based on how to model the object shape, current instance segmentation systems can be divided into two categories, implicit and explicit models. The implicit methods, which represent the object mask/contour by intractable network parameters, and produce it through pixel-wise classification, are predominant. However, the explicit methods, which parameterize the shape with simple and explainable models, are less explored. Since the operations to generate the final shape are light-weighted, the explicit methods have a clear speed advantage over implicit methods, which is crucial for real-world applications. The proposed USD-Seg adopts a linear model, sparse coding with dictionary, for object shapes. First, it learns a dictionary from a large collection of shape datasets, making any shape being able to be…
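The idea of a linear shape model with sparse coding over a dictionary can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the paper: the function names, the hand-built four-atom dictionary (the paper learns its dictionary from data), the 0.5 binarization threshold, and the choice of ISTA as the sparse solver.

```python
import numpy as np

def soft_threshold(x, t):
    """Element-wise shrinkage used by ISTA for the L1 penalty."""
    return np.sign(x) * np.maximum(np.abs(x) - t, 0.0)

def sparse_code(shape_vec, dictionary, lam=0.01, n_iter=100):
    """Approximately solve min_c 0.5*||shape_vec - c @ D||^2 + lam*||c||_1 with ISTA.

    dictionary has shape (n_atoms, dim); shape_vec has shape (dim,).
    """
    # Step size 1/L, where L is the largest eigenvalue of D @ D.T.
    step = 1.0 / (np.linalg.norm(dictionary, 2) ** 2)
    coeffs = np.zeros(dictionary.shape[0])
    for _ in range(n_iter):
        residual = coeffs @ dictionary - shape_vec
        grad = residual @ dictionary.T
        coeffs = soft_threshold(coeffs - step * grad, step * lam)
    return coeffs

def decode(coeffs, dictionary, side):
    """Linear reconstruction followed by thresholding into a binary mask."""
    return (coeffs @ dictionary).reshape(side, side) > 0.5

# Toy dictionary (hypothetical): one atom per 2x2 quadrant of a 4x4 mask.
side = 4
atoms = []
for r in (0, 2):
    for c in (0, 2):
        atom = np.zeros((side, side))
        atom[r:r + 2, c:c + 2] = 1.0
        atoms.append(atom.ravel())
dictionary = np.stack(atoms)  # shape (4, 16)

# Target shape: top-left plus bottom-right quadrant.
target = (dictionary[0] + dictionary[3]).reshape(side, side) > 0.5

coeffs = sparse_code(target.ravel().astype(float), dictionary)
mask = decode(coeffs, dictionary, side)
```

In this sketch the only per-instance quantity is the short coefficient vector; turning it into a mask takes one matrix product and a threshold, which is the kind of lightweight decoding step the abstract credits for the speed advantage of explicit methods.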
Taxonomy
Topics: Image Processing and 3D Reconstruction · 3D Shape Modeling and Analysis · 3D Surveying and Cultural Heritage
Methods: Grid Sensitive · Tanh Activation · Average Pooling · 1x1 Convolution · Global Average Pooling · Convolution · Residual Connection · Sigmoid Activation · Bottom-up Path Augmentation
