PatchDCT: Patch Refinement for High Quality Instance Segmentation

Qinrou Wen; Jirui Yang; Xue Yang; Kewei Liang

arXiv:2302.02693 · cs.CV · February 8, 2023 · 5 cites

TL;DR

PatchDCT is a multi-stage refinement framework for high-quality instance segmentation. It splits the mask decoded from a DCT vector into patches and refines each patch with a classifier and a regressor, improving mask quality over DCT-Mask.

Contribution

The paper proposes PatchDCT, which refines DCT-based masks patch by patch rather than through a single whole-mask DCT vector, improving segmentation quality over the earlier DCT-Mask approach.

Findings

01 Achieves up to 4.5% AP improvement on COCO

02 Surpasses DCT-Mask in boundary AP by up to 4.2%

03 Competitive with state-of-the-art segmentation methods

Abstract

High-quality instance segmentation has shown emerging importance in computer vision. Without any refinement, DCT-Mask directly generates high-resolution masks by compressed vectors. To further refine masks obtained by compressed vectors, we propose for the first time a compressed vector based multi-stage refinement framework. However, the vanilla combination does not bring significant gains, because changes in some elements of the DCT vector will affect the prediction of the entire mask. Thus, we propose a simple and novel method named PatchDCT, which separates the mask decoded from a DCT vector into several patches and refines each patch by the designed classifier and regressor. Specifically, the classifier is used to distinguish mixed patches from all patches, and to correct previously mispredicted foreground and background patches. In contrast, the regressor is used for DCT vector…
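
Two points from the abstract can be illustrated with a small NumPy sketch: a change to one element of a DCT vector alters the whole decoded mask, whereas a mask split into patches can be handled patch by patch, with each patch being foreground, background, or mixed. This is not the authors' implementation. The helper names (dct_matrix, dct2, idct2, split_into_patches, patch_labels, disk_mask), the 32x32 mask, the 8x8 patch size, and the rule that labels a patch directly from a binary mask are all assumptions made for illustration; in the paper the patch classes are predicted by a learned classifier.

```python
import numpy as np

BACKGROUND, FOREGROUND, MIXED = 0, 1, 2  # hypothetical label ids


def dct_matrix(n):
    """Orthonormal DCT-II matrix of shape (n, n); row k is frequency k."""
    k = np.arange(n)[:, None]
    i = np.arange(n)[None, :]
    m = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    m[0, :] = np.sqrt(1.0 / n)
    return m


def dct2(x):
    """2D DCT of a square array."""
    d = dct_matrix(x.shape[0])
    return d @ x @ d.T


def idct2(c):
    """Inverse of dct2; the matrix is orthonormal, so its transpose inverts it."""
    d = dct_matrix(c.shape[0])
    return d.T @ c @ d


def split_into_patches(mask, patch):
    """View an (n, n) mask as a (g, g, patch, patch) grid, g = n // patch.

    Assumes n is divisible by patch.
    """
    g = mask.shape[0] // patch
    return mask.reshape(g, patch, g, patch).swapaxes(1, 2)


def patch_labels(binary_mask, patch):
    """Label each patch as all-background, all-foreground, or mixed."""
    patches = split_into_patches(binary_mask.astype(float), patch)
    frac = patches.mean(axis=(2, 3))  # foreground fraction per patch
    labels = np.full(frac.shape, MIXED)
    labels[frac == 0.0] = BACKGROUND
    labels[frac == 1.0] = FOREGROUND
    return labels


def disk_mask(n=32, radius=10):
    """Toy binary mask: a filled disk centred in an n x n grid."""
    yy, xx = np.mgrid[:n, :n]
    inside = (yy - n // 2) ** 2 + (xx - n // 2) ** 2 <= radius ** 2
    return inside.astype(float)


if __name__ == "__main__":
    mask = disk_mask()
    coeffs = dct2(mask)

    # Perturb a single DCT coefficient and see how many pixels move.
    perturbed = coeffs.copy()
    perturbed[1, 2] += 1.0
    changed = np.abs(idct2(perturbed) - idct2(coeffs)) > 1e-9
    print("pixels changed:", int(changed.sum()), "of", mask.size)

    # Patch-level view of the same mask.
    print(patch_labels(mask, patch=8))
```

Running the script reports how many pixels change when one coefficient is perturbed, then prints the grid of patch labels for the toy mask. Only the mixed patches straddle the object boundary, which is where per-patch refinement has something to correct.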

Code & Models

Repositories

olivia-w12/patchdct · PyTorch · Official

Videos

PatchDCT: Patch Refinement for High Quality Instance Segmentation · SlidesLive

Taxonomy

Topics: Advanced Image and Video Retrieval Techniques · Advanced Neural Network Applications · Image Retrieval and Classification Techniques