LVIS Challenge Track Technical Report 1st Place Solution: Distribution Balanced and Boundary Refinement for Large Vocabulary Instance Segmentation
WeiFu Fu, CongChong Nie, Ting Sun, Jun Liu, TianLiang Zhang, Yong Liu

TL;DR
This paper presents a novel instance segmentation approach for the LVIS challenge that addresses long-tail distribution and boundary quality, achieving state-of-the-art results through a combination of advanced models, data balancing, and refinement techniques.
Contribution
The paper introduces a distribution balanced method and boundary refinement techniques integrated with a transformer-based backbone to improve large vocabulary instance segmentation.
Findings
Achieved over 45.4% boundary AP on LVIS validation set.
Ranked 1st in LVIS Challenge 2021 with 48.1% AP.
Early stopping and EMA significantly improved segmentation performance.
Abstract
This report introduces the technical details of the team FuXi-Fresher for LVIS Challenge 2021. Our method focuses on the problem in following two aspects: the long-tail distribution and the segmentation quality of mask and boundary. Based on the advanced HTC instance segmentation algorithm, we connect transformer backbone(Swin-L) through composite connections inspired by CBNetv2 to enhance the baseline results. To alleviate the problem of long-tail distribution, we design a Distribution Balanced method which includes dataset balanced and loss function balaced modules. Further, we use a Mask and Boundary Refinement method composed with mask scoring and refine-mask algorithms to improve the segmentation quality. In addition, we are pleasantly surprised to find that early stopping combined with EMA method can achieve a great improvement. Finally, by using multi-scale testing and increasing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
MethodsFeature Pyramid Network · RoIAlign · Region Proposal Network · 1x1 Convolution · Convolution · Hybrid Task Cascade · Early Stopping
