Deep Semantic Parsing of Freehand Sketches with Homogeneous   Transformation, Soft-Weighted Loss, and Staged Learning

Ying Zheng; Hongxun Yao; and Xiaoshuai Sun

arXiv:1910.06023·cs.CV·December 4, 2020

Deep Semantic Parsing of Freehand Sketches with Homogeneous Transformation, Soft-Weighted Loss, and Staged Learning

Ying Zheng, Hongxun Yao, and Xiaoshuai Sun

PDF

TL;DR

This paper introduces a deep learning framework for part-level semantic parsing of freehand sketches, utilizing a homogeneous transformation for domain adaptation, a soft-weighted loss for better training, and staged learning for improved accuracy, achieving state-of-the-art results.

Contribution

The paper presents a novel homogeneous transformation method for domain adaptation, a soft-weighted loss function, and a staged learning strategy, collectively advancing freehand sketch semantic parsing.

Findings

01

Homogeneous transformation reduces domain gap between real images and sketches.

02

Soft-weighted loss improves handling of ambiguous boundaries and class imbalance.

03

The integrated framework achieves state-of-the-art performance on SketchParse dataset.

Abstract

In this paper, we propose a novel deep framework for part-level semantic parsing of freehand sketches, which makes three main contributions that are experimentally shown to have substantial practical merit. First, we propose a homogeneous transformation method to address the problem of domain adaptation. For the task of sketch parsing, there is no available data of labeled freehand sketches that can be directly used for model training. An alternative solution is to learn from datasets of real image parsing, while the domain adaptation is an inevitable problem. Unlike existing methods that utilize the edge maps of real images to approximate freehand sketches, the proposed homogeneous transformation method transforms the data from domains of real images and freehand sketches into a homogeneous space to minimize the semantic gap. Second, we design a soft-weighted loss function as guidance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.