Deep Semantic Parsing of Freehand Sketches with Homogeneous Transformation, Soft-Weighted Loss, and Staged Learning
Ying Zheng, Hongxun Yao, and Xiaoshuai Sun

TL;DR
This paper introduces a deep learning framework for part-level semantic parsing of freehand sketches, utilizing a homogeneous transformation for domain adaptation, a soft-weighted loss for better training, and staged learning for improved accuracy, achieving state-of-the-art results.
Contribution
The paper presents a novel homogeneous transformation method for domain adaptation, a soft-weighted loss function, and a staged learning strategy, collectively advancing freehand sketch semantic parsing.
Findings
Homogeneous transformation reduces domain gap between real images and sketches.
Soft-weighted loss improves handling of ambiguous boundaries and class imbalance.
The integrated framework achieves state-of-the-art performance on SketchParse dataset.
Abstract
In this paper, we propose a novel deep framework for part-level semantic parsing of freehand sketches, which makes three main contributions that are experimentally shown to have substantial practical merit. First, we propose a homogeneous transformation method to address the problem of domain adaptation. For the task of sketch parsing, there is no available data of labeled freehand sketches that can be directly used for model training. An alternative solution is to learn from datasets of real image parsing, while the domain adaptation is an inevitable problem. Unlike existing methods that utilize the edge maps of real images to approximate freehand sketches, the proposed homogeneous transformation method transforms the data from domains of real images and freehand sketches into a homogeneous space to minimize the semantic gap. Second, we design a soft-weighted loss function as guidance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
