Learning from the Tangram to Solve Mini Visual Tasks

Yizhou Zhao; Liang Qiu; Pan Lu; Feng Shi; Tian Han and; Song-Chun Zhu

arXiv:2112.06113·cs.CV·December 14, 2021

Learning from the Tangram to Solve Mini Visual Tasks

Yizhou Zhao, Liang Qiu, Pan Lu, Feng Shi, Tian Han and, Song-Chun Zhu

PDF

1 Repo 1 Video

TL;DR

This paper introduces the Tangram dataset and demonstrates that pre-training on abstract Tangram puzzles enhances neural network performance on various mini visual tasks involving low-resolution images, such as aesthetic assessments and icon recognition.

Contribution

The work creates a novel Tangram dataset inspired by human puzzle-solving, showing that pre-training on it improves performance on diverse low-resolution visual tasks.

Findings

01

Pre-training on Tangram aids in solving aesthetic tasks like folding clothes and room layout evaluation.

02

It accelerates few-shot learning in human handwriting recognition.

03

It improves icon identification accuracy based on contours.

Abstract

Current pre-training methods in computer vision focus on natural images in the daily-life context. However, abstract diagrams such as icons and symbols are common and important in the real world. This work is inspired by Tangram, a game that requires replicating an abstract pattern from seven dissected shapes. By recording human experience in solving tangram puzzles, we present the Tangram dataset and show that a pre-trained neural model on the Tangram helps solve some mini visual tasks based on low-resolution vision. Extensive experiments demonstrate that our proposed method generates intelligent solutions for aesthetic tasks such as folding clothes and evaluating room layouts. The pre-trained feature extractor can facilitate the convergence of few-shot learning tasks on human handwriting and improve the accuracy in identifying icons by their contours. The Tangram dataset is available…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yizhouzhao/tangram
noneOfficial

Videos

Learning from the Tangram to Solve Mini Visual Tasks· underline