Semi-Automatic Data Annotation guided by Feature Space Projection

Barbara Caroline Benato; Jancarlo Ferreira Gomes; Alexandru; Cristian Telea; Alexandre Xavier Falc\~ao

arXiv:2007.13689·cs.LG·August 25, 2020

Semi-Automatic Data Annotation guided by Feature Space Projection

Barbara Caroline Benato, Jancarlo Ferreira Gomes, Alexandru, Cristian Telea, Alexandre Xavier Falc\~ao

PDF

TL;DR

This paper introduces a semi-automatic data annotation method that leverages feature space projection and semi-supervised learning to reduce manual labeling effort and improve classification accuracy, validated on MNIST and parasite images.

Contribution

The paper proposes a novel semi-automatic annotation approach combining feature space projection with semi-supervised learning, enhancing label propagation efficiency and accuracy.

Findings

01

Effective label propagation reduces manual annotation effort.

02

Improved classification accuracy on diverse datasets.

03

Visual analytics tools enhance human-machine collaboration.

Abstract

Data annotation using visual inspection (supervision) of each training sample can be laborious. Interactive solutions alleviate this by helping experts propagate labels from a few supervised samples to unlabeled ones based solely on the visual analysis of their feature space projection (with no further sample supervision). We present a semi-automatic data annotation approach based on suitable feature space projection and semi-supervised label estimation. We validate our method on the popular MNIST dataset and on images of human intestinal parasites with and without fecal impurities, a large and diverse dataset that makes classification very hard. We evaluate two approaches for semi-supervised learning from the latent and projection spaces, to choose the one that best reduces user annotation effort and also increases classification accuracy on unseen data. Our results demonstrate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.