Responsible Active Learning via Human-in-the-loop Peer Study

Yu-Tong Cao; Jingya Wang; Baosheng Yu; Dacheng Tao

arXiv:2211.13587·cs.LG·November 28, 2022

Responsible Active Learning via Human-in-the-loop Peer Study

Yu-Tong Cao, Jingya Wang, Baosheng Yu, Dacheng Tao

PDF

Open Access

TL;DR

This paper introduces Peer Study Learning (PSL), a responsible active learning approach that preserves data privacy by using a human-in-the-loop architecture with peer students, improving model stability and performance.

Contribution

The paper proposes a novel privacy-preserving active learning method with peer students and a discrepancy-based sampling criterion, enhancing data privacy and model robustness.

Findings

01

PSL outperforms existing active learning methods in accuracy.

02

PSL effectively preserves data privacy in sensitive settings.

03

Peer students improve active learner stability.

Abstract

Active learning has been proposed to reduce data annotation efforts by only manually labelling representative data samples for training. Meanwhile, recent active learning applications have benefited a lot from cloud computing services with not only sufficient computational resources but also crowdsourcing frameworks that include many humans in the active learning loop. However, previous active learning methods that always require passing large-scale unlabelled data to cloud may potentially raise significant data privacy issues. To mitigate such a risk, we propose a responsible active learning method, namely Peer Study Learning (PSL), to simultaneously preserve data privacy and improve model stability. Specifically, we first introduce a human-in-the-loop teacher-student architecture to isolate unlabelled data from the task learner (teacher) on the cloud-side by maintaining an active…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Privacy-Preserving Technologies in Data