A Survey of Human-in-the-loop for Machine Learning
Xingjiao Wu, Luwei Xiao, Yixuan Sun, Junhang Zhang, Tianlong Ma, Liang, He

TL;DR
This survey reviews human-in-the-loop machine learning, categorizing approaches into data improvement, interventional training, and system design, highlighting strengths, weaknesses, and future challenges across various domains.
Contribution
It provides a comprehensive classification and analysis of human-in-the-loop methods, offering insights into technical strengths, weaknesses, and open challenges in the field.
Findings
Classified human-in-the-loop approaches into three categories
Analyzed strengths and weaknesses of existing methods
Identified open challenges and future opportunities
Abstract
Human-in-the-loop aims to train an accurate prediction model with minimum cost by integrating human knowledge and experience. Humans can provide training data for machine learning applications and directly accomplish tasks that are hard for computers in the pipeline with the help of machine-based approaches. In this paper, we survey existing works on human-in-the-loop from a data perspective and classify them into three categories with a progressive relationship: (1) the work of improving model performance from data processing, (2) the work of improving model performance through interventional model training, and (3) the design of the system independent human-in-the-loop. Using the above categorization, we summarize major approaches in the field; along with their technical strengths/ weaknesses, we have simple classification and discussion in natural language processing, computer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
