Human-in-the-loop Handling of Knowledge Drift
Andrea Bontempelli, Fausto Giunchiglia, Andrea Passerini, Stefano Teso

TL;DR
This paper presents TRCKD, a human-in-the-loop method for detecting and adapting to knowledge drift in hierarchical classification, leveraging user input to improve model accuracy over time.
Contribution
It introduces TRCKD, a novel approach combining automated detection and user disambiguation to handle complex knowledge drift in hierarchical models.
Findings
User queries significantly improve prediction accuracy.
TRCKD effectively distinguishes different types of knowledge drift.
Simulation results show improved performance on synthetic and real data.
Abstract
We introduce and study knowledge drift (KD), a complex form of drift that occurs in hierarchical classification. Under KD the vocabulary of concepts, their individual distributions, and the is-a relations between them can all change over time. The main challenge is that, since the ground-truth concept hierarchy is unobserved, it is hard to tell apart different forms of KD. For instance, introducing a new is-a relation between two concepts might be confused with individual changes to those concepts, but it is far from equivalent. Failure to identify the right kind of KD compromises the concept hierarchy used by the classifier, leading to systematic prediction errors. Our key observation is that in many human-in-the-loop applications (like smart personal assistants) the user knows whether and what kind of drift occurred recently. Motivated by this, we introduce TRCKD, a novel approach…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Mobile Crowdsensing and Crowdsourcing · Smart Grid Energy Management
