Socratic Learning: Augmenting Generative Models to Incorporate Latent Subsets in Training Data
Paroma Varma, Bryan He, Dan Iter, Peng Xu, Rose Yu, Christopher De Sa,, Christopher R\'e

TL;DR
Socratic learning enhances generative models by automatically identifying latent data subsets where supervision sources vary in performance, significantly improving label quality without ground truth labels.
Contribution
It introduces a feedback-driven method to detect latent data subsets and augment generative models, addressing limitations of previous approaches that assume uniform source accuracy.
Findings
Reduced error by up to 56.06% in relation extraction
Effectively models latent subsets in training data
Improves weak supervision quality without ground truth
Abstract
A challenge in training discriminative models like neural networks is obtaining enough labeled training data. Recent approaches use generative models to combine weak supervision sources, like user-defined heuristics or knowledge bases, to label training data. Prior work has explored learning accuracies for these sources even without ground truth labels, but they assume that a single accuracy parameter is sufficient to model the behavior of these sources over the entire training set. In particular, they fail to model latent subsets in the training data in which the supervision sources perform differently than on average. We present Socratic learning, a paradigm that uses feedback from a corresponding discriminative model to automatically identify these subsets and augments the structure of the generative model accordingly. Experimentally, we show that without any ground truth labels, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Intelligent Tutoring Systems and Adaptive Learning · Educational Assessment and Pedagogy
