Learning Classifiers under Delayed Feedback with a Time Window Assumption
Masahiro Kato, Shota Yasui

TL;DR
This paper introduces a new method for training binary classifiers with delayed feedback, leveraging all samples under a time window assumption to reduce bias and improve performance in online advertising scenarios.
Contribution
It proposes an unbiased, convex empirical risk approach that utilizes all available samples under the time window assumption, extending previous methods.
Findings
The proposed method achieves unbiased classification under delayed feedback.
Experimental results show improved performance over existing approaches.
Validated on synthetic and real online advertising datasets.
Abstract
We consider training a binary classifier under delayed feedback (\emph{DF learning}). For example, in the conversion prediction in online ads, we initially receive negative samples that clicked the ads but did not buy an item; subsequently, some samples among them buy an item then change to positive. In the setting of DF learning, we observe samples over time, then learn a classifier at some point. We initially receive negative samples; subsequently, some samples among them change to positive. This problem is conceivable in various real-world applications such as online advertisements, where the user action takes place long after the first click. Owing to the delayed feedback, naive classification of the positive and negative samples returns a biased classifier. One solution is to use samples that have been observed for more than a certain time window assuming these samples are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Advanced Bandit Algorithms Research · Machine Learning and Algorithms
