Sequential Learning without Feedback

Manjesh Hanawal; Csaba Szepesvari; Venkatesh Saligrama

arXiv:1610.05394 · cs.LG · October 19, 2016

TL;DR

This paper addresses unsupervised sensor selection in sequential testing scenarios by introducing a weak-dominance condition, enabling the development of polynomial-time algorithms with sublinear regret guarantees.

Contribution

It introduces the weak-dominance condition for unsupervised sensor selection and provides polynomial-time algorithms with theoretical regret bounds under this condition.

Findings

01. Weak-dominance holds on real datasets.

02. The proposed algorithms achieve sublinear regret.

03. Weak-dominance is a maximal condition for achieving sublinear regret.

Abstract

In many security and healthcare systems, a sequence of features/sensors/tests is used for detection and diagnosis. Each test outputs a prediction of the latent state and carries inherent costs. Our objective is to learn strategies for selecting tests that optimize accuracy and cost. Unfortunately, it is often impossible to acquire in-situ ground-truth annotations, and we are left with the problem of unsupervised sensor selection (USS). We pose USS as a version of the stochastic partial monitoring problem with an unusual reward structure (even noisy annotations are unavailable). Unsurprisingly, no learner can achieve sublinear regret without further assumptions. To this end, we propose the notion of weak-dominance. This is a condition on the joint probability distribution of test outputs and latent state and says that whenever a test is accurate on an example, a later test in…

Taxonomy

Topics: Machine Learning and Algorithms · Distributed Sensor Networks and Detection Algorithms · Advanced Bandit Algorithms Research