Learning from Conditional Distributions via Dual Embeddings

Bo Dai; Niao He; Yunpeng Pan; Byron Boots; Le Song

arXiv:1607.04579·cs.LG·January 3, 2017·20 cites

Learning from Conditional Distributions via Dual Embeddings

Bo Dai, Niao He, Yunpeng Pan, Byron Boots, Le Song

PDF

Open Access

TL;DR

This paper introduces a new min-max reformulation and an efficient algorithm for learning from conditional distributions, addressing sample scarcity issues in various machine learning tasks.

Contribution

It proposes a novel min-max reformulation and Embedding-SGD algorithm for learning from conditional distributions with limited samples, supported by theoretical analysis.

Findings

01

Significant improvement over existing algorithms in experiments

02

Theoretical sample complexity bounds established

03

Effective on both synthetic and real-world datasets

Abstract

Many machine learning tasks, such as learning with invariance and policy evaluation in reinforcement learning, can be characterized as problems of learning from conditional distributions. In such problems, each sample $x$ itself is associated with a conditional distribution $p (z ∣ x)$ represented by samples ${z_{i}}_{i = 1}^{M}$ , and the goal is to learn a function $f$ that links these conditional distributions to target values $y$ . These learning problems become very challenging when we only have limited samples or in the extreme case only one sample from each conditional distribution. Commonly used approaches either assume that $z$ is independent of $x$ , or require an overwhelmingly large samples from each conditional distribution. To address these challenges, we propose a novel approach which employs a new min-max reformulation of the learning from conditional distribution problem. With…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Reinforcement Learning in Robotics · Machine Learning and Algorithms