Dropping Convexity for More Efficient and Scalable Online Multiview   Learning

Zhehui Chen; Lin F. Yang; Chris J. Li; Tuo Zhao

arXiv:1702.08134·cs.LG·September 17, 2019·1 cites

Dropping Convexity for More Efficient and Scalable Online Multiview Learning

Zhehui Chen, Lin F. Yang, Chris J. Li, Tuo Zhao

PDF

Open Access

TL;DR

This paper introduces a nonconvex approach to multiview representation learning, demonstrating that simple stochastic gradient descent can efficiently find global optima, supported by theoretical convergence analysis and numerical experiments.

Contribution

It proposes a nonconvex formulation for multiview learning and provides theoretical analysis showing convergence to global optima using diffusion approximations.

Findings

01

SGD efficiently finds global optima in the nonconvex formulation.

02

Theoretical convergence rates are established for the proposed method.

03

Numerical experiments support the theoretical results.

Abstract

Multiview representation learning is very popular for latent factor analysis. It naturally arises in many data analysis, machine learning, and information retrieval applications to model dependent structures among multiple data sources. For computational convenience, existing approaches usually formulate the multiview representation learning as convex optimization problems, where global optima can be obtained by certain algorithms in polynomial time. However, many pieces of evidence have corroborated that heuristic nonconvex approaches also have good empirical computational performance and convergence to the global optima, although there is a lack of theoretical justification. Such a gap between theory and practice motivates us to study a nonconvex formulation for multiview representation learning, which can be efficiently solved by a simple stochastic gradient descent (SGD) algorithm.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Domain Adaptation and Few-Shot Learning