Scalable and Flexible Multiview MAX-VAR Canonical Correlation Analysis

Xiao Fu; Kejun Huang; Mingyi Hong; Nicholas D. Sidiropoulos; Anthony; Man-Cho So

arXiv:1605.09459·stat.ML·August 2, 2017·IEEE Trans. Signal Process.

Scalable and Flexible Multiview MAX-VAR Canonical Correlation Analysis

Xiao Fu, Kejun Huang, Mingyi Hong, Nicholas D. Sidiropoulos, Anthony, Man-Cho So

PDF

TL;DR

This paper introduces a scalable, regularized MAX-VAR GCCA algorithm that efficiently handles large-scale, multi-view data with structural constraints, improving convergence and applicability in multilingual and speech processing tasks.

Contribution

It proposes an alternating optimization algorithm for regularized MAX-VAR GCCA that is scalable, handles structural constraints, and guarantees convergence to critical points.

Findings

01

The algorithm converges globally to a critical point at a sublinear rate.

02

It approaches a global optimum at a linear rate without regularization.

03

Experiments demonstrate effectiveness on large-scale word embedding tasks.

Abstract

Generalized canonical correlation analysis (GCCA) aims at finding latent low-dimensional common structure from multiple views (feature vectors in different domains) of the same entities. Unlike principal component analysis (PCA) that handles a single view, (G)CCA is able to integrate information from different feature spaces. Here we focus on MAX-VAR GCCA, a popular formulation which has recently gained renewed interest in multilingual processing and speech modeling. The classic MAX-VAR GCCA problem can be solved optimally via eigen-decomposition of a matrix that compounds the (whitened) correlation matrices of the views; but this solution has serious scalability issues, and is not directly amenable to incorporating pertinent structural constraints such as non-negativity and sparsity on the canonical components. We posit regularized MAX-VAR GCCA as a non-convex optimization problem and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.