Toward Implicit Sample Noise Modeling: Deviation-driven Matrix   Factorization

Guang-He Lee; Shao-Wen Yang; Shou-De Lin

arXiv:1610.09274·cs.LG·October 31, 2016·1 cites

Toward Implicit Sample Noise Modeling: Deviation-driven Matrix Factorization

Guang-He Lee, Shao-Wen Yang, Shou-De Lin

PDF

Open Access

TL;DR

This paper introduces a novel matrix factorization approach that models and learns data deviations to dynamically reweight instances, improving convergence and accuracy in noisy and clean datasets.

Contribution

It proposes a deviation-driven matrix factorization model that jointly learns data deviations and reweights instances, enhancing robustness and efficiency.

Findings

01

Outperforms state-of-the-art models in accuracy.

02

Achieves faster convergence by down-weighting noisy data.

03

Effective in both recommendation and sensor datasets.

Abstract

The objective function of a matrix factorization model usually aims to minimize the average of a regression error contributed by each element. However, given the existence of stochastic noises, the implicit deviations of sample data from their true values are almost surely diverse, which makes each data point not equally suitable for fitting a model. In this case, simply averaging the cost among data in the objective function is not ideal. Intuitively we would like to emphasize more on the reliable instances (i.e., those contain smaller noise) while training a model. Motivated by such observation, we derive our formula from a theoretical framework for optimal weighting under heteroscedastic noise distribution. Specifically, by modeling and learning the deviation of data, we design a novel matrix factorization model. Our model has two advantages. First, it jointly learns the deviation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Tensor decomposition and applications · Music and Audio Processing