Toward Implicit Sample Noise Modeling: Deviation-driven Matrix Factorization
Guang-He Lee, Shao-Wen Yang, Shou-De Lin

TL;DR
This paper introduces a novel matrix factorization approach that models and learns data deviations to dynamically reweight instances, improving convergence and accuracy in noisy and clean datasets.
Contribution
It proposes a deviation-driven matrix factorization model that jointly learns data deviations and reweights instances, enhancing robustness and efficiency.
Findings
Outperforms state-of-the-art models in accuracy.
Achieves faster convergence by down-weighting noisy data.
Effective in both recommendation and sensor datasets.
Abstract
The objective function of a matrix factorization model usually aims to minimize the average of a regression error contributed by each element. However, given the existence of stochastic noises, the implicit deviations of sample data from their true values are almost surely diverse, which makes each data point not equally suitable for fitting a model. In this case, simply averaging the cost among data in the objective function is not ideal. Intuitively we would like to emphasize more on the reliable instances (i.e., those contain smaller noise) while training a model. Motivated by such observation, we derive our formula from a theoretical framework for optimal weighting under heteroscedastic noise distribution. Specifically, by modeling and learning the deviation of data, we design a novel matrix factorization model. Our model has two advantages. First, it jointly learns the deviation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Tensor decomposition and applications · Music and Audio Processing
