Feature vector regularization in machine learning

Yue Fan; Louise Raphael; Mark Kon

arXiv:1212.4569 · stat.ML · December 31, 2013 · 2 cites

TL;DR

This paper explores regularization techniques for feature vectors in machine learning, using function denoising methods on structured index spaces like graphs to improve data recovery and classification accuracy.

Contribution

It introduces a framework for regularizing feature vectors via function denoising methods on structured index spaces, demonstrating improved recovery and classification performance.

Findings

1. Regularization accuracy is non-monotonic in the denoising parameter.

2. Optimal regularization occurs at a finite positive parameter value.

3. Application to gene expression data improves cancer classification.

Abstract

Problems in machine learning (ML) can involve noisy input data, and ML classification methods have reached limiting accuracies when based on standard ML data sets consisting of feature vectors and their classes. Greater accuracy will require incorporation of prior structural information on data into learning. We study methods to regularize feature vectors (unsupervised regularization methods), analogous to supervised regularization for estimating functions in ML. We study regularization (denoising) of ML feature vectors using Tikhonov and other regularization methods for functions on $R^{n}$. A feature vector $x = (x_{1}, \dots, x_{n}) = \{x_{q}\}_{q = 1}^{n}$ is viewed as a function of its index $q$, and smoothed using prior information on its structure. This can involve a penalty functional on feature vectors analogous to those in statistical learning, or use of proximity (e.g. graph)…
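
To make the idea of treating a feature vector as a function of its index more concrete, here is a minimal sketch of Tikhonov-style smoothing on a graph. It is an illustration under stated assumptions, not the authors' implementation: the index graph is assumed to be a simple path (index $q$ adjacent to $q+1$), the penalty is assumed to be the graph Laplacian quadratic form $f^{T} L f$, and the names `path_graph_laplacian`, `tikhonov_smooth`, and `lam` are hypothetical. Under those assumptions, minimizing $\|f - x\|^{2} + \lambda f^{T} L f$ has the closed form $f = (I + \lambda L)^{-1} x$.

```python
import numpy as np


def path_graph_laplacian(n):
    """Unnormalized Laplacian L = D - A of a path graph on n nodes.

    Assumption for illustration: feature index q is adjacent to q + 1.
    """
    A = np.zeros((n, n))
    idx = np.arange(n - 1)
    A[idx, idx + 1] = 1.0
    A[idx + 1, idx] = 1.0
    return np.diag(A.sum(axis=1)) - A


def tikhonov_smooth(x, L, lam):
    """Return the minimizer of ||f - x||^2 + lam * f^T L f.

    Setting the gradient to zero gives (I + lam * L) f = x.
    """
    n = x.shape[0]
    return np.linalg.solve(np.eye(n) + lam * L, x)


# Synthetic example (assumed data, not taken from the paper):
# a smooth profile over the feature index, corrupted by Gaussian noise.
rng = np.random.default_rng(0)
n = 50
clean = np.sin(np.linspace(0.0, np.pi, n))
noisy = clean + 0.3 * rng.standard_normal(n)

L = path_graph_laplacian(n)
smoothed = tikhonov_smooth(noisy, L, lam=2.0)

# Roughness (Laplacian quadratic form) before and after smoothing.
print("roughness before:", noisy @ L @ noisy)
print("roughness after: ", smoothed @ L @ smoothed)
```

With `lam=0` the function returns the input unchanged, and as `lam` grows the output approaches a constant vector, so a useful value lies somewhere in between. A proximity graph built from prior knowledge about the features could be substituted for the path graph by passing a different Laplacian `L`.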

Taxonomy

Topics: Image Retrieval and Classification Techniques · Image and Signal Denoising Methods · Neural Networks and Applications