Feature vector regularization in machine learning
Yue Fan, Louise Raphael, Mark Kon

TL;DR
This paper explores regularization techniques for feature vectors in machine learning, using function denoising methods on structured index spaces like graphs to improve data recovery and classification accuracy.
Contribution
It introduces a framework for regularizing feature vectors via function denoising methods on structured index spaces, demonstrating improved recovery and classification performance.
Findings
Regularization accuracy is non-monotonic in the denoising parameter.
Optimal regularization occurs at a finite positive parameter value.
Application to gene expression data improves cancer classification.
Abstract
Problems in machine learning (ML) can involve noisy input data, and ML classification methods have reached limiting accuracies when based on standard ML data sets consisting of feature vectors and their classes. Greater accuracy will require incorporation of prior structural information on data into learning. We study methods to regularize feature vectors (unsupervised regularization methods), analogous to supervised regularization for estimating functions in ML. We study regularization (denoising) of ML feature vectors using Tikhonov and other regularization methods for functions on . A feature vector is viewed as a function of its index , and smoothed using prior information on its structure. This can involve a penalty functional on feature vectors analogous to those in statistical learning, or use of proximity (e.g. graph)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Image and Signal Denoising Methods · Neural Networks and Applications
