Multi-Task Feature Learning Via Efficient l2,1-Norm Minimization
Jun Liu, Shuiwang Ji, Jieping Ye

TL;DR
This paper introduces an efficient method for joint feature selection across multiple tasks using l2,1-norm regularization, reformulating the problem for faster convex optimization with proven empirical efficiency.
Contribution
It proposes a novel reformulation of the l2,1-norm regularized regression as smooth convex problems solved via Nesterov's method, enabling faster computation.
Findings
Reformulation allows analytical Euclidean projection computation.
Second reformulation's projection can be computed in linear time.
Empirical results demonstrate the efficiency of the proposed algorithms.
Abstract
The problem of joint feature selection across a group of related tasks has applications in many areas including biomedical informatics and computer vision. We consider the l2,1-norm regularized regression model for joint feature selection from multiple tasks, which can be derived in the probabilistic framework by assuming a suitable prior from the exponential family. One appealing feature of the l2,1-norm regularization is that it encourages multiple predictors to share similar sparsity patterns. However, the resulting optimization problem is challenging to solve due to the non-smoothness of the l2,1-norm regularization. In this paper, we propose to accelerate the computation by reformulating it as two equivalent smooth convex optimization problems which are then solved via the Nesterov's method-an optimal first-order black-box method for smooth convex optimization. A key building block…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Face and Expression Recognition · Machine Learning and Algorithms
