Learning Sign-Constrained Support Vector Machines

Kenya Tajima; Takahiko Henmi; Kohei Tsuchida; Esmeraldo Ronnie R.; Zara; and Tsuyoshi Kato

arXiv:2101.01473·cs.LG·October 12, 2022

Learning Sign-Constrained Support Vector Machines

Kenya Tajima, Takahiko Henmi, Kohei Tsuchida, Esmeraldo Ronnie R., Zara, and Tsuyoshi Kato

PDF

Open Access

TL;DR

This paper introduces two optimization algorithms for training linear support vector machines with sign constraints on weights, leveraging domain knowledge to potentially improve generalization, and provides theoretical convergence analysis and empirical validation.

Contribution

It develops and analyzes two efficient algorithms for sign-constrained SVMs, incorporating domain knowledge into the learning process with proven convergence guarantees.

Findings

01

Algorithms converge sublinearly with $O(nd)$ per iteration cost

02

Explicit minimal iteration number for $psilon$-accurate solutions derived

03

Empirical results show sign constraints enhance performance with similarity-based features

Abstract

Domain knowledge is useful to improve the generalization performance of learning machines. Sign constraints are a handy representation to combine domain knowledge with learning machine. In this paper, we consider constraining the signs of the weight coefficients in learning the linear support vector machine, and develop two optimization algorithms for minimizing the empirical risk under the sign constraints. One of the two algorithms is based on the projected gradient method, in which each iteration of the projected gradient method takes $O (n d)$ computational cost and the sublinear convergence of the objective error is guaranteed. The second algorithm is based on the Frank-Wolfe method that also converges sublinearly and possesses a clear termination criterion. We show that each iteration of the Frank-Wolfe also requires $O (n d)$ cost. Furthermore, we derive the explicit expression for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Domain Adaptation and Few-Shot Learning · Machine Learning and ELM