# Scalable Semi-Supervised SVM via Triply Stochastic Gradients

**Authors:** Xiang Geng, Bin Gu, Xiang Li, Wanli Shi, Guansheng Zheng, Heng, Huang

arXiv: 1907.11584 · 2019-07-29

## TL;DR

This paper introduces TSGS3VM, a triply stochastic gradient algorithm that significantly improves the scalability and efficiency of semi-supervised SVMs for large datasets, with theoretical convergence guarantees.

## Contribution

It develops a novel triply stochastic gradient method for semi-supervised SVMs, addressing non-convexity and scalability issues in prior algorithms.

## Key findings

- TSGS3VM outperforms existing methods in efficiency and scalability.
- Theoretical analysis guarantees convergence to a stationary point.
- Experimental results validate the effectiveness across various datasets.

## Abstract

Semi-supervised learning (SSL) plays an increasingly important role in the big data era because a large number of unlabeled samples can be used effectively to improve the performance of the classifier. Semi-supervised support vector machine (S$^3$VM) is one of the most appealing methods for SSL, but scaling up S$^3$VM for kernel learning is still an open problem. Recently, a doubly stochastic gradient (DSG) algorithm has been proposed to achieve efficient and scalable training for kernel methods. However, the algorithm and theoretical analysis of DSG are developed based on the convexity assumption which makes them incompetent for non-convex problems such as S$^3$VM. To address this problem, in this paper, we propose a triply stochastic gradient algorithm for S$^3$VM, called TSGS$^3$VM. Specifically, to handle two types of data instances involved in S$^3$VM, TSGS$^3$VM samples a labeled instance and an unlabeled instance as well with the random features in each iteration to compute a triply stochastic gradient. We use the approximated gradient to update the solution. More importantly, we establish new theoretic analysis for TSGS$^3$VM which guarantees that TSGS$^3$VM can converge to a stationary point. Extensive experimental results on a variety of datasets demonstrate that TSGS$^3$VM is much more efficient and scalable than existing S$^3$VM algorithms.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1907.11584/full.md

## Figures

20 figures with captions in the complete paper: https://tomesphere.com/paper/1907.11584/full.md

## References

27 references — full list in the complete paper: https://tomesphere.com/paper/1907.11584/full.md

---
Source: https://tomesphere.com/paper/1907.11584