# Three-Stage Subspace Clustering Framework with Graph-Based   Transformation and Optimization

**Authors:** Shuai Yang, Wenqi Zhu, Yuesheng Zhu

arXiv: 1905.01145 · 2019-05-06

## TL;DR

This paper introduces a three-stage subspace clustering framework that enhances affinity matrix quality using graph-based transformation and optimization, leading to more accurate clustering of high-dimensional data.

## Contribution

The paper proposes a universal three-stage framework with a novel graph-based transformation and optimization step to improve subspace clustering accuracy.

## Key findings

- Improved clustering accuracy on synthetic and real-world datasets.
- Enhanced affinity matrix connectivity and authenticity.
- Framework applicable to various regularization-based SC methods.

## Abstract

Subspace clustering (SC) refers to the problem of clustering high-dimensional data into a union of low-dimensional subspaces. Based on spectral clustering, state-of-the-art approaches solve SC problem within a two-stage framework. In the first stage, data representation techniques are applied to draw an affinity matrix from the original data. In the second stage, spectral clustering is directly applied to the affinity matrix so that data can be grouped into different subspaces. However, the affinity matrix obtained in the first stage usually fails to reveal the authentic relationship between data points, which leads to inaccurate clustering results. In this paper, we propose a universal Three-Stage Subspace Clustering framework (3S-SC). Graph-Based Transformation and Optimization (GBTO) is added between data representation and spectral clustering. The affinity matrix is obtained in the first stage, then it goes through the second stage, where the proposed GBTO is applied to generate a reconstructed affinity matrix with more authentic similarity between data points. Spectral clustering is applied after GBTO, which is the third stage. We verify our 3S-SC framework with GBTO through theoretical analysis. Experiments on both synthetic data and the real-world data sets of handwritten digits and human faces demonstrate the universality of the proposed 3S-SC framework in improving the connectivity and accuracy of SC methods based on $\ell_0$, $\ell_1$, $\ell_2$ or nuclear norm regularization.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1905.01145/full.md

## Figures

24 figures with captions in the complete paper: https://tomesphere.com/paper/1905.01145/full.md

---
Source: https://tomesphere.com/paper/1905.01145