# Clustering by Orthogonal NMF Model and Non-Convex Penalty Optimization

**Authors:** Shuai Wang, Tsung-Hui Chang, Ying Cui, and Jong-Shi Pang

arXiv: 1906.00570 · 2021-07-29

## TL;DR

This paper introduces a novel non-convex penalty approach for orthogonal NMF clustering that transforms orthogonality constraints into norm-based penalties, enabling scalable and efficient solutions that outperform existing methods.

## Contribution

The paper proposes a new formulation of orthogonal NMF using non-convex penalties, along with efficient algorithms and theoretical conditions for feasible solutions.

## Key findings

- The proposed NCP methods are computationally efficient.
- They match or outperform existing clustering methods.
- Experimental results validate the effectiveness of the approach.

## Abstract

The non-negative matrix factorization (NMF) model with an additional orthogonality constraint on one of the factor matrices, called the orthogonal NMF (ONMF), has been found a promising clustering model and can outperform the classical K-means. However, solving the ONMF model is a challenging optimization problem because the coupling of the orthogonality and non-negativity constraints introduces a mixed combinatorial aspect into the problem due to the determination of the correct status of the variables (positive or zero). Most of the existing methods directly deal with the orthogonality constraint in its original form via various optimization techniques, but are not scalable for large-scale problems. In this paper, we propose a new ONMF based clustering formulation that equivalently transforms the orthogonality constraint into a set of norm-based non-convex equality constraints. We then apply a non-convex penalty (NCP) approach to add them to the objective as penalty terms, leading to a problem that is efficiently solvable. One smooth penalty formulation and one non-smooth penalty formulation are respectively studied. We build theoretical conditions for the penalized problems to provide feasible stationary solutions to the ONMF based clustering problem, as well as proposing efficient algorithms for solving the penalized problems of the two NCP methods. Experimental results based on both synthetic and real datasets are presented to show that the proposed NCP methods are computationally time efficient, and either match or outperform the existing K-means and ONMF based methods in terms of the clustering performance.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1906.00570/full.md

## Figures

17 figures with captions in the complete paper: https://tomesphere.com/paper/1906.00570/full.md

## References

49 references — full list in the complete paper: https://tomesphere.com/paper/1906.00570/full.md

---
Source: https://tomesphere.com/paper/1906.00570