Improved Algorithm and Bounds for Successive Projection

Jiashun Jin; Zheng Tracy Ke; Gabriel Moryoussef; Jiajun Tang; Jingming; Wang

arXiv:2403.11013·cs.LG·March 19, 2024·1 cites

Improved Algorithm and Bounds for Successive Projection

Jiashun Jin, Zheng Tracy Ke, Gabriel Moryoussef, Jiajun Tang, Jingming, Wang

PDF

Open Access 1 Repo 1 Video 3 Reviews

TL;DR

This paper introduces pp-SPA, an enhanced vertex hunting algorithm that combines projection and denoising steps, providing faster and more accurate results than the traditional SPA, especially under high noise conditions.

Contribution

The paper proposes pp-SPA, a novel variant of SPA that improves vertex estimation accuracy and speed by incorporating denoising, along with new error bounds for SPA.

Findings

01

pp-SPA outperforms SPA in noisy environments

02

Faster convergence rates for pp-SPA

03

Improved non-asymptotic bounds for SPA

Abstract

Given a $K$ -vertex simplex in a $d$ -dimensional space, suppose we measure $n$ points on the simplex with noise (hence, some of the observed points fall outside the simplex). Vertex hunting is the problem of estimating the $K$ vertices of the simplex. A popular vertex hunting algorithm is successive projection algorithm (SPA). However, SPA is observed to perform unsatisfactorily under strong noise or outliers. We propose pseudo-point SPA (pp-SPA). It uses a projection step and a denoise step to generate pseudo-points and feed them into SPA for vertex hunting. We derive error bounds for pp-SPA, leveraging on extreme value theory of (possibly) high-dimensional random vectors. The results suggest that pp-SPA has faster rates and better numerical performances than SPA. Our analysis includes an improved non-asymptotic bound for the original SPA, which is of independent interest.

Peer Reviews

Decision·ICLR 2024 poster

Reviewer 01Rating 8· accept, good paperConfidence 3

Strengths

I like the pp-SPA algorithm a lot, and it feels like a very natural idea. It's an added bonus that the authors were able to get tighter theoretical bounds, which while admittedly are too complicated sometimes to compare fairly to classical SPA, are clearly tighter. It's not clear to me at all when the bounds for pp-SPA beat the bounds for classical SPA, simply because of how complicated the bounds are. I was able to follow most proofs as a non-expert, so the **proofs** are well written (the ac

Weaknesses

1) For some reason, the authors do not compare pp-SPA with any robust SPA. I found many results in the literature on robust SPA, so it seems bizare it is not included in the experimental section here. Please try to include at least one robust variant of SPA/similar algorithm in the experimental section. 2) The writing can be dramatically improved. See below minor points, but if this is accepted, please spent a few iterations on improving the writing. It's extremely terse at time. 3) You repr

Reviewer 02Rating 6· marginally above the acceptance thresholdConfidence 2

Strengths

The paper gives ample motivating examples for which the problem is relevant. The paper makes attempts to motivate the given bounds. The algorithm itself seems intuitive.

Weaknesses

The bounds given in the paper, e.g. Theorem 2, are very complex; they contain multiple terms and are subject to many conditions. It is hard to understand whether those conditions are restrictive, or whether the bounds are significant. I think the presentation of the problem could be expanded upon.

Reviewer 03Rating 6· marginally above the acceptance thresholdConfidence 2

Strengths

I believe, the paper brings potentially several strong contributions: 1) The authors propose algorithm seems to practically make a lot of sense 2) The theoretical results are novel and non-trivial. The authors exploit the specific Gaussianity of the noise to be able to derive these bounds 3) The experiments are limited, but they support the statements of the paper

Weaknesses

There are some considerable difficulties I have with the paper: 1) The writing and the structure of the text should be improved. For example: * It is not clear to me what the illustration in Figure 1 shows? Is it obvious that one of these is better and what does ``idea simplex'' denote? * At times a statement is given without citation, e.g. you say in "Our contributions.", pg 2 that: "since the SPA is greedy algorithm, it tends to be biased outward bound.", but it is not clear where this can be

Code & Models

Repositories

gabriel78110/vertexhunting
noneOfficial

Videos

Improved algorithm and bounds for successive projection· slideslive

Taxonomy

TopicsRobotic Mechanisms and Dynamics · Manufacturing Process and Optimization · Advanced Numerical Analysis Techniques