Gradient Structure Estimation under Label-Only Oracles via Spectral Sensitivity

Jun Liu; Leo Yu Zhang; Fengpeng Li; Isao Echizen; Jiantao Zhou

arXiv:2601.14300·cs.LG·March 24, 2026

Gradient Structure Estimation under Label-Only Oracles via Spectral Sensitivity

Jun Liu, Leo Yu Zhang, Fengpeng Li, Isao Echizen, Jiantao Zhou

PDF

Open Access

TL;DR

This paper introduces a new gradient estimation attack in hard-label black-box settings, combining spectral initialization and pattern-driven optimization, achieving state-of-the-art success and query efficiency across various models and tasks.

Contribution

It provides a theoretical framework linking existing sign-flipping attacks to gradient sign recovery and proposes a novel attack method with proven guarantees and superior empirical performance.

Findings

01

Outperforms state-of-the-art hard-label attacks in success rate and query efficiency.

02

Effective across multiple datasets and model types, including defenses.

03

Successfully bypasses advanced defenses like Blacklight.

Abstract

Hard-label black-box settings, where only top-1 predicted labels are observable, pose a fundamentally constrained yet practically important feedback model for understanding model behavior. A central challenge in this regime is whether meaningful gradient information can be recovered from such discrete responses. In this work, we develop a unified theoretical perspective showing that a wide range of existing sign-flipping hard-label attacks can be interpreted as implicitly approximating the sign of the true loss gradient. This observation reframes hard-label attacks from heuristic search procedures into instances of gradient sign recovery under extremely limited feedback. Motivated by this first-principles understanding, we propose a new attack framework that combines a zero-query frequency-domain initialization with a Pattern-Driven Optimization (PDO) strategy. We establish theoretical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Privacy-Preserving Technologies in Data