Can we spot a fake?

Shahar Mendelson; Grigoris Paouris; Roman Vershynin

arXiv:2410.18880·math.ST·April 23, 2026

TL;DR

This paper asks how large a corruption an adversary can add to a data point while keeping the fake statistically indistinguishable from the real data. It relates this largest undetectable radius to the geometry of the set of the adversary's possible tricks, and extends the analysis beyond Gaussian data.

Contribution

The authors bound the detectability radius of fake data in terms of the Gaussian width of the adversary's trick set; the upper bound extends to arbitrary sets and to non-Gaussian distributions.

Findings

01

For highly symmetric trick sets $T$, the detectability radius $r(T)$ is approximately twice the scaled Gaussian width of $T$.

02

The upper bound on the detectability radius holds for any set $T$ and any distribution of the real data.

03

The authors conjecture that focusing on the most important directions of $T$ can improve the bounds for asymmetric sets.

Abstract

The problem of detecting fake data inspires the following seemingly simple mathematical question. Sample a data point $X$ from the standard normal distribution in $\mathbb{R}^n$. An adversary observes $X$ and corrupts it by adding a vector $rt$, where they can choose any vector $t$ from a fixed set $T$ of the adversary's "tricks", and where $r > 0$ is a fixed radius. The adversary's choice of $t = t(X)$ may depend on the true data $X$. The adversary wants to hide the corruption by making the fake data $X + rt$ statistically indistinguishable from the real data $X$. What is the largest radius $r = r(T)$ for which the adversary can create an undetectable fake? We show that for highly symmetric sets $T$, the detectability radius $r(T)$ is approximately twice the scaled Gaussian width of $T$. The upper bound actually holds for arbitrary sets $T$ and generalizes to arbitrary, non-Gaussian…
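To make the central quantity concrete, here is a minimal Python sketch that estimates the Gaussian width of a finite set by Monte Carlo, using the standard definition $w(T) = \mathbb{E} \sup_{t \in T} \langle g, t \rangle$ with $g$ standard normal. It is an illustration under stated assumptions, not the authors' method: the trick set $T = \{\pm e_i\}$, the helper names `gaussian_width` and `signed_basis`, and the sample size are all hypothetical choices, and the sketch computes the unscaled width, since the excerpt above does not state the scaling the paper applies.

```python
import math
import random


def gaussian_width(points, n_samples=1000, seed=0):
    """Monte Carlo estimate of w(T) = E sup_{t in T} <g, t> for a finite set T.

    `points` is a list of equal-length lists of floats (the elements of T).
    """
    rng = random.Random(seed)
    dim = len(points[0])
    total = 0.0
    for _ in range(n_samples):
        # Draw one standard normal vector g in R^dim.
        g = [rng.gauss(0.0, 1.0) for _ in range(dim)]
        # Supremum of <g, t> over the finite set T.
        total += max(sum(gi * ti for gi, ti in zip(g, t)) for t in points)
    return total / n_samples


def signed_basis(dim):
    """Hypothetical symmetric trick set T = {+e_i, -e_i : i = 1..dim}."""
    pts = []
    for i in range(dim):
        for sign in (1.0, -1.0):
            v = [0.0] * dim
            v[i] = sign
            pts.append(v)
    return pts


n = 20
T = signed_basis(n)
estimate = gaussian_width(T)
# Standard bound on the expected maximum of N standard normals: sqrt(2 log N).
upper = math.sqrt(2.0 * math.log(len(T)))
print(f"estimated width: {estimate:.3f}, sqrt(2 log |T|): {upper:.3f}")
```

For this choice of $T$ the supremum equals $\max_i |g_i|$, so the estimate can be checked against the classical bound $\sqrt{2 \log |T|}$ on the expected maximum of $|T|$ standard normal variables.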
