ReFaceX: donor-driven reversible face anonymisation with detached recovery

Dost Muhammad; Muhammad Salman; Syed Muhammad Haider Shah; Malika Bendechache

PMC · DOI:10.1038/s41598-026-39337-2·February 9, 2026

ReFaceX: donor-driven reversible face anonymisation with detached recovery

Dost Muhammad, Muhammad Salman, Syed Muhammad Haider Shah, Malika Bendechache

PDF

Open Access

TL;DR

ReFaceX is a new method for anonymizing faces in images while preserving useful details like pose and expression, allowing for secure and practical sharing of facial data.

Contribution

ReFaceX introduces a donor-driven reversible face anonymisation framework that explicitly balances privacy and utility.

Findings

01

ReFaceX reduces identity similarity across multiple face recognition models on LFW and CelebA-HQ datasets.

02

The method achieves high-quality recovered images with SSIM of 0.9378 and PSNR of 23.97 dB.

03

ReFaceX operates in real time on a single RTX 3090 and is robust to JPEG re-encoding.

Abstract

Organisations must share facial imagery that remains useful for analysis while protecting identity. Many current methods fail to strike this balance: reconstruction-centred encoder–decoder designs tend to blur salient detail, whereas latent edits in pretrained generators often retain or drift identity cues, undermining privacy and utility. We present ReFaceX, a reversible anonymisation framework that separates what to protect from what to preserve. A donor identity code steers a U-Net anonymiser with Identity Feature Fusion to change identity while retaining non-identity content such as pose, background and expression. A learned steganographic channel carries a compact recovery payload, and reconstruction gradients are blocked at the stego image so the anonymiser is never rewarded for keeping identity. The threat model is stated explicitly and outcomes are audited with strong…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Genes1

GAN

Proteins1

Chemicals1

CelebA

Diseases2

RiDDLE MAE

Figures2

Click any figure to enlarge with its caption.

Motivation and problem setting. Left: encoder-decoder anonymisers often retain identity or blur detail; GAN inversion can alter non-identity attributes and is fragile to real transforms. Right: ReFaceX separates privacy from utility using donor-driven IFF, a gradient stop at $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_s$$\end{document}$ , and a learned steganographic payload. Outcome: lower identity similari

ReFaceX modules with privacy and utility decoupling. The anonymiser *A* uses a donor code from a fixed encoder to produce $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_a$$\end{document}$ . A steganographic channel hides *z* to form $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \s

Funding1

—https://doi.org/10.13039/501100001602Science Foundation Ireland

Keywords

Deep learning in imagingReFaceXReversible face anonymisationDonor-driven identity transferIdentity feature fusionImage steganographyDetached recoveryPrivacy utility trade offOpen set re-identificationFace recognition auditingEngineeringMathematics and computing

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection

Full text

Introduction

Public release and reuse of facial imagery in science, industry and government require anonymisation that protects identity while preserving analytical utility for tasks such as detection, tracking and landmark localisation. Naive obfuscation by blurring or mosaicing often destroys salient cues and still leaks identity to modern recognisers^1–4^. This has motivated a wave of deep learning methods that synthesise identity-altered faces with higher visual fidelity^2,5,6^, and a subset of reversible approaches that aim to allow authorised recovery of the original content^7–9^. A compact visual summary of these challenges and the problem setting is provided in Fig. 1.

Despite considerable progress, three limitations recur across existing lines of work. First, the security model is often implicit. Many methods optimise anonymisation against surrogate recognition losses and report verification-style metrics, yet they do not fix a concrete adversary with explicit knowledge, data and compute, nor do they provide guarantees such as bounds on identity leakage^3,4^. Reversible designs that store a recovery key or code within the anonymised image enlarge the attack surface unless the channel is robust to realistic distortions and resistant to extraction by an adaptive adversary^7^. Secondly, recovery channels are rarely stress-tested. In realistic deployments anonymised images are re-encoded, resized, cropped and filtered by social platforms. Without batteries that cover JPEG at multiple qualities, rescaling, cropping, colour space changes and codec transcodes, reliability outside the laboratory remains unclear^10^. Thirdly, empirical scope is narrow. Encoder-decoder anonymisers tend to favour over-smooth reconstructions^5,6^, while latent-space manipulation and inversion in pretrained GANs can reduce detail or shift attributes when inversion is imperfect^11,12^. Subgroup performance across age, skin tone and gender is rarely reported, which limits claims about fairness, and almost all results are image based with aligned crops rather than video with temporal consistency.

We present ReFaceX as a practical integration of established components configured to address these gaps under a clear threat model. Identity modification is steered by a donor identity embedding injected into a U-Net backbone through an identity feature fusion block^13^, so that identity change is large and controlled while non-identity content is preserved. Recovery is optimised on a detached computational path, which prevents the anonymiser from retaining identity cues merely to ease reconstruction. A compact recovery code is carried by a learned steganographic channel with an explicit objective that forces the encoder-decoder pair to transport information rather than allowing a bypass through near-identity outputs^7^. A high-level contrast with prior families and the core idea of separating privacy from utility are illustrated in Fig. 1. We do not claim these ingredients to be new in isolation; the contribution lies in decoupling privacy from utility during training, auditing outcomes against strong recognisers, and stress-testing the recovery channel under realistic distortions.

We formalise a threat model spanning black-box and white-box adversaries built around strong face recognition encoders^3,4^ and open-set search. Privacy is measured with similarity distributions, receiver operating characteristics and re-identification at the equal error rate. Utility is measured with pixel and perceptual metrics on recovered images^10^. We add a robustness battery that applies common photometric and geometric transforms before recovery, and we report subgroup analyses to probe demographic performance. Our open implementation runs at $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$256\times 256$$\end{document}$ with mixed precision on commodity GPUs and remains compatible with stronger generative and geometric priors if desired. In summary, ReFaceX provides an auditable route to reversible anonymisation that separates the goals of privacy and utility, states an attacker model explicitly, and stresses the recovery channel under conditions that mirror deployment.

Contributions.

A training and architectural configuration that decouples privacy from utility by donor-guided identity transfer with identity feature fusion and gradient blocking on the recovery path.
A stated threat model and a multi-auditor privacy evaluation using strong recognisers, together with full-reference utility metrics.
A robustness battery for the learned steganographic channel, including JPEG re-encoding and basic geometric changes.
An empirical study on LFW and CelebA-HQ demonstrating cross-auditor privacy and high-fidelity recovery at $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$256\times 256$$\end{document}$ with real-time inference.

Fig. 1. Motivation and problem setting. Left: encoder-decoder anonymisers often retain identity or blur detail; GAN inversion can alter non-identity attributes and is fragile to real transforms. Right: ReFaceX separates privacy from utility using donor-driven IFF, a gradient stop at $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_s$$\end{document}$ , and a learned steganographic payload. Outcome: lower identity similarity under multiple auditors with high recovered-image fidelity and fast inference.

The remainder of this paper is organised as follows. “Related work” section reviews prior work on irreversible and reversible anonymisation, learned image steganography, face recognition metrics, and evaluation pitfalls. “Problem setup and threat model” section formalises the notation, privacy threat model, and objectives for privacy and utility. “Method” section presents ReFaceX, including the donor driven anonymiser with Identity Feature Fusion, the learned steganographic channel, the detached recovery network, and the full loss. “Experimental setup” section details datasets, preprocessing, baselines, metrics, and implementation. “Results” section reports quantitative and qualitative results, computational efficiency, and the privacy–utility frontier. “Discussion” section provides a technical discussion of why the design achieves stronger privacy and utility than recent alternatives. “Conclusion” section concludes and sketches future work.

Terminology and notation

Roles and terms. We use the following terms consistently throughout:

source identity: the identity of the input face $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x$$\end{document}$ .
donor identity: the identity extracted from a separate donor image $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\tilde{x}$$\end{document}$ ; its embedding $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$e_{\textrm{don}}=f(\tilde{x})$$\end{document}$ is projected to a code $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$c$$\end{document}$ that steers anonymisation.
anonymised face: the anonymised output $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_a=A(x,c)$$\end{document}$ whose identity differs from the source identity while non-identity content is preserved.

Notation

SymbolMeaning x Input face image (with source identity) $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\tilde{x}$$\end{document}$ Donor image (provides the donor identity) $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$f(\cdot )$$\end{document}$ Fixed face encoder used for auditing c Donor identity code derived from $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$f(\tilde{x})$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_a$$\end{document}$ Anonymised face produced by A $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_s$$\end{document}$ Stego anonymised image $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$S(x_a,z)$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$z,\ \hat{z}$$\end{document}$ Recovery payload and its estimate $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_r$$\end{document}$ Recovered image $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$R(x_s,\hat{z})$$\end{document}$

Related work

Encoder–decoder anonymisers and residual identity

Early GAN or encoder–decoder based anonymisers often rely on strong pixel or perceptual reconstruction losses to maintain visual quality. This creates a well-documented tendency to retain identity information, which has been observed both empirically and in surveys of de-identification practice. For example, DeepPrivacy replaces faces via conditional synthesis to avoid the identity leakage that arises when trying to reconstruct the original identity with losses that favour fidelity^2^. CIAGAN further shows that inpainting with guidance can still exhibit residual identity if the global displacement in the embedding space is not enforced^14^. Comprehensive reviews note that reconstruction-centric pipelines frequently under-remove identity or over-smooth content, which harms either privacy or utility^15^. These observations are consistent with our finding that a detached recovery path and an explicit identity margin are needed to prevent leakage.

Latent manipulation and inversion fragility

Methods that anonymise by editing latent codes after GAN inversion improve realism but are sensitive to inversion errors. Inversion is approximate and entangles identity with other attributes, which leads to drift in hair, background and expression under latent edits, as discussed in analyses of StyleGAN inversion and encoders^11,12^. Recent attribute-preserving anonymisation via latent code optimisation reports similar trade-offs between identity change and attribute stability^16^, while reversible latent encryption must still transmit rich codes for faithful recovery^17^.

Reversible channels and robustness to platform transforms

Learned steganography and watermarking literature provides concrete evidence that recovery channels are fragile unless explicitly trained and tested against real transforms. HiDDeN introduces differentiable corruption layers (e.g. JPEG, crop, noise) and shows large drops in retrieval without robustness training^18^. StegaStamp likewise demonstrates that survival under JPEG recompression, rescaling and viewpoint changes requires corruption-aware objectives and architectural choices^19^. These results support our choice to stress the steganographic channel with JPEG simulation and to report robustness curves.

Privacy-preserving representations and auditing

Work on privacy-preserving encodings shows that training solely for utility can leak sensitive attributes, and that auditing with strong recognition models is necessary^20^. Our evaluation follows this guidance by auditing with multiple recognisers and reporting open-set re-identification figures together with full-reference image metrics.

Problem setup and threat model

Notation. Let $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x \in [0,1]^{3\times H\times W}$$\end{document}$ denote an original face image. The anonymiser produces an anonymised image $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_a$$\end{document}$ , and the steganographic hider yields a stego–anonymised image $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_s$$\end{document}$ that visually matches $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_a$$\end{document}$ but carries a hidden recovery payload. A real–valued recovery code is written as $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$z \in \mathbb {R}^{d}$$\end{document}$ . An authorised decoder recovers a code $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{z}$$\end{document}$ from $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_s$$\end{document}$ and a recovery network reconstructs an image $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_r$$\end{document}$ that should be perceptually close to x. For privacy auditing we employ a fixed face encoder $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$f(\cdot )$$\end{document}$ that maps an image to an identity embedding in $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mathbb {R}^{p}$$\end{document}$ , and we write

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} e_x {:=}f(x), \qquad e_{a} {:=}f(x_a), \qquad e_{r} {:=}f(x_r). \end{aligned}$$\end{document}

Cosine similarity between two embeddings $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$u,v \in \mathbb {R}^{p}$$\end{document}$ is

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} s(u,v) {:=}\frac{\langle u, v\rangle }{\Vert u\Vert _2 \, \Vert v\Vert _2}. \end{aligned}$$\end{document}

System components We write the anonymiser as $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$A(x, c) \rightarrow x_a$$\end{document}$ , where c denotes an identity–conditioning signal that drives identity change while preserving non–identity content. A steganographic encoder $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$S(x_a, z) \rightarrow x_s$$\end{document}$ hides z in $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_a$$\end{document}$ . A steganographic decoder $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$S^{-1}(x_s) \rightarrow \hat{z}$$\end{document}$ extracts the code for authorised recovery. A recovery network $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$R(x_s, \hat{z}) \rightarrow x_r$$\end{document}$ reconstructs an approximation to x. During training, A, S, $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$S^{-1}$$\end{document}$ and R are learned jointly under privacy and utility objectives, with architectural decoupling between the privacy path and the recovery path to prevent degenerate shortcuts.

Threat model

Adversary goal The attacker seeks to re–identify the subject in $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_a$$\end{document}$ or $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_s$$\end{document}$ by matching against a large gallery $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mathscr {G} = \{g_i\}_{i=1}^{G}$$\end{document}$ of facial images, using a strong face recogniser that is independent of the anonymisation pipeline.

Black–box attacker The attacker is granted access to $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_a$$\end{document}$ or $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_s$$\end{document}$ , and to a high–performance recogniser $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$f_{\text {adv}}$$\end{document}$ , possibly different from the auditing encoder f. They can perform verification or identification queries against $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mathscr {G}$$\end{document}$ , including open–set search. They do not have access to the recovery code z, to the stego decoder $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$S^{-1}$$\end{document}$ , or to the recovery network R.

White–box variant The attacker knows the architectures and the training protocol of A, S, $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$S^{-1}$$\end{document}$ and R, and may approximate the training distribution, but does not possess the trained weights. This models leakage of methodological details without a model exfiltration event.

Compromise boundary Recovery access is controlled. Unless compromised, the attacker cannot query $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$S^{-1}$$\end{document}$ nor R. If $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$S^{-1}$$\end{document}$ is exfiltrated, we consider the system to have failed its security perimeter and we report this as an ablation in “Experimental setup” section.

Side channels and post–processing The attacker may apply standard image transforms $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mathscr {T}$$\end{document}$ such as resizing, cropping, compression and colour changes before recognition. We therefore audit privacy under $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mathscr {T}$$\end{document}$ to reflect real distribution shifts introduced by storage and sharing pipelines.

Privacy objective

Embedding margin We target low identity similarity between x and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_a$$\end{document}$ under a fixed auditing encoder f. Let $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_x {:=}s(e_x, e_a)$$\end{document}$ . We select a margin $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$m \in [-1,1]$$\end{document}$ and define a margin–based privacy loss

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \mathscr {L}_{\text {priv}} {:=}\max \{0,\, s_x - m\}. \end{aligned}$$\end{document}

The anonymiser is encouraged to decrease $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s_x$$\end{document}$ until it falls below m, beyond which no further penalty is applied so that utility terms can dominate.

Operational metric For reporting, we estimate a receiver operating characteristic by forming genuine scores $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s(e_x, e_a)$$\end{document}$ and impostor scores $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s(e_{x'}, e_a)$$\end{document}$ for $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x' \ne x$$\end{document}$ from a large impostor set. We quote the equal error rate (EER), that is the operating point where false acceptance and false rejection are equal, together with the re–identification rate at EER in an identification protocol with gallery size G. We further report top–k identification success where relevant, and we repeat the evaluation with multiple strong recognisers to reduce model bias.

Robust privacy To reflect real deployment, we evaluate the same metrics after transforms $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$t \in \mathscr {T}$$\end{document}$ applied to $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_a$$\end{document}$ or $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_s$$\end{document}$ , for example JPEG at various qualities, centre and random crops, rescaling, and colour perturbations. Formally, we compute $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s\!\left( f(x), f\!\left( t(x_a)\right) \right)$$\end{document}$ and report the worst–case across $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$t \in \mathscr {T}$$\end{document}$ .

Utility objective

Authorised recovery fidelity Authorised users possessing $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$S^{-1}$$\end{document}$ recover $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{z} = S^{-1}(x_s)$$\end{document}$ and reconstruct $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_r = R(x_s, \hat{z})$$\end{document}$ . We promote high fidelity between $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_r$$\end{document}$ and x using a weighted combination of pixel, perceptual and structural criteria, for example

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \mathscr {L}_{\text {rec}} {:=}\lambda _{1}\,\Vert x_r - x\Vert _{1} \;+\; \lambda _{\text {perc}}\,\textrm{LPIPS}(x_r, x) \;+\; \lambda _{\text {ssim}}\,(1-\textrm{SSIM}(x_r, x)). \end{aligned}$$\end{document}

We report PSNR, SSIM and LPIPS for $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_r$$\end{document}$ against x, and we quantify task utility on ID–irrelevant attributes by measuring performance of detectors, landmarkers and expression estimators run on $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_a$$\end{document}$ and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_s$$\end{document}$ .

Steganographic integrity The recovery code should be faithfully conveyed by the steganographic channel, hence we penalise

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \mathscr {L}_{\text {steg}} {:=}\Vert \,S^{-1}(S(x_a, z)) - z\,\Vert _{2}^{2}, \end{aligned}$$\end{document}

and we report mean squared error of the recovered code both in clean conditions and after transforms $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$t \in \mathscr {T}$$\end{document}$ applied to $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_s$$\end{document}$ . This captures the robustness of the hidden payload to common storage and transport artefacts.

Training objective and decoupling

The total objective balances privacy, recovery fidelity and steganographic integrity:

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \mathscr {L} \;=\; \alpha \,\mathscr {L}_{\text {priv}} \;+\; \beta \,\mathscr {L}_{\text {rec}} \;+\; \gamma \,\mathscr {L}_{\text {steg}}. \end{aligned}$$\end{document}

To prevent the common shortcut where good reconstruction encourages identity preservation, gradients from $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mathscr {L}_{\text {rec}}$$\end{document}$ are not allowed to flow into the anonymiser. Concretely, in training we compute $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_s = S(\,\text {stopgrad}(x_a), z\,)$$\end{document}$ for the recovery path, while privacy is optimised directly on $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_a$$\end{document}$ . This decoupling ensures that privacy is driven by the margin constraint rather than by reconstruction pressure.

Evaluation protocol

We evaluate on aligned $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$256\times 256$$\end{document}$ faces drawing training images from a standard large–scale corpus and auditing on held–out datasets. For privacy we report: cosine similarity statistics between f(x) and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$f(x_a)$$\end{document}$ , ROC curves and EER with re–identification rates under large galleries, and robustness under $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mathscr {T}$$\end{document}$ . For utility we report: PSNR, SSIM and LPIPS of $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_r$$\end{document}$ versus x, task accuracy for detection, landmarks and expressions on $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_a$$\end{document}$ and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_s$$\end{document}$ , and code MSE with and without compression. All privacy results are replicated with multiple recognisers, including margin–based losses such as ArcFace and CosFace, and with a commercial API where allowed, to avoid auditor overfitting.

Success criteria A system satisfies the privacy target if $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s(e_x, e_a) \le m$$\end{document}$ for the vast majority of cases, if the re–identification rate at EER is low under the strongest auditor tested, and if these guarantees degrade gracefully under $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mathscr {T}$$\end{document}$ . It satisfies the utility target if $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_r$$\end{document}$ is close to x according to perceptual and structural metrics and if ID–irrelevant tasks on $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_a$$\end{document}$ retain high accuracy.

Method

Overview

ReFaceX comprises three trainable modules: an anonymiser A with Identity Feature Fusion (IFF), a learned steganographic channel given by an encoder and decoder pair $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$(S, S^{-1})$$\end{document}$ , and a recovery network R. Given an input face x, the anonymiser produces an anonymised image $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_a = A(x, c)$$\end{document}$ , where c is a donor derived identity code that drives identity change while preserving non identity content. The steganographic encoder writes a recovery payload z into $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_a$$\end{document}$ to produce a visually matched stego anonymised image $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_s = S(x_a, z)$$\end{document}$ in the sense of small perceptual deviation^10^. An authorised party extracts $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{z} = S^{-1}(x_s)$$\end{document}$ and reconstructs $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_r = R(x_s, \hat{z})$$\end{document}$ .

A key design choice is the training decoupling between privacy and utility. Gradients from the recovery objective are blocked at $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_s$$\end{document}$ so that the anonymiser cannot satisfy reconstruction by keeping $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_a$$\end{document}$ close to x. Figure 2 illustrates the full dataflow on a single page with the three modules, skip connections and the gradient stop placed at the $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_s$$\end{document}$ branch before it enters R.Fig. 2. ReFaceX modules with privacy and utility decoupling. The anonymiser A uses a donor code from a fixed encoder to produce $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_a$$\end{document}$ . A steganographic channel hides z to form $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_s$$\end{document}$ and recovers $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{z}$$\end{document}$ . The recovery network outputs $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_r=R(x_s,\hat{z})$$\end{document}$ with gradients stopped on the $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_s$$\end{document}$ input. Privacy is audited via cosine similarity of f(x) and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$f(x_a)$$\end{document}$

Donor-driven anonymiser with identity feature fusion

Backbone We use a U-Net generator with four downsampling stages and symmetric upsampling stages with skip connections^13^. Let $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\{F_\ell (x)\}$$\end{document}$ denote encoder features and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\{\tilde{F}_\ell \}$$\end{document}$ the decoder features at matching scales.

Donor identity For each mini-batch, we sample a donor image $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\tilde{x}$$\end{document}$ from within the batch. A fixed face recogniser $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$f(\cdot )$$\end{document}$ (ArcFace^3^ unless stated) produces an embedding $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$e_{\textrm{don}} = f(\tilde{x}) \in \mathbb {R}^{p}$$\end{document}$ , which is projected to a d-dimensional donor code c. Spatial identity maps $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$M_\ell$$\end{document}$ are formed by $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$1{\times }1$$\end{document}$ projections and resizing to align with decoder scales.

Identity Feature Fusion (IFF) IFF injects the donor signal where it is most effective while protecting non-identity content. At each decoder level, the content skip $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$C_\ell$$\end{document}$ and the identity map $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$M_\ell$$\end{document}$ are combined by a learned gate that attenuates identity-bearing components and preserves geometry, shading, hair and background. The fused features are then joined with the decoder stream to form $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\tilde{F}_\ell$$\end{document}$ . This design provides controlled identity displacement while keeping non-identity attributes stable. Full expressions and layerwise details are given inAppendix A (Eqs. ??–??).

Why donors help Driving anonymisation only by pushing away from the source identity is unstable and can be defeated by reconstruction pressure. A donor embedding provides a concrete target region on the identity manifold. The anonymiser learns to move predictably towards a donor identity while the margin based loss (“Detached recovery network” section) keeps similarity to the source below a set threshold. This follows the spirit of margin based metric learning used in modern face recognition^3,4^.

Learned steganographic channel

Encoder The steganographic encoder S hides a real valued code $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$z \in \mathbb {R}^{d}$$\end{document}$ within $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_a$$\end{document}$ . A linear layer reshapes z into a low resolution mask that is bilinearly upsampled and concatenated with $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_a$$\end{document}$ . A shallow convolutional stack produces a residual that is added to $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_a$$\end{document}$ and passed through a squashing nonlinearity, resulting in $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_s = S(x_a, z)$$\end{document}$ . The design remains intentionally lightweight to minimise image distortion as measured by LPIPS^10^.

Decoder The steganographic decoder $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$S^{-1}$$\end{document}$ is a compact CNN with strided convolutions and global average pooling followed by a linear head that predicts $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{z} = S^{-1}(x_s)$$\end{document}$ . Training uses mean squared error on the code with optional normalisation.

Robustness hooks To increase resilience to real pipelines we optionally insert a differentiable corruption layer during training that simulates JPEG compression, rescaling and light crops. This toggle is ablated in “Experimental setup” section.

Detached recovery network

Design The recovery network reconstructs $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_r = R(x_s, \hat{z})$$\end{document}$ . We detach the stego image on the recovery path to block gradients from reaching the anonymiser:

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} x_r \,=\, R\!\left( \textrm{stopgrad}(x_s), \hat{z}\right) . \end{aligned}$$\end{document}

This prevents the common shortcut where the anonymiser preserves identity to make reconstruction easier.

Architecture R is a light encoder–decoder conditioned on $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{z}$$\end{document}$ through a per sample affine modulation or feature concatenation. An encoder extracts features from $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_s$$\end{document}$ . A code projection turns $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{z}$$\end{document}$ into a channel map that is concatenated with encoder features. A decoder with two upsampling stages predicts $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_r$$\end{document}$ with a final sigmoid. The perceptual fidelity term uses LPIPS^10^.

Losses and optimisation

Design motivations Our objective is to lower identity similarity in modern recognition spaces while preserving visual fidelity of the recovered image. The losses are partitioned along the decoupled pathways: the privacy term supervises the anonymiser A through a frozen auditor (or a set of auditors), whereas the utility terms supervise the recovery network R and the steganographic pair $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$(S,S^{-1})$$\end{document}$ . We enforce decoupling by stopping gradients at $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_s$$\end{document}$ , so that utility losses cannot pressure A to keep identity. Multi-auditor identity loss with margin. Let s(u, v) denote cosine similarity. Given a set of auditors $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mathscr {F}=\{f_k\}_{k=1}^{K}$$\end{document}$ (FaceNet, ArcFace, AdaFace in our default) and target margins $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\{m_k\}$$\end{document}$ , the privacy term is

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \mathscr {L}_{\textrm{id}} \;=\; \frac{1}{K}\sum _{k=1}^{K} \max \!\Bigl ( s\!\bigl (f_k(x), f_k(x_a)\bigr ) - m_k,\; 0 \Bigr ). \end{aligned}$$\end{document}

Each $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$f_k$$\end{document}$ is frozen. This formulation gives a measurable target, $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s(f_k(x), f_k(x_a)) \le m_k$$\end{document}$ , and reduces overfitting to a single embedding space. In practice we use $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$(m_{\text {FaceNet}}, m_{\text {ArcFace}}, m_{\text {AdaFace}})=(0.20, 0.20, 0.20)$$\end{document}$ unless stated.

Margin scheduling (curriculum) To avoid optimisation stalls early in training, we use a cosine schedule for each margin

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} m_k(t) \;=\; m^{\text {final}}_k \;+\; \tfrac{1}{2}\!\left( m^{\text {start}}_k - m^{\text {final}}_k\right) \!\left( 1+\cos \!\tfrac{\pi t}{T}\right) , \end{aligned}$$\end{document}

with $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$m^{\text {start}}_k=0.35$$\end{document}$ , $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$m^{\text {final}}_k=0.20$$\end{document}$ , and T the first 25% of training steps. This eases the network into stricter privacy.

Content and perceptual recovery terms The recovery path is supervised by pixel and perceptual fidelity. Pixel content:

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \mathscr {L}_{\textrm{con}} \;=\; \Vert x_r - x \Vert _{1}. \end{aligned}$$\end{document}

Perceptual fidelity uses LPIPS on inputs scaled to $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$[-1,1]$$\end{document}$ ,

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \mathscr {L}_{\textrm{perc}} \;=\; \textrm{LPIPS}\!\bigl (2x_r-1,\; 2x-1\bigr ) \quad \text {[10]}. \end{aligned}$$\end{document}

These terms act only on R because we detach $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_s$$\end{document}$ on the recovery branch; see gradient discussion below.

Steganography code regression and robustness The code regression is

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \mathscr {L}_{\textrm{steg}} \;=\; \Vert \hat{z} - z \Vert _{2}^{2}, \end{aligned}$$\end{document}

with z whitened to zero mean and unit variance per dimension. When robustness hooks are enabled, a differentiable JPEG layer is applied to $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_s$$\end{document}$ with quality sampled uniformly from $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\{90,70,50,30\}$$\end{document}$ before $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$S^{-1}$$\end{document}$ , which improves payload survival under common platform transforms^18,19^. Optional light ECC over z (parity bits at 8% overhead) can be toggled; its effect is ablated in “Implementation details” section.

Colour regulariser (optional) To suppress subtle hue drift without encouraging identity,

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \mathscr {L}_{\textrm{col}} \;=\; \Vert \mu (x_a)-\mu (x) \Vert _2^2 \;+\; \Vert \sigma (x_a)-\sigma (x) \Vert _2^2, \end{aligned}$$\end{document}

where $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mu (\cdot )$$\end{document}$ and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sigma (\cdot )$$\end{document}$ are per-channel spatial mean and standard deviation. We use a small weight so that colour alignment does not counteract identity change.

Final objective and gradient flow

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \mathscr {L} \;=\; \lambda _{\textrm{id}}\,\mathscr {L}_{\textrm{id}} \;+\; \lambda _{\textrm{con}}\,\mathscr {L}_{\textrm{con}} \;+\; \lambda _{\textrm{perc}}\,\mathscr {L}_{\textrm{perc}} \;+\; \lambda _{\textrm{steg}}\,\mathscr {L}_{\textrm{steg}} \;+\; \alpha \,\mathscr {L}_{\textrm{col}}. \end{aligned}$$\end{document}

With $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_r = R(\textrm{stopgrad}(x_s),\hat{z})$$\end{document}$ , we have $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\tfrac{\partial \mathscr {L}_{\textrm{con}}}{\partial A}=0$$\end{document}$ and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\tfrac{\partial \mathscr {L}_{\textrm{perc}}}{\partial A}=0$$\end{document}$ , so utility terms do not influence the anonymiser. In contrast, $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\tfrac{\partial \mathscr {L}_{\textrm{id}}}{\partial A}\ne 0$$\end{document}$ through $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$f_k$$\end{document}$ and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_a$$\end{document}$ . Default weights and normalisation. Unless stated, we use $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\lambda _{\textrm{id}}{=}1.0$$\end{document}$ , $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\lambda _{\textrm{con}}{=}50$$\end{document}$ , $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\lambda _{\textrm{perc}}{=}10$$\end{document}$ , $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\lambda _{\textrm{steg}}{=}1.0$$\end{document}$ , $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\alpha {=}0.05$$\end{document}$ . Inputs are scaled to [0, 1]; LPIPS uses its standard backbone with inputs remapped to $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$[-1,1]$$\end{document}$ . Cosine similarities are computed on L2-normalised embeddings. Optimiser and schedule. We use Adam^21^ with $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\beta _1{=}0.5$$\end{document}$ , $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\beta _2{=}0.999$$\end{document}$ , mixed precision, and gradient clipping at 1.0. Learning rates: $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\eta _A{=}2{\times }10^{-4}$$\end{document}$ for A, $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\eta _{S,S^{-1}}{=}1{\times }10^{-4}$$\end{document}$ , $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\eta _R{=}2{\times }10^{-4}$$\end{document}$ . A cosine decay with 5% warm-up is applied over the full budget. We train jointly end-to-end; an optional 10k-step warm-start of $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$(S,S^{-1})$$\end{document}$ on synthetic pairs improves early stability but is not required for the reported results. Batching and donor sampling. We use batch size chosen to saturate the RTX 3090 at $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$256{\times }256$$\end{document}$ (typically 16). Donor identities are sampled in-batch without replacement, with pose-matched sampling as an ablation. Robustness hooks. When enabled, JPEG qualities are sampled uniformly from $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\{90,70,50,30\}$$\end{document}$ , and light random resize-and-crop is applied with probability 0.2 on the $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_s$$\end{document}$ input of $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$S^{-1}$$\end{document}$ . These transforms are detached from A. Checkpoint selection. We select the checkpoint that minimises a composite validation objective

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$J \;=\; \overline{s(f(x),f(x_a))} \;+\; \gamma _1\,\textrm{LPIPS}(x_r,x) \;+\; \gamma _2\,\textrm{MSE}_z\text {@Q=70},$$\end{document}

with $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$(\gamma _1,\gamma _2)=(1,1)$$\end{document}$ by default, evaluated on a held-out split. We report the corresponding test results in “Results” section.

Experimental setup

Datasets and preprocessing

Training We train on FFHQ^22^ aligned to $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$256{\times }256$$\end{document}$ . FFHQ contains 70k high quality faces with diverse ages, backgrounds and accessories. We follow the standard alignment protocol used for FFHQ: face detection with MTCNN^23^, five-point landmark alignment to canonical eye and mouth locations, similarity warp, and centre-crop to $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$256{\times }256$$\end{document}$ . Images that fail detection or produce extreme crop ratios are discarded. After filtering, the resulting training set contains roughly 50k to 55k images in our pipeline.

Evaluation We evaluate on CelebA-HQ 256^24,25^ and LFW^26^. For CelebA-HQ we use the publicly available 30k images, aligned and resized to $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$256{\times }256$$\end{document}$ using the same MTCNN pipeline. For LFW we use the standard 13k images, remove grayscale or low-resolution outliers, and align to $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$256{\times }256$$\end{document}$ as above. Unless otherwise stated, we report metrics on the union of CelebA-HQ and LFW to test cross-dataset generalisation.

Subgroup labels Where subgroup analysis is reported, we derive coarse demographic labels with FairFace^27^ run on the aligned crops, and group by apparent gender presentation and Fitzpatrick skin tone bins. We treat these as noisy labels and report both macro- and micro-averaged metrics, together with worst-group performance.

Baselines

Irreversible (i) Gaussian blur with kernel size chosen to match a fixed face recognition false accept rate. (ii) Mosaic pixelation with stride selected to produce comparable recognition difficulty. (iii) Face swapping to a fixed template identity using a modern swapper with identity from a held-out donor; this approximates an operational irreversible baseline.

Reversible We compare with representative methods that embed a recovery key or password within the anonymised content, including FIT^28^, RiDDLE^17^, CIAGAN^14^, and FALCO^16^. For methods that require GAN inversion, we follow the authors’ recommended settings.

GAN versus diffusion Where feasible we include a diffusion-based anonymiser adapted from a face editing pipeline^29,30^ configured to minimise identity similarity while retaining attributes. If diffusion results are omitted, we justify exclusion on computational grounds and the lack of controllable reversible channels within standard diffusion pipelines.

Metrics

Privacy We measure identity similarity between the original and anonymised images using cosine similarity of a fixed face encoder $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$f(\cdot )$$\end{document}$ ^31^. By default f is ArcFace^3^ trained on MS1MV2. We report the distribution of $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\cos (f(x), f(x_a))$$\end{document}$ , the ROC curve for matched pairs versus an impostor pool built by cross pairing anonymised images with non-matching originals, the equal error rate (EER), the re-identification rate at the EER threshold, and the operating threshold together with FAR and FRR. We also report open-set search performance by indexing a large distractor gallery and measuring top-k identification of $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_a$$\end{document}$ to x.

Utility For authorised recovery we report PSNR and SSIM between $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_r$$\end{document}$ and x, and LPIPS^10^ computed on inputs scaled to $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$[-1,1]$$\end{document}$ . To assess non-identity utility we run standard detectors and landmark estimators on $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_a$$\end{document}$ and report mAP and NME relative to their performance on x.

Stego robustness We quantify the integrity of the embedded payload by the mean squared error between z and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{z}$$\end{document}$ under corruptions. We apply JPEG compression at qualities $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Q\in \{90,70,50,30\}$$\end{document}$ , centre-crop and rescale, and mild Gaussian noise. We report $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Vert \hat{z} - z \Vert _2^2$$\end{document}$ and success rates for exact-bit recovery when z is quantised. This follows the robustness practice in learned image steganography^7,32^.

Compute profile We report inference latency per image, throughput for batch size $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\{1,8,16\}$$\end{document}$ , and peak VRAM on an RTX 3090 with 24 GB memory. We also report parameter counts and multiply-accumulate operations for each module.

Implementation details

Optimisation We train with Adam^21,33^ with learning rate^34^ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$2{\times }10^{-4}$$\end{document}$ , $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\beta _1{=}0.5$$\end{document}$ , $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\beta _2{=}0.999$$\end{document}$ , weight decay 0. Mixed precision is enabled. Batch size is 16 at $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$256{\times }256$$\end{document}$ on a single RTX 3090. We use code dimension $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$d{=}256$$\end{document}$ . Loss weights are $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\lambda _{\text {id}}{=}5.0$$\end{document}$ , $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\lambda _{\text {con}}{=}20.0$$\end{document}$ , $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\lambda _{\text {perc}}{=}0.5$$\end{document}$ , $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\lambda _{\text {steg}}{=}5.0$$\end{document}$ , and colour regulariser $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\alpha {=}0.5$$\end{document}$ where used. Inputs to LPIPS are scaled to $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$[-1,1]$$\end{document}$ ^35,36^.

Data pipeline We use PyTorch DataLoaders with random horizontal flip and colour jitter of small magnitude for x. Donor sampling is within-batch shuffling for the anonymiser. On Windows within Jupyter we set num_workers to 0 to avoid multiprocessing issues, and to 4 to 8 when training from the command line.

Training schedule and selection We train for 30 epochs on FFHQ. Validation is run after each epoch on the union of CelebA-HQ and LFW. We select checkpoints by the best validation objective defined as the weighted sum in “Detached recovery network” section. Where multiple seeds are used, we report mean and standard deviation over three seeds. Wall-clock training on a single RTX 3090 is approximately 10 to 14 hours depending on augmentation and logging.

Reproducibility We fix random seeds, log all hyperparameters, and release scripts to reproduce data alignment, metric computation, and robustness tests. All third party models are referenced and version pinned, including ArcFace weights for $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$f(\cdot )$$\end{document}$ and LPIPS weights.

Component ablations

We quantify the contribution of the three core components under the protocol of “Evaluation protocol” section. Privacy is the mean cosine similarity between f(x) and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$f(x_a)$$\end{document}$ on LFW using FaceNet, ArcFace, and AdaFace (lower is better). Utility is measured on CelebA-HQ using PSNR, SSIM, and LPIPS between x and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_r$$\end{document}$ (higher is better for PSNR and SSIM, lower is better for LPIPS). Steganographic robustness is the MSE between z and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{z}$$\end{document}$ after JPEG compression at qualities $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\{90,70,50,30\}$$\end{document}$ .

Effect of gradient detachment Detaching gradients at $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_s$$\end{document}$ prevents the anonymiser from retaining identity to satisfy reconstruction. As shown in Table 1, privacy improves markedly (FaceNet $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.032\!\rightarrow \!0.009$$\end{document}$ , ArcFace $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.028\!\rightarrow \!0.008$$\end{document}$ , AdaFace $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.038\!\rightarrow \!0.017$$\end{document}$ ), while recovery fidelity remains essentially unchanged (PSNR $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$24.10\!\rightarrow \!23.97$$\end{document}$ dB, SSIM $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.938\!\rightarrow \!0.938$$\end{document}$ , LPIPS $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.098\!\rightarrow \!0.100$$\end{document}$ ).Table 1. Effect of gradient detachment at $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_s$$\end{document}$ . Privacy is mean cosine similarity on LFW (lower is better). Utility on CelebA-HQ (higher is better for PSNR and SSIM, lower is better for LPIPS).VariantFaceNet $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\downarrow$$\end{document}$ ArcFace $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\downarrow$$\end{document}$ AdaFace $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\downarrow$$\end{document}$ PSNR $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\uparrow$$\end{document}$ SSIM $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\uparrow$$\end{document}$ LPIPS $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\downarrow$$\end{document}$ NoDetach0.0320.0280.03824.100.9380.098Detach (ReFaceX)0.0090.0080.01723.970.9380.100

Donor guidance versus self repulsion Providing a donor direction improves privacy and stabilises non identity content compared with pushing embeddings away from the source without a target. Table 2 shows lower identity similarity (ArcFace $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.021\!\rightarrow \!0.008$$\end{document}$ , AdaFace $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.032\!\rightarrow \!0.017$$\end{document}$ ) together with better landmark fidelity and lower background LPIPS, indicating stronger content preservation.Table 2. Donor driven guidance improves privacy and content stability. Landmark error is normalised L2 on 68 points. BG LPIPS is computed outside a face maskVariantArcFace $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\downarrow$$\end{document}$ AdaFace $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\downarrow$$\end{document}$ Landmark L2 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\downarrow$$\end{document}$ BG LPIPS $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\downarrow$$\end{document}$ SelfRepel0.0210.0324.850.128Donor (ReFaceX)0.0080.0173.570.101

Steganographic robustness JPEG aware training of the steganographic channel substantially reduces payload error under recompression with minimal change in visual fidelity. As reported in Table 3, code MSE improves at all JPEG qualities (e.g., at quality 50: $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$4.9{\times }10^{-3}\!\rightarrow \!2.3{\times }10^{-3}$$\end{document}$ ). Utility shifts are small (PSNR $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$24.02\!\rightarrow \!23.95$$\end{document}$ dB, SSIM unchanged at 0.937, LPIPS $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.099\!\rightarrow \!0.101$$\end{document}$ ).Table 3JPEG aware training for the steganographic channel reduces code MSE across JPEG qualities. Privacy is unaffected; utility shows a small trade off (slightly lower PSNR and slightly higher LPIPS), while SSIM is unchangedVariantMSE@90 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\downarrow$$\end{document}$ MSE@70 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\downarrow$$\end{document}$ MSE@50 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\downarrow$$\end{document}$ MSE@30 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\downarrow$$\end{document}$ PSNR $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\uparrow$$\end{document}$ SSIM $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\uparrow$$\end{document}$ LPIPS $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\downarrow$$\end{document}$ NoJPEG train $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$4.8\!\times \!10^{-4}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$1.6\!\times \!10^{-3}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$4.9\!\times \!10^{-3}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$1.5\!\times \!10^{-2}$$\end{document}$ 24.020.9370.099JPEG train (ReFaceX) $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$3.1\!\times \!10^{-4}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$7.9\!\times \!10^{-4}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$2.3\!\times \!10^{-3}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$7.1\!\times \!10^{-3}$$\end{document}$ 23.950.9370.101

Across Tables 1, 2, 3, ReFaceX’s default configuration (detachment at $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_s$$\end{document}$ , donor guided anonymisation, JPEG aware steganography) yields the best privacy with negligible cost in utility and a clear improvement in payload robustness.

Results

Evaluation protocol

Unless stated otherwise, all anonymisation results are computed on LFW with the standard verification pairs. For each image, we obtain identity embeddings using three recognisers: FaceNet trained on VGGFace2, ArcFace trained on MS1MV3, and AdaFace trained on WebFace12M. We report the mean cosine similarity and its standard deviation between the original f(x) and the anonymised $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$f(x_{a})$$\end{document}$ embeddings (lower indicates stronger anonymisation). Recovery fidelity is assessed by the cosine similarity between f(x) and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$f(x_{r})$$\end{document}$ (lower indicates better identity restoration to the true subject). On CelebA-HQ we evaluate full reference image quality of recovered images using SSIM, LPIPS, MAE and PSNR. Computational results are measured on $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$256{\times }256$$\end{document}$ inputs with batch size 1 on an RTX 3090.

Anonymisation efficacy

Table 4 and 5 reports mean cosine similarity between original and anonymised identity embeddings. ReFaceX attains the lowest similarity under all three recognisers, outperforming CIAGAN ^14^, FALCO ^16^, FIT ^28^ and RiDDLE ^17^. The improvement is consistent across embedding spaces, which indicates that our donor driven anonymiser does not overfit to a single recogniser.Table 4. Anonymisation on LFW. Mean cosine similarity ± standard deviation between f(x) and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$f(x_{a})$$\end{document}$ . Lower is betterApproachFaceNet (VGGFace2)ArcFace (MS1MV3)AdaFace (WebFace12M)RiDDLE ^17^ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.018 \pm 0.003$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.012 \pm 0.008$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.024 \pm 0.011$$\end{document}$ CIAGAN ^14^ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.032 \pm 0.015$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.020 \pm 0.010$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.025 \pm 0.011$$\end{document}$ FALCO ^16^ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.018 \pm 0.005$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.016 \pm 0.009$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.023 \pm 0.011$$\end{document}$ FIT ^28^ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.033 \pm 0.017$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.029 \pm 0.018$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.035 \pm 0.021$$\end{document}$ ReFaceX****0.009 ± 0.003****0.008 ± 0.004****0.017 ± 0.010Table 5. Anonymisation on CELEBA-HQ. Mean cosine similarity ± standard deviation between f(x) and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$f(x_{r})$$\end{document}$ . Lower is betterApproachFaceNet (VGGFace2)ArcFace (MS1MV3)AdaFace (WebFace12M)RiDDLE ^17^ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.1832 \pm 0.0027$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.1294 \pm 0.0078$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.1361 \pm 0.0050$$\end{document}$ CIAGAN ^14^ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.2243 \pm 0.0038$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.2199 \pm 0.0089$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.2279 \pm 0.0037$$\end{document}$ FALCO ^16^ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.2394 \pm 0.0020$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.1871 \pm 0.0080$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.1829 \pm 0.0028$$\end{document}$ FIT ^28^ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.2243 \pm 0.0025$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.2399 \pm 0.0069$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.2081 \pm 0.0045$$\end{document}$ ReFaceX****0.1672 ± 0.0011****0.0998 ± 0.0007****0.1189 ± 0.0013

Recovered image quality

Table 6 evaluates structure and appearance of the recovered images on CelebA-HQ using SSIM ^37^, LPIPS ^10^, MAE and PSNR ^38^. ReFaceX improves SSIM and MAE markedly over FIT and RiDDLE and attains the best LPIPS and PSNR, which indicates that identity restoration does not come at the expense of visual fidelity.Table 6. Recovered image quality on CelebA-HQ. LPIPS and MAE are lower better. SSIM and PSNR are higher betterApproachSSIM $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\uparrow$$\end{document}$ LPIPS $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\downarrow$$\end{document}$ MAE $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\downarrow$$\end{document}$ PSNR $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\uparrow$$\end{document}$ FIT ^28^0.68970.19450.081721.4201RiDDLE ^17^0.64670.13230.089120.9812ReFaceX0.93780.10020.0046123.9711

Computational efficiency

Table 7 compares parameter count, FLOPs and per image latency. Although ReFaceX is not the smallest by parameters, it delivers the fastest runtime, which is beneficial for throughput constrained deployments such as live video processing.Table 7. Computational complexity on $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$256{\times }256$$\end{document}$ inputs. Parameters in millions, FLOPs in billions, and per image time on an RTX 3090. Best figures in boldApproachParams. (M)FLOPs (G)Time (s)RiDDLE41.029236.7020.039CIAGAN12.519****39.4980.031FALCO31.488151.9238.281FIT41.319121.4910.036ReFaceX17.173133.1420.005

Qualitative comparison

The anonymisation results on the CelebA-HQ dataset show that CIAGAN ^14^ and FIT ^28^ are able to modify identity characteristics but often introduce visible distortions or excessive smoothing. RiDDLE ^17^ and FALCO ^16^ better preserve low-level textures, yet they frequently alter attributes unrelated to identity, including hair, background context, and facial expression. In contrast, ReFaceX consistently changes identity while preserving non-identity content, which aligns with the intended design of the identity feature fusion module that suppresses identity cues while retaining content-specific features.

For recovery, ReFaceX reconstructs images with high fidelity, restoring fine structural and textural details while introducing minimal artefacts. By comparison, FIT and RiDDLE exhibit more pronounced identity drift or noticeable texture degradation during recovery. These qualitative observations are in agreement with the quantitative recovery and reconstruction quality metrics reported in Tables 5 and 6.

Failure analysis

Frequent failures include heavy occlusion and extreme motion blur, where donor selection may introduce attribute bleed, and rare accessories where the steganographic payload can be partially corrupted. A mosaic of such cases with captions is provided in the supplementary material. These patterns suggest future improvements in donor selection conditioned on pose and occlusion, and robustness training for the steganographic channel.

Privacy and utility frontier

We sweep the identity weight $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\lambda _{\textrm{id}}$$\end{document}$ while keeping recovery weights fixed and plot privacy as the mean cosine similarity between f(x) and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$f(x_{a})$$\end{document}$ , and utility as PSNR and LPIPS between x and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_{r}$$\end{document}$ on CelebA-HQ. The frontier is smooth and monotonic. Increasing $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\lambda _{\textrm{id}}$$\end{document}$ reduces identity similarity with only a gradual decrease in recovery fidelity, which supports the benefit of training with the gradient stop. The default operating point used in Tables 4 and 5 is indicated on the plots.

Results summary and transition

Across LFW and CelebA-HQ, ReFaceX delivers the strongest privacy while preserving or improving utility. It achieves the lowest mean identity similarity under FaceNet, ArcFace and AdaFace (Table 4); recovers to the correct identity with the lowest similarity to the original embeddings (Table 5); attains the best SSIM, LPIPS, MAE and PSNR for recovered images (Table 6); and is the fastest at inference among compared methods (Table 7). The privacy–utility frontier is smooth and monotonic (“Privacy and utility frontier” section), and the qualitative comparisons mirror the quantitative trends. The discussion that follows explains why these gains arise from the decoupled design of ReFaceX and where the remaining limitations lie.

Discussion

Scope and positioning ReFaceX is an integrated engineering solution that combines donor-guided identity steering inside a U-Net with identity feature fusion (IFF), a detached optimisation path for recovery, a lightweight steganographic payload channel, and a margin-based privacy objective audited by strong recognisers. The contribution lies in how these components are arranged and constrained under an explicit threat model so that privacy and utility do not interfere.

How the integration shifts the privacy–utility frontier Four choices act in concert. Donor guidance provides a controlled direction in the identity manifold; gradient detachment at $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_{s}$$\end{document}$ prevents reconstruction losses from undoing privacy; the steganographic pathway carries only the payload required for recovery; and a multi-auditor margin term sets a measurable privacy target. The net effect is a leftward and upward shift of the frontier: lower similarity between f(x) and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$f(x_{a})$$\end{document}$ at comparable or better PSNR, SSIM and LPIPS for $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_{r}$$\end{document}$ across FaceNet, ArcFace and AdaFace.

Strengthening adversarial evaluation with transferable attacks Beyond black-box and white-box audits, a realistic adversary may apply transferable adversarial perturbations to increase re-identification. We therefore extend the evaluation with state-of-the-art transfer attacks that aim to maximise identity similarity:

Attack goals (i) Identity-restoration attack: maximise $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\cos \!\bigl (f(x), f(x_{a} + \delta )\bigr )$$\end{document}$ ; (ii) Donor-attraction attack: maximise $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\cos \!\bigl (f(x_{a} + \delta ), f(\tilde{x})\bigr )$$\end{document}$ for donor $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\tilde{x}$$\end{document}$ . Both are constrained to $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Vert \delta \Vert _{\infty } \le \varepsilon$$\end{document}$ with $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varepsilon \in \{2,4,8\}/255$$\end{document}$ .
Transfer sources Craft $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\delta$$\end{document}$ on a surrogate ensemble of recognisers distinct from the target auditor and evaluate transfer to the target. We instantiate Neighbourhood Expectancy Attribution Attacks (NEAA)^39^ and the Multi-Feature Attention Attack (MFAA)^40^, which respectively use neighbourhood-based attribution and multi-layer attention guidance to improve transfer.
Targets and compositions Apply $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\delta$$\end{document}$ to $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_{a}$$\end{document}$ (privacy audit) and to $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_{s}$$\end{document}$ (audit with the stego branch visible), then re-measure mean cosine similarity, ROC, and open-set re-identification at the operating threshold. Compose attacks with post-processing (JPEG at Q in $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\{90,70,50\}$$\end{document}$ and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$0.95\times$$\end{document}$ rescale) to match platform pipelines.
Reporting For each $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varepsilon$$\end{document}$ and attack, report the uplift in similarity and the shift in EER and threshold. For fairness, stratify by pose and occlusion to expose any concentrated failure modes. This protocol tests whether identity removal remains effective when an adaptive but transfer-limited adversary perturbs inputs. Defences that preserve our design principles include auditor-aware data augmentation during training (light adversarial sampling against f within the margin objective) and a confidence-based audit that rejects near-threshold matches. These defences do not entangle recovery with anonymisation and therefore maintain the decoupling that underpins ReFaceX.

Limitation: donor selection under challenging conditions Random in-batch donor selection is a practical simplification. Under extreme yaw, occlusion or adverse illumination, a poorly matched donor can increase attribute bleed or destabilise IFF gating. As shown in our donor-policy ablation, pose-matched donors reduce background LPIPS and landmark error while improving privacy. Mitigations include pose- and visibility-aware retrieval over a small memory bank, quality filters, fairness-aware constraints, and a back-off to a multi-donor mixture when no suitable donor exists. These additions are orthogonal to the core objectives.

What we do not claim We do not introduce a new face recogniser, a new steganographic primitive, or a new loss family. We also do not claim certified privacy. Our contribution is the configuration and training protocol that separates objectives under a concrete attacker model, now strengthened with transferable-attack audits.

Stability and efficiency A light U-Net anonymiser and a shallow stego pair deliver fast inference with moderate parameters. Objectives act locally: $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mathscr {L}_{\textrm{id}}$$\end{document}$ supervises A, $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mathscr {L}_{\textrm{steg}}$$\end{document}$ supervises the steg pair, and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mathscr {L}_{\textrm{con}}$$\end{document}$ and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mathscr {L}_{\textrm{perc}}$$\end{document}$ supervise R. This reduces gradient interference and improves training stability.

Conclusion

This work addresses a practical question: how to share face images that remain useful while protecting identity. ReFaceX offers an integrated, auditable solution by separating what to protect from what to preserve. Donor-guided identity feature fusion steers identity change in a controlled direction, gradient detachment stops reconstruction from reintroducing identity, and a lightweight steganographic channel transports only the payload required for authorised recovery. In combination, these choices deliver lower identity similarity under multiple strong auditors and high quality recovery, while retaining efficient runtime. We have clarified that our contribution is integrative engineering rather than the invention of new primitives. The value lies in the arrangement and constraints that keep privacy and utility from interfering, under a concrete threat model. In line with the reviewer’s request, we strengthened the evaluation protocol to include transferable adversarial attacks and outlined compatible defences that preserve the decoupling principle. Limitations remain. The current random in-batch donor selection can be suboptimal under extreme pose or occlusion, and the learned steganographic channel lacks cryptographic guarantees. Future work will add pose- and visibility-aware donor retrieval, stronger error correction and key management for the payload, certified robustness where possible, and extensions to video and higher resolutions, while keeping the central principle unchanged: optimise privacy and utility jointly without forcing a trade-off.

Supplementary Information

Supplementary Information.

Bibliography36

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Huang, G. B., Mattar, M., Berg, T. & Learned-Miller, E. Labeled faces in the wild: A database for studying face recognition in unconstrained environments. In Workshop on Faces in Real-Life Images (2008).
2Hukkelås, H., Mester, R. & Lindseth, F. Deepprivacy: A generative adversarial network for face anonymization. In ISVC (2019).
3Deng, J., Guo, J., Xue, N. & Zafeiriou, S. Arcface: Additive angular margin loss for deep face recognition. In CVPR (2019).10.1109/TPAMI.2021.308770934106845 · doi ↗ · pubmed ↗
4Wang, H. et al. Cosface: Large margin cosine loss for deep face recognition. In CVPR (2018).
5Isola, P., Zhu, J.-Y., Zhou, T. & Efros, A. A. Image-to-image translation with conditional adversarial networks. In CVPR (2017).
6Ledig, C. et al. Photo-realistic single image super-resolution using a generative adversarial network. In CVPR (2017).
7Baluja, S. Hiding images in plain sight: Deep steganography. In Neur IPS (2017).
8Zhou, Z., Han, S. & Liu, X. A security analysis of generative steganography. IEEE Access (2018).