Towards Reliable Audio Deepfake Attribution and Model Recognition: A Multi-Level Autoencoder-Based Framework

Andrea Di Pierno (1); Luca Guarnera (2); Dario Allegra (2); Sebastiano Battiato (2) ((1) IMT School of Advanced Studies; (2) University of Catania)

arXiv:2508.02521·cs.SD·October 13, 2025

Towards Reliable Audio Deepfake Attribution and Model Recognition: A Multi-Level Autoencoder-Based Framework

Andrea Di Pierno (1), Luca Guarnera (2), Dario Allegra (2), Sebastiano Battiato (2) ((1) IMT School of Advanced Studies, (2) University of Catania)

PDF

TL;DR

This paper presents LAVA, a hierarchical autoencoder-based framework for reliable attribution and recognition of audio deepfakes, achieving high accuracy and robustness across multiple datasets and open-set conditions.

Contribution

Introduces a novel multi-level autoencoder framework for audio deepfake attribution and model recognition, addressing open-set robustness and providing publicly available models and code.

Findings

01

ADA achieves over 95% F1-score on all datasets

02

ADMR reaches 96.31% macro F1 across six classes

03

LAVA demonstrates robustness on unseen attacks

Abstract

The proliferation of audio deepfakes poses a growing threat to trust in digital communications. While detection methods have advanced, attributing audio deepfakes to their source models remains an underexplored yet crucial challenge. In this paper we introduce LAVA (Layered Architecture for Voice Attribution), a hierarchical framework for audio deepfake detection and model recognition that leverages attention-enhanced latent representations extracted by a convolutional autoencoder trained solely on fake audio. Two specialized classifiers operate on these features: Audio Deepfake Attribution (ADA), which identifies the generation technology, and Audio Deepfake Model Recognition (ADMR), which recognize the specific generative model instance. To improve robustness under open-set conditions, we incorporate confidence-based rejection thresholds. Experiments on ASVspoof2021, FakeOrReal, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.