Time-Domain Voice Identity Morphing (TD-VIM): A Signal-Level Approach to Morphing Attacks on Speaker Verification Systems

Aravinda Reddy PN; Raghavendra Ramachandra; K.Sreenivasa Rao; Pabitra Mitra; Kunal Singh

arXiv:2604.05683·cs.SD·April 8, 2026

Time-Domain Voice Identity Morphing (TD-VIM): A Signal-Level Approach to Morphing Attacks on Speaker Verification Systems

Aravinda Reddy PN, Raghavendra Ramachandra, K.Sreenivasa Rao, Pabitra Mitra, Kunal Singh

PDF

TL;DR

This paper introduces TD-VIM, a novel signal-level voice morphing technique that significantly increases vulnerability in speaker verification systems, demonstrated through high success rates in attack simulations.

Contribution

The work presents a new time-domain voice morphing method and evaluates its effectiveness against multiple speaker verification systems, revealing substantial security risks.

Findings

01

High attack success rates with G-MAP values over 99%

02

Effective in both deep-learning and commercial systems

03

Demonstrates vulnerability of current speaker verification methods

Abstract

In biometric systems, it is a common practice to associate each sample or template with a specific individual. Nevertheless, recent studies have demonstrated the feasibility of generating "morphed" biometric samples capable of matching multiple identities. These morph attacks have been recognized as potential security risks for biometric systems. However, most research on morph attacks has focused on biometric modalities that operate within the image domain, such as the face, fingerprints, and iris. In this work, we introduce Time-domain Voice Identity Morphing (TD-VIM), a novel approach for voice-based biometric morphing. This method enables the blending of voice characteristics from two distinct identities at the signal level, creating morphed samples that present a high vulnerability for speaker verification systems. Leveraging the Multilingual Audio-Visual Smartphone database, our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.