Accent Conversion: A Problem-Driven Survey of Sociolinguistic and Technical Constraints
Yurii Halychanskyi, Jianfeng Steven Guo, Volodymyr Kindratenko

TL;DR
This survey reviews the evolution of accent conversion techniques, highlighting challenges, linguistic considerations, datasets, and future directions for more controllable and perceptually consistent accent transformation.
Contribution
It provides a comprehensive overview of both sociolinguistic and technical constraints shaping accent conversion research and practice.
Findings
Transition from rule-based to neural methods for accent conversion
Analysis of application-specific constraints on accent modification
Identification of persistent challenges and future research directions
Abstract
Accent conversion has rapidly progressed alongside growing interest in improving global cross-cultural communication. This survey presents an overview of the evolution of accent conversion methodologies, analyzing how the field has developed in response to fundamental challenges related to data alignment, representation disentanglement, and resource scarcity. We trace the progression from early rule-based digital signal processing approaches such as spectral manipulation and formant-based analysis to modern neural architectures capable of flexible and reference-free accent transformation. In addition, the survey situates accent conversion within its linguistic foundations and examines how different application requirements impose varying constraints on the balance between accent modification and speaker identity preservation. Finally, it reviews commonly used speech datasets and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
