# Individual differences in musical melody perception moderate the speech-to-song illusion in Mandarin Chinese listeners

**Authors:** Tamara V. Rathcke, Massimiliano Canzi

PMC · DOI: 10.1038/s41598-026-44268-z · 2026-03-23

## TL;DR

This study shows how Mandarin speakers experience a speech-to-song illusion differently, possibly due to their language's tonal nature affecting how they perceive melody in speech.

## Contribution

The study reveals that Mandarin Chinese listeners' weaker melody perception abilities facilitate the speech-to-song illusion, suggesting a link to perceptual pitch distortion.

## Key findings

- Mandarin listeners showed a modest speech-to-song illusion effect, independent of sentence acoustics.
- Weaker melody perception abilities were associated with stronger speech-to-song illusion effects.
- Linguistic background influences how speech is perceived as song, linking language experience to music cognition.

## Abstract

Repeated exposure to a spoken phrase can give rise to the perception of the speech-to-song illusion (STS), whereby speech gains musical qualities and begins to sound like singing. STS is known to rely on acoustic cues and may depend on an individual’s ability to extract musical qualities (such as melody and rhythm) from speech acoustics. So far, most research has examined listeners of non-tonal languages, with preliminary evidence indicating that tonal-language listeners experience STS differently, if at all. This study investigated STS in Mandarin Chinese listeners who rated song-likeness of Mandarin sentences before and after repetition and completed the Musical Ear Test. Test sentences were designed to promote the acoustic transmission of either melody or rhythm. Results demonstrated a modest STS effect in Mandarin listeners at the group level, which was independent of sentence acoustics. Individual abilities in rhythm perception had no impact on STS while, somewhat surprisingly, weaker melody perception abilities were found to facilitate STS. This suggests that STS in Mandarin Chinese may be linked to a perceptual distortion of pitch. Overall, the findings indicate that STS mechanisms are shaped by linguistic background of listeners and provide new evidence that language experience can influence music perception and cognition.

## Full-text entities

- **Genes:** STS (steroid sulfatase) [NCBI Gene 412] {aka ARSC, ARSC1, ASC, ES, SSDD, XLI}, SLTM (SAFB like transcription modulator) [NCBI Gene 79811] {aka Met}
- **Diseases:** speech or language impairments (MESH:D001072), amusia (MESH:C566019)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13035821/full.md

---
Source: https://tomesphere.com/paper/PMC13035821