Listeners’ baseline autonomic states associated with distinct music-physiology response patterns
Mateusz Soliński, Vanessa Pope, Pier Lambiase, Elaine Chew

TL;DR
The study shows how people's baseline autonomic states influence their physiological responses to music, suggesting ways to personalize music for therapeutic effects.
Contribution
A new framework and visualization method (change point connectivity graphs) link music features to autonomic responses based on baseline physiology.
Findings
Baseline parasympathetic tone correlates with increased vagal engagement in response to novel melodies and acoustic intensity.
Sympathetic-dominant individuals show amplified sympathetic responses to music features.
Novelty in music is the most consistent driver of autonomic change across listeners.
Abstract
Personalizing music-based interventions requires an understanding of how individuals’ physiology responds to expressive music events. Although music can modulate autonomic function, variability across listeners has limited its therapeutic use. We introduce a change-point-driven framework that links music features to cardiovascular responses and test whether baseline autonomic balance is associated with group- and individual-level response patterns. Physiological signals from 112 participants (63 females, 21–79 years, including 25 with elevated resting blood pressure) collected whilst they listened to nine expressive versions of eight pieces of Western classical music were analysed using canonical correlation analysis. Listeners were stratified by their baseline sympathetic–parasympathetic balance. The first two canonical variates identified the dominant links between music and…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5- —European Research Council10.13039/501100000781
- —European Union’s Horizon 2020
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHeart Rate Variability and Autonomic Control · Neuroscience and Music Perception · Music Therapy and Health
Introduction
An essential aspect of music listening is the experience of changes in acoustic properties, often described as musical prosody or expressivity. Variations in tempo, loudness, or articulation shape listeners’ perceptual experience and are central to explaining physiological responses to music. As Debussy remarked, ‘music is the space between the notes’: beyond sectional form, it is the expressive shaping of musical structure that defines listening. Such expressive dynamics may also underpin how music influences the autonomic nervous system (ANS), with implications for cardiovascular health.
Music has been explored as a non-invasive tool for stress reduction, rehabilitation, and cardiovascular therapy. Although evidence suggests that music can modulate ANS activity and support cardiovascular disease (CVD) management, clinical findings remain inconsistent. Many studies rely on whole-piece comparisons to silence or other pieces,^1–6^ sometimes leading to overgeneralized conclusions like ‘classical music is relaxing’.^4,7^ These approaches obscure fine-grained, event-based physiological reactions and limit the potential for personalized music-based interventions.
Recent advances in music information retrieval (MIR) enable segmentation of musical signals into change points—transitions reflecting shifts in the distributional properties of acoustic features.^8^ Change point detection has been applied to tonal context,^9^ dynamics,^10^ and MIDI features such as pitch and velocity,^11^ and listeners can also annotate structural boundaries in music.^12^ Beyond music, change point methods are widely used across disciplines, including physiology, where they have been applied to skin conductance, RR intervals, and movement data^13–19^ Studies linking musical boundaries to physiological responses indicate that crescendos, tension–resolution patterns, and melodic transitions predict changes in respiration, heart rate, and skin conductance^18,20–22^ However, a generalizable framework integrating computational and physiological change points in the context of cardiovascular responses is still lacking.
To address this gap, we developed an event-based framework linking expressive musical changes to physiological dynamics using canonical correlation analysis (CCA). CCA, widely used in biomedical research to relate multivariate datasets such as brain networks and clinical symptoms,^23^ identifies shared structure between two sets of variables. Here, we adapt this approach to music–physiology data and introduce change point connectivity graphs to visualize how musical and physiological features jointly contribute to listeners’ responses.
Crucially, we examined these associations in the context of baseline autonomic balance. Baseline autonomic activity shapes stress reactivity^24,25^ and may similarly influence responses to expressive musical changes, yet this has been little studied. Engagement with non-tranquilizing music can act as a mild stressor, increasing heart rate and blood pressure,^4,26,27^ with responses differing between individuals. We therefore stratified participants by parasympathetic–sympathetic balance to test whether baseline autonomic state modulates music–physiology coupling and further explored response phenotypes using cluster-based analyses.
Overall, this study integrates computational change point detection, expert musical annotation, and multivariate analysis to characterize fine-grained music–physiology interactions. Understanding these dynamics is a key step towards developing personalized music-based interventions that promote relaxation or controlled arousal, advancing the field of music theranostics: music-based digital therapeutics and precision diagnostics.
Methods
Data collection
The methodology and data processing for CCA is shown in Figure 1. Data were drawn from the HeartFM study, in which participants listened to nine expressive performances of eight Western classical piano pieces rendered on a reproducing piano. Each piece was presented in its original or altered form (e.g. louder, faster, or louder + faster), with the order and versions randomly selected; the first and last performances were different versions of the same piece. Altered versions were designed to isolate the effects of tempo and/or loudness while preserving other musical features. The repertoire was selected to span a wide range of musical structures and expressive styles and was sourced from Bösendorfer’s Legendary Artist Performance database (as MIDI files), including works by Sergei Prokofiev and Claude Debussy (it was possible by restoring their performances from the piano rolls.^28^) Details of the pieces, performers, and versions are provided in Supplementary data online, Table S1 (see Supplementary Materials).
Data preparation for CCA. (A) Eight music pieces performed by famous pianists (including Claude Debussy, shown in the figure), reconstructed as MIDI files. Versions (V2-V4) of the original (V1) with altered loudness and/or tempo were prepared and rendered on a reproducing piano. The MIDI files, music scores, and audio signals from the rendered performances were sources of the musical features; (B) The study protocol included a questionnaire on demographic information and music sophistication, a 5-minute baseline, and the sequence of music pieces and versions played in random order; (C) Data collection from listeners employed three monitoring devices: ECG + HR chest belt, respiratory sensor, and continuous BP monitor (finger cuff). The signals from these sensors were used for the extraction of physiological variables; (D) Each change point for a given variable (musical and physiological) generates a Gaussian kernel function, the sum of which forms a time series; (E) The values at the musical change point onsets, taken from Gaussian time series from musical variables and averaged times series (over participants who listened to a given piece), create input matrices for calculating canonical variates and creating change points connectivity graphs.
Prior to data collection, participants provided written informed consent, completed a questionnaire assessing musical sophistication (reduced Gold-MSI,^29^) and supplied demographic information. After a 5-min seated baseline recording, participants listened to the music rendered through the reproducing piano at a distance of approximately 3 m. Short silent intervals separated successive pieces, during which participants briefly shared impressions. Post-session feedback was collected but not analysed quantitatively; it served to monitor attention and engagement.
Inclusion criteria required participants to be ≥18 years old, have no hearing impairment, and be able to provide informed consent. The study conformed to the Declaration of Helsinki and was approved by the Oxford C Research Ethics Committee (IRAS 242471) and the Research Ethics Office at King’s College London (MRPP-22/23-34904).
Physiological data
ECG signal and RR intervals were acquired by a Polar H10 (Polar Electro Oy, Kempele, Finland) sensor centred on the sternum using a textile electrode strap. Respiratory signals were registered using a BIOPAC (BIOPAC Systems, Goleta, USA) respiration belt placed slightly below the ECG strap. The ECG and respiration data were collected concurrently and visualized using the HeartFM mobile and desktop visualization apps.^30^ Continuous blood pressure (BP) waveforms were collected using the finger cuffs of the CNAP monitor (CNSystems, Graz, Austria). A systolic (SYS), diastolic (DIA) BP, and pulse pressure (PP = SYS—DIA) were automatically extracted from each pulse wave.
Music data
All music rendered was recorded on a Zoom H5 (Zoom, Tokyo, JP) handheld recorder placed in the same position as the listeners’ chair. The recordings were aligned with the MIDI signals and used for calculating music-related signals: loudness (in sones), tempo (based on the onsets of beats), harmonic tension diameter, which is the longest distance between the pitch positions in the spiral array space at every beat,^31–33^ spectral centroid (the ‘balance point’ of the spectrum, a centroid of the frequencies in the music,^34^) and mel-frequency cepstral coefficients (MFCC, a measure of timbre.^35^)
Data processing
All collected signals were resampled to 5 Hz. RR intervals were manually inspected for ectopic beats, which were removed, and subsequently filtered using a high-pass 4th order Butterworth filter with a cut-off frequency of 0.03 Hz before calculating the HRV spectral parameters, such as instantaneous peak frequencies , and the instantaneous powers , of low (LF) and (high) HF frequency bands in a spectrogram obtained using the short-time Fourier transform [STFT using 50-sample Hamming windows and Python stft() function with default configuration from the scipy.signal library]. Power in the HF band corresponds to heart rate (or RR interval) oscillations predominantly driven by parasympathetic (vagal) activity, particularly through respiratory sinus arrhythmia (RSA). The quantifies the variance (energy) of RR interval fluctuations, while the indicates the dominant oscillatory frequency in that band. The LF band captures slower RR interval oscillations, influenced by both sympathetic and parasympathetic input, but primarily reflecting baroreflex-mediated autonomic regulation. An increase/decrease in the power spectra ( and ) expresses the shifts in the autonomic activity in time. The peak frequency values ( and ) are linked with the respiration patterns and they should be interpreted together with the respiratory interval series. For example, an increase in with a decrease in respiratory intervals is expected and reflects faster respiration, shifting the RSA to a higher frequency. In turn, a decrease in might reflect a shift in RSA from the LF to the HF band due to a musical change point.^36^ While the LF component was defined by a standard range of frequencies (0.04–0.15 Hz), the range for the HF component was made time-varying (instead of a fixed standard range of 0.15–0.4 Hz) and dependent on the respiratory frequency (based on the respiratory intervals extracted from the respiratory signal), Hz. This methodology was widely described in,^37^ where it was found to improve the estimation of RSA in continuous time. Examples of the spectrograms are shown in the Supplementary Materials (Supplementary data online, Figure S5).
The respiratory intervals (resp. intervals) were detected automatically from detrended and filtered signals measured by the respiration belt using a low-pass 4th order Butterworth filter with 1 Hz cut-off frequency; all intervals were checked and corrected manually.
The baseline RR intervals were also used to calculate the balance between the parasympathetic (PNS) and sympathetic (SNS) nervous systems prior to listening to music. PNS and SNS indexes were calculated from the RR interval series collected during baseline in the Kubios HRV Software.^38^ Each index uses three standardized HRV parameters associated with PNS (mean RR interval, RMSSD, and Poincaré Plot Index SD1 in Normalized Units) and SNS (mean HR, Baevsky’s Stress Index, and Poincaré Plot Index SD2 in Normalized Units) branches. The index values were used to cluster the participants into three groups (using k-means clustering), reflecting the three principal variants of baseline autonomic control: parasympathetic-dominant (high PNS and low SNS), balanced (comparable PNS and SNS), and sympathetic-dominant profiles (low PNS and high SNS indexes).
Change points and interpretation maps
Automatically detected change points
The automatic detection of the change points is carried out on continuous musical and physiological signals collected during the study.
We applied a unified change point detection framework across musical and physiological signals. For each signal, candidate change points were identified using two established algorithms: cpt.mean()^39^ for normally distributed inputs and cpt.np()^40^ for non-parametric distributions. To ensure robustness against algorithmic and parameter choices, change point detection was repeated across a broad set of parameter combinations, including penalty values, segmentation methods, normalization options, and sampling resolutions.
Consistency was defined operationally as the repeated detection of a change point at the same temporal location across multiple parameter configurations. Specifically, we constructed a probabilistic representation of change point occurrence over time and retained only those events whose detection frequency exceeded a predefined threshold, indicating stable support across parameterizations. This ensemble-based approach reduces sensitivity to any single configuration and favours reproducible structural transitions.
To prevent spurious clustering of closely spaced detections, a minimum spacing threshold was applied: change points occurring within a short temporal window were merged and represented by a single event. This constraint reflects the limited temporal resolution at which meaningful physiological and musical transitions can be distinguished and avoids over-segmentation of gradual changes. A schematic overview of the framework and validation examples are provided in the Supplementary Materials (Supplementary data online, Figures S11–S13).
Interpretation map (manually detected change points)
Two experts jointly annotated the onsets of compound musical structures while listening to the music under the same conditions as the participants. These annotations reflect expert interpretations of expressive and prosodic musical structures and constitute a novel set of categories, referred to as the Interpretation Map. It captures performed expressive elements likely to influence listeners’ physiological responses and has previously been applied to model musicians’ RR intervals using linear mixed models.^41^
In the present study, annotated events were grouped into 16 categories, including Novel Melody, Return, Build-up/Significant Crescendo, Climax, Resolve/Release, Melodic Interaction, Significant Melody, Significant Silence, Standout Articulation, Emphasis, Runs/Fast Sequence, Swell, Tension, Instability, Drop/Gentle, and Arrival (definitions of these categories were described in the Supplementary data online, Table S2). The onsets of these annotated events were combined with automatically detected musical change points and are collectively referred to as musical change points.
Musical and physiological change points were then transformed into time series using Gaussian kernel functions (GKF) centred at their onsets, separately for each piece and signal. The resulting GKF-based time series served as input to the canonical correlation analysis. A detailed description of the annotation-to-GKF procedure is provided in the Supplementary Materials.
Statistical analysis
Canonical correlation analysis
Canonical correlation analysis (CCA) is a multivariate statistical method used to quantify the relationship between two sets of variables. In this study, CCA was used to examine associations between musical structure and physiological dynamics, with musical change points serving as the fundamental observational units. Accordingly, each row of the input matrices corresponds to a single musical change point, defined as a consistent change detected in the musical feature time series.
Let denote the musical input matrix and , the physiological input matrix, where n is the number of musical change points and p and q are the numbers of musical (P = 12) and physiological (q = 9) variables, respectively. The matrices were constructed such that rows of X and Y are temporally aligned, representing paired musical–physiological events at each change point.
CCA identifies pairs of linear combinations of the variables in X and Y that are maximally correlated. Specifically, CCA estimates weight vectors u and v such that the correlation between the canonical variates and :
The first pair of canonical variates (U₁, V₁) captures the strongest shared association between the two datasets. Subsequent pairs (Uᵢ, Vᵢ) are derived under orthogonality constraints and describe progressively weaker, independent modes of covariation. In addition to the canonical correlation coefficients, relative covariance measures were computed to estimate the proportion of variability explained by each canonical mode.
Construction of input matrices
The musical input matrix X was not derived directly from continuous feature time series. Instead, change points detected in musical features were first converted into continuous representations by convolving each change point onset with a Gaussian kernel (GK). The values of these GK time series were then sampled at the onset of each musical change point to populate the rows of X.
The physiological input matrix Y was constructed analogously. Change points detected in physiological signals were transformed into GK time series, and values were sampled at the same musical change-point onsets. This procedure ensured that each row of X and Y represents a temporally aligned musical–physiological interaction defined at the level of change points rather than raw signal amplitudes. A detailed description of the change point detection and GK transformation procedures is provided in the Supplementary Materials. Because multiple participants listened to the same musical pieces, physiological responses were averaged across all listeners exposed to a given piece prior to CCA. This design choice enabled a unified multivariate analysis across the complete stimulus set while preserving alignment between musical structure and physiological dynamics. Consequently, the resulting CCA captures aggregated associations between musical and physiological variables at the piece level rather than individual-level predictive relationships.
Canonical loadings
Apart from the canonical weights, within-set correlations between each canonical variate and the original variables in X and Y were calculated; these are referred to as canonical loadings (structural coefficients), which are generally more interpretable than the weights.^42^ High absolute loading values indicate variables that contribute strongly to a given canonical variate. Interpretation focused on pairs of canonical variates with the highest and statistically significant canonical correlations, linking variables with the largest absolute loadings in and . The relative signs of the loadings indicate the direction of association between variables in the two datasets: opposite signs imply a negative association, whereas the same sign indicates a positive association.
A special case involves loadings associated with manually annotated musical change points (e.g. Novel Melody). Unlike continuous musical features such as loudness, these annotations represent compound structural events that cannot be described by a simple directional change. Their interpretation, therefore, relies on the sign of the associated physiological loading. For example, a negative loading for Novel Melody combined with a positive loading for respiratory interval indicates a decrease in respiratory intervals following the occurrence of a novel melody. In contrast, loadings of the same sign indicate an increase.
Statistical significance and selection of canonical variates were assessed using Wilks’ Lambda, surrogate-based testing, and permutation testing (see Supplementary Materials), and exploratory subject-wise CCA analyses were additionally performed as robustness and heterogeneity checks, without replacing the primary group-level multivariate analysis.
Change points connectivity graphs
We developed a new way to visualize the key effects observed in CCA, called change point connectivity graphs. These graphs display loading values from selected canonical variates as concentric black circles, with the outermost circle corresponding to the first variate. Individual loadings are shown as smaller red (positive) or blue (negative) circles, with musical variables on the left and physiological variables on the right. For clarity, only loadings with absolute values greater than 0.3 are shown. Arrows indicate the direction of association between musical and physiological variables based on the signs of their loadings, and the dominant effects for each variate are summarized below each graph.
Effect of age, gender, personal musical characteristics on music-physiology patterns
To examine whether individual characteristics modulate the multivariate music–physiology coupling identified at the group level, we analysed how individual response patterns relate to baseline physiological and demographic features. Specifically, CCA was performed separately for each participant (i.e. the size of the input matrices was determined by the number of musical change points in the pieces comprising a given listener’s playlist), yielding individual sets of canonical loadings.
Each individual loading set was compared with the loadings obtained for the entire study group using a cosine similarity measure, quantifying the degree of directional alignment between individual and group-level multivariate patterns. These similarity values were then analysed using generalized additive models (GAMs) as the dependent variable, with baseline physiological indices (SNS/PNS indices, systolic and diastolic blood pressure) and demographic characteristics (age, gender, musicianship level, and music preference) included as predictors.
GAMs have previously been applied in neuroimaging studies, such as by Xia et al.^43^ to examine age- and sex-related effects on multivariate patterns derived via CCA. Detailed model specification, and full statistical results are provided in the Supplementary Materials.
Differences analysis at musical change points
As an additional quantitative analysis, we calculated the differences in the mean values of physiological signals across 10-second windows before and after each musical change point. This approach provides a direct observation and quantitative insights into the instantaneous response of the listener to musical change points. The stability of the results was validated for 8-s and 12-s windows and presented in Supplementary Materials (see Supplementary data online, Figure S4).
The differences are reported as the mean value and 95% CI in the signals’ units. We used a paired t-test or Wilcoxon test (depending on the normality of the distribution of differences) to verify whether the differences between the windows before and after the change point are significant. We analysed each musical category from the Interpretation Map and the change points from acoustic signals separately, providing an increase (denoted as ) or decrease ( ) in the musical signal (17 categories in total). To reduce the risk of Type I errors, the Bonferroni correction was used to analyse each of the nine physiological signals.
Results
Participants
Data were obtained from 112 participants (63 females and 49 males) with a mean age of years (range 21–79 years). Based on the PNS and SNS normalized indexes calculated from baseline RR intervals, three clusters were created using the K-means clustering: cluster 1—with high parasympathetic index and low sympathetic index (n = 20), cluster 2—sympathovagal balance (n = 73), cluster 3—high sympathetic index and low parasympathetic index (n = 19; PNS and SNS indexes grouped in clusters were shown in Figure 3B). The demographic profiles between these clusters were compared in Supplementary data online, Figure S1 in the Supplementary Materials. Twenty-five participants (22%) had elevated baseline blood pressure at the baseline (systolic mmHg or diastolic mmHg). Twenty-three participants (21%) were assessed with a high level of musicianship.
Based on the questionnaire about music sophistication and experience, 55 participants (49%) had at least one year of formal musical training (14 participants for 0.5–1 years, 21 for 2–3 years, 11 for 4–6 years, and 12 for more than 7 years), 78 participants (70.0%) declared a regular, daily practice of a musical instrument (including voice) for at least one year (26 participants for 1–3 years, 27 participants for 3–9 years, and 25 participants for more than 10 years). Moreover, classical music was a favourite genre for 37 (33%) participants.
Change points annotations
The number of detected change points in all music pieces was 813: 520 were detected automatically in tempo (156), loudness (152), spectral centroid (79), MFCC (39), diameter signals (94), and the experts annotated manually 293 onsets (Interpretation Map): 71 of Significant Melody, 44 of Novel Melody, 32 of Return, 25 of Melodic Interaction, 25 of Standout Articulation, 19 of Run/Fast Sequence, and 19 of Resolve/Release. The following under-represented categories ( annotations) were not considered in the analysis: Build-up (13 annotations), Climax (11), Emphasis (10), Swell (6), Significant Silence (4), Tension (4), Instability (4), Arrival (3), Drop/Gentle (2), Fast Sequence Release (1). The number of change points in each piece is shown in Supplementary data online, Table S2 of the Supplementary Materials.
Aggregate patterns: canonical correlations for the whole study group
The first two canonical variates were selected for the final analysis using the multistep procedure described in the Supplementary Materials to ensure the robustness of the effects represented by the loading values. For the whole study group, canonical correlations for these variates equaled 0.56 and 0.36, respectively, both , Wilks’ lambda. They identified two distinct physiological response domains: respiratory dynamics (first variate) and cardiac control, as measured by RR intervals (second variate).
In the first variate, Novel Melody and Tempo showed the strongest associations, linked with shortened respiratory intervals and shifts in spectral power (decrease in , increase in ). These patterns reflect faster, shallower breathing and parasympathetic withdrawal in response to musical novelty and increases in intensity. In other words, expressive changes acted as acute arousal triggers.
The second variate was characterized by autonomic modulation of cardiac intervals, with high loadings for and vascular diameter. This suggests that beyond respiratory coupling, music also engages cardiovascular control systems, particularly through sympathetic influences on vascular tone and heart rhythm. In summary, we observe two primary pathways: respiratory entrainment and autonomic modulation of cardiac control. Rather than uniform ‘relaxation effects,’ music evokes specific, event-based adjustments that depend on the underlying musical features.
Differences in physiological signals before and after musical change points
When aggregated across the whole study group, several musical change-point categories showed significant physiological changes after Bonferroni correction, including Novel Melody, Spectral Centroid↑, Diameter↑/↓, Loudness↑, MFCC↑, and Tempo↑ (Figure 3; full numerical results are presented in Supplementary data online, Table S3 in the Supplementary Materials). Overall, these events produced response patterns consistent with sympathetic activation, often accompanied by changes in respiration.
Novel Melody elicited the strongest respiratory effects, with shortened respiratory intervals ( ), reduced ( ), and increased ( ). Loudness↑ and Tempo↑ had similar effects, decreasing respiratory and RR intervals while shifting HRV towards higher-frequency components, indicative of acute autonomic adjustment.
In contrast, MFCC↑ events were linked with broader cardiovascular changes, including increased diastolic blood pressure ( ) and elevated ( ). Diameter changes showed vascular-specific responses: Diameter↑ corresponded with shorter RR intervals, whereas Diameter↓ was associated with reduced pulse pressure.
These direct before–and–after comparisons reinforce the CCA-derived patterns, where Novel Melody, Tempo, and Loudness consistently loaded with respiratory and RR interval changes, while MFCC and vascular features contributed more strongly to blood pressure and pulse dynamics.
Subgroup patterns: canonical correlations in PNS-SNS clusters
In the exploratory within-cluster CCA, baseline autonomic balance was strongly linked to distinct patterns of response to musical structures (Figure 4). Listeners with high parasympathetic tone at baseline (Cluster 1, ) showed responses consistent with further parasympathetic activation (in the first variate): Novel Melody, Tempo, and spectral changes were associated with lengthened RR intervals and altered respiration intervals, reflecting relaxation-oriented dynamics. This suggests that expressive novelty in music can reinforce parasympathetic regulation in listeners already predisposed to relaxation.
By contrast, listeners with high sympathetic tone (Cluster 3, ) displayed the opposite profile. Increases in Loudness, Tempo, and related acoustic features were associated with shortened RR intervals and reduced (in the first variate), reflecting sympathetic activation. Importantly, Novel Melody was not a strong driver in this group, suggesting that intensity rather than structural novelty governs their responses. For such individuals, music may act as a mild stressor or controlled arousal stimulus, which could be either beneficial (e.g. for motivation) or potentially adverse in vulnerable patients.
The balanced group (Cluster 2), comprising the majority of participants ( ), displayed intermediate patterns that resembled the overall cohort: Novel Melody and Tempo had a strong influence on responses, with respiratory changes as the dominant physiological correlate. Note that the number of musical change points analyzed was comparable across clusters and to that of the full cohort; however, cluster-specific analyses differed in the number of participants contributing to the averaged physiological responses at each change point.
Individual patterns: subject-wise CCA and the effect of inter-subject differences on music-physiology patterns
The subject-wise exploratory CCA demonstrated substantial inter-individual heterogeneity in magnitude while showing non-random directional tendencies for several key couplings observed in the primary analysis (Figure 2), supporting the aggregated-level interpretation while explicitly acknowledging individual variability. The directional alignments were presented in Supplementary data online, Table S7 in the Supplementary Materials.
Results of CCA on the whole study group; (A) Canonical correlation coefficients for all variates for original and surrogate data. The first two variates selected for analysis were marked with a purple circle. The grey area includes variates that were lower than the maximum mean correlation coefficient found in the surrogate data; (B) Percentage covariance for each canonical variate; (C) Change point connectivity graph. The size and colour of the circles represent the value and sign of the loadings, respectively (red: positive value, blue: negative value). Only values > 0.3 were displayed for clarity. The key effects of each variable are listed below the plot.
The results of the analysis of the difference in physiological signals before and after musical change point; (A) A diagram depicting the direction of changes in physiological signals for each musical category. Two arrows mean that the difference was significant (after using Bonferroni correction, considering all 153 combinations of features, p<0.00033), while a single arrow indicates a trend (p<0.05); (B) Mean values of physiological signals before and after musical onsets, where a significant difference was found (two arrows in diagram A).
Results of CCA in three clusters based on the baseline PNS and SNS indexes; (A) Change point connectivity graphs. The size and colour of the circles represent the value and sign of the loadings, respectively (red-positive value, blue-negative value). Only values > 0.3 were displayed for clarity. The main effects of each variable are listed below the plot; (B) A scatter plot of the PNS and SNS indexes calculated for the baseline RR intervals for each participant. The clusters were created using the k-means method; (C) Canonical correlation coefficients for all variates for original and surrogate data. The variates selected for analysis were marked with a circle. The excluded area includes variates that were lower than the maximum mean correlation coefficient found in the surrogate data; (D) Percentage covariance for each canonical variate.
In the GAMs analysis, two variables showed nominally significant effects: baseline PNS index (P = 0.040, = ) and gender (P = 0.028, coefficient = ), with partial effects illustrated in Figure 5 and full results reported in the Supplementary Materials (Supplementary data online, Table S5, Supplementary data online, Figure S10).
Partial effects from generalized additive models (GAMs) illustrating the influence of gender (A) and baseline parasympathetic nervous system (PNS index; B) on the similarity between individual and group-level canonical loadings. Both variables were identified as significant predictors of similarity in the model.
Higher baseline PNS index was associated with lower cosine similarity to the group-level loading pattern, indicating greater individual deviation from the canonical profile. Female participants also exhibited lower similarity to the group-level pattern than males, suggesting more distinct physiological response profiles despite their larger representation in the cohort. However, neither effect remained significant after correction for multiple comparisons (Benjamini–Hochberg) and should therefore be interpreted as indicative trends rather than definitive findings.
Notably, the PNS-related trend is consistent with subgroup-specific CCA results, in which participants with high baseline PNS levels exhibited distinct associations between musical variables and RR interval change points, including RR interval increases during rising intensity and the emergence of novel melodies (Figure 4A).
Discussion
Our findings suggest that individuals’ baseline autonomic balance influence whether expressive musical changes promote parasympathetic relaxation or sympathetic arousal, providing a physiological basis for personalizing music-based cardiovascular interventions. Rather than producing a uniform ‘relaxing’ effect, music operates through discrete events—such as novel melodies, tempo shifts, and intensity changes—that function as micro-interventions shaping respiration and cardiac dynamics.
Across the cohort, novelty emerged as the most consistent driver of physiological change, typically shortening respiratory intervals and shifting spectral power towards increased arousal. These results extend earlier observations that crescendos and tension–resolution patterns synchronize with respiration and heart rate,^20,21^ demonstrating that autonomic engagement is driven not only by overall musical profile but also by the occurrence of novel events. Notably, most significant physiological differences were associated with increases in continuous musical features (e.g. loudness, tempo, spectral complexity), rather than decreases. This pattern suggests an overall engaging and arousing character of the expressive music stimuli and provides an important physiological context for interpreting the multivariate associations identified by CCA.
Importantly, responses diverged according to baseline physiology. Listeners with high parasympathetic tone showed lengthened RR intervals and parasympathetic reinforcement in response to novel melodies and tempo changes, whereas those with sympathetic dominance exhibited shortened RR intervals and reduced during increases in loudness and in spectral features. The balanced group showed intermediate responses. Together, these findings indicate that the direction and magnitude of music-induced autonomic modulation depend on the listener’s baseline state, which may be a marker of their autonomic tendency.
This baseline dependence has potential clinical implications. For parasympathetic-dominant individuals, music rich in novelty may further promote relaxation, whereas for sympathetic-dominant individuals similar stimuli may elicit arousal, which could be beneficial in motivational or rehabilitative contexts but less desirable in vulnerable patients. In both cases, musical change points act as targeted stimuli that modulate autonomic state, highlighting their potential as building blocks for personalized therapeutic playlists.
The general arousal observed across the cohort is consistent with eustress—a form of positive stress. Expressive musical events can elicit stress-like physiological patterns while being experienced as positive, aligning with evidence that eustress reduces cortisol and salivary alpha-amylase.^44,45^ This distinction helps reconcile inconsistencies in the literature: while tranquillizing music can induce group-level relaxation,^7,46,47^ expressive classical music resists simple categorization,^3^ underscoring the context-dependent and individualized nature of music–physiology coupling.
Exploratory analyses using GAMs suggested possible gender-related differences and reinforced the role of baseline parasympathetic tone; however, these effects should be interpreted cautiously, as they did not survive correction for multiple comparisons.
This study has several limitations. CCA was performed on piece-level–averaged physiological responses and therefore captures group-level coupling rather than subject-level prediction. Although the number of observations entering CCA was comparable across clusters, cluster-specific analyses relied on responses averaged over fewer participants and are therefore exploratory. Extending this framework to subject-level modeling will require datasets with complete stimulus overlap or explicitly hierarchical designs. Interpretation Maps were based on expert annotations, which may vary from individual perception. The cohort comprised healthy British adults exposed to Western classical music; generalizability to other cultures, genres, and clinical populations (e.g. hypertension or autonomic dysfunction) warrants further study.
Despite these caveats, the study demonstrates that musical change points are powerful drivers of physiological responses and that baseline autonomic state may be an indicator of whether these responses promote relaxation or arousal. This baseline-informed, event-based framework moves beyond generic claims about music’s ‘relaxing’ effect and provides a path towards precision music therapeutics. By aligning musical features with individual autonomic profiles, it may be possible to design personalized, non-invasive interventions not only for cardiovascular prevention, rehabilitation, stress management but also for cardiovascular imaging and practice. For example, autonomic response phenotypes derived from music–physiology coupling could be used to standardize or monitor autonomic state prior to cardiovascular imaging, where sympathetic activation is known to affect heart rate, vascular tone, and image quality. In this context, music-based modulation could serve as a non-pharmacological tool to reduce autonomic variability before imaging acquisition. In addition, music–physiology response patterns could be integrated with imaging-derived biomarkers (e.g. ventricular function, vascular stiffness, or myocardial strain) in rehabilitation or risk stratification, providing complementary information on autonomic regulation. While these applications remain exploratory, they illustrate how change-point-based music–physiology profiling could support future precision approaches in cardiovascular diagnostics and rehabilitation.
Conclusions
Our findings show that baseline autonomic state is an important factor associated with whether expressive musical changes evoke increased parasympathetic activity or sympathetic arousal. This principle provides a physiological foundation for tailoring music as a precision intervention in cardiovascular care. We demonstrate that music acts through discrete events—novel melodies, tempo shifts, changes in intensity—that function as micro-interventions modulating respiration and cardiac dynamics.
Future randomized clinical trials should test whether matching music interventions to patients’ baseline autonomic profiles improves outcomes in cardiovascular prevention, rehabilitation, and stress management. By integrating event-based analysis with baseline physiology, this study provides a step towards clinically actionable, non-pharmacological music therapeutics.
Supplementary Material
qyag013_Supplementary_Data
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Bernardi L, Porta C, Sleight P. Cardiovascular, cerebrovascular, and respiratory changes induced by different types of music in musicians and non-musicians: the importance of silence. Heart 2005;92:445–52.16199412 10.1136/hrt.2005.064600 PMC 1860846 · doi ↗ · pubmed ↗
- 2Loomba RS, Arora R, Shah PH, Chandrasekar S, Molnar J. Effects of music on systolic blood pressure, diastolic blood pressure, and heart rate: a meta-analysis. Indian Heart J 2012;64:309–13.22664817 10.1016/S 0019-4832(12)60094-7PMC 3860955 · doi ↗ · pubmed ↗
- 3Trappe H-J . Music and medicine: the effects of music on the human being. Appl Cardiopulm Pathophysiol 2012;16:133–42.
- 4Koelsch S, Jäncke L. Music and the heart. Eur Heart J 2015;36:3043–9.26354957 10.1093/eurheartj/ehv 430 · doi ↗ · pubmed ↗
- 5Mojtabavi H, Saghazadeh A, Valenti VE, Rezaei N. Can music influence cardiac autonomic system? A systematic review and narrative synthesis to evaluate its impact on heart rate variability. Complement Ther Clin Pract 2020;39:101162.32379689 10.1016/j.ctcp.2020.101162 · doi ↗ · pubmed ↗
- 6Kulinski J, Ofori EK, Visotcky A, Smith A, Sparapani R, Fleg JL. Effects of music on the cardiovascular system. Trends Cardiovasc Med 2022;32:390–8.34237410 10.1016/j.tcm.2021.06.004PMC 8727633 · doi ↗ · pubmed ↗
- 7Ellis RJ, Thayer JF. Music and autonomic nervous system (dys)function. Music Percept 2010;27:317–26.21197136 10.1525/mp.2010.27.4.317PMC 3011183 · doi ↗ · pubmed ↗
- 8Killick R, Fearnhead P, Eckley IA. Optimal detection of changepoints with a linear computational cost. J Am Stat Assoc 2012;107:1590–8.
