Assessment of the Effectiveness of a Visual Coaching Device Combined with Auditory Instructions on Reproducibility and Stability in Deep Inspiration Breath-Hold Radiotherapy
Yoon Young Jo, Eunseo Bae, Ji Woon Yea, Jae Won Park, Jaehyeon Park, Se An Oh, Hyunyeol Lee

TL;DR
This study shows that combining visual and auditory coaching improves the consistency of breath-hold techniques in breast cancer radiotherapy, leading to better treatment precision.
Contribution
The study quantitatively evaluates the combined effect of visual and auditory coaching on breath-hold reproducibility and stability in radiotherapy.
Findings
Combined visual and auditory coaching reduced breath-hold variability with mean reproducibility of 0.51 mm and stability of 0.77 mm.
Reproducibility and stability improved progressively over treatment sessions, indicating a learning effect.
Poorer reproducibility correlated with greater instability (R2 = 0.26, p = 0.021).
Abstract
Background/Objectives: In left-sided breast cancer radiotherapy, deep inspiration breath-hold (DIBH) reduces radiation dose to the heart; however, its success depends on the patient’s ability to maintain reproducible and stable breath-holds. Although visual and auditory guidance systems have been introduced to improve performance, quantitative data on their combined effects remain limited. Methods: We retrospectively analyzed 400 breath-hold sessions in 20 patients undergoing DIBH treatment using synchronized visual and auditory coaching. Real-time respiratory displacement data were used to calculate reproducibility and stability. Temporal trends and correlations between reproducibility and stability were also assessed. Results: The mean reproducibility was 0.51 ± 0.22 mm (range, 0.20–1.10 mm), and the mean stability was 0.77 ± 0.21 mm (range, 0.50–1.16 mm). Both measures showed…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6- —2024 Yenungnam University Research Grant
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Radiotherapy Techniques · Advances in Oncology and Radiotherapy · Effects of Radiation Exposure
1. Introduction
Breast cancer remains one of the most common malignancies among women worldwide. Radiotherapy plays a crucial role in reducing local recurrence and improving survival rates; however, unintentional radiation exposure to the heart, particularly the left anterior descending (LAD) coronary artery, poses a significant long-term risk of radiation-induced heart disease (RIHD), particularly in patients with left-sided breast cancer [1,2,3,4].
A landmark study by Darby et al. demonstrated a linear relationship between the mean heart dose and the risk of major coronary events, with a 7.4% increase in risk per 1 Gy of heart exposure [5]. Therefore, techniques that minimize cardiac dose are essential for treating left-sided breast cancer.
Deep inspiration breath-hold (DIBH) is a widely adopted method for cardiac sparing that increases thoracic volume and displaces the heart away from the chest wall, reducing cardiac dose [6]. DIBH can significantly reduce the dose to the heart and lungs without compromising target coverage [7]. Multiple studies have confirmed that DIBH significantly reduces radiation exposure to cardiac structures [8,9]. A recent meta-analysis by Lu et al. [10] demonstrated mean heart dose reductions ranging from 26% to 75% with DIBH compared to those with free breathing. In addition, Falco et al. [11] highlighted the consistent heart-sparing benefits of DIBH across various breathing control techniques. Despite its clinical benefits, the effectiveness of DIBH largely depends on the patient’s ability to perform and maintain a consistent breath-hold throughout treatment [12].
The two key metrics used to assess the quality of DIBH are reproducibility (RPD), which refers to the ability to return to the same breath-hold position across sessions, and stability (STB), which refers to the ability to maintain a steady position during a single breath-hold. Inconsistencies can lead to geometric uncertainties or compromised treatment precision. Various patient guidance systems have been introduced to enhance DIBH performance. Visual coaching, which utilizes display monitors, and auditory feedback, such as voice cues or beeping sounds, helps patients achieve consistent breath-hold levels [7,13]. However, despite the increasing use of multimodal guidance, quantitative data validating its effectiveness in improving RPD and STB are limited.
In this study, we evaluated the breath-holding performance of 20 patients with left-sided breast cancer who underwent DIBH under visual and auditory guidance. By analyzing session-wise breath-hold data during treatment, we sought to quantify RPD and STB, assess temporal trends, and determine the impact of the multimodal guidance system. We hypothesized that synchronized audiovisual coaching would help patients achieve reproducible and stable breath-holds during radiotherapy.
2. Materials and Methods
Between January 2024 and March 2024, 20 patients with left-sided breast cancer who received radiotherapy using the DIBH technique at Yeungnam University Medical Center were recruited. Inclusion criteria encompassed patients with pathologic stage I–IIa breast cancer who underwent breast-conserving surgery and were able to perform three consecutive, stable 15–20 s DIBHs at simulation with audiovisual guidance. Exclusion criteria were patients unable to meet the above requirement. The median age of the patients was 52 years (range, 42–74). Radiotherapy was delivered using IMRT with a simultaneous integrated boost technique at a dose of 5200/4800 cGy in 20 fractions. Target contours were delineated according to the institutional protocol for whole-breast irradiation. All patients were guided during DIBH using a combination of visual coaching (Varian RGSC system, version 2.0, Varian Medical Systems, Palo Alto, CA, USA, with monitor display) and auditory cues (in-house tablet-based voice instruction system) to maintain a consistent breath-hold level (Figure 1). Each patient completed 20 treatment fractions, resulting in 400 breath-holding sessions for analysis.
2.1. Data Acquisition and Processing
Real-time respiratory signals were recorded using an in-room monitoring system capable of measuring vertical displacement during breath-holding. RPD and STB are expressed in millimeters, corresponding to the units used for the patient setup. For a series of DIBHs, RPD was defined as the maximum difference between the different DIBH levels; the more varied the DIBH levels, the larger the value. STB was defined as the maximum amplitude change between the initial and end time points of a DIBH when fitted by a least-squares line. The data were extracted in a time-series format and processed using custom Python-based scripts to calculate breath-hold performance metrics.
2.2. Evaluation Metrics
To evaluate patient compliance with breath-holding instructions during radiation therapy, the following metrics were assessed: averaging RPD (Mean_RPD), averaging LVL (Mean_LVL), standard deviations of LVL across all fractions (STD_LVL), coefficients of variation in LVL (CV_LVL), averaging STB (Mean_STB), intra-patient stability representing patient’s overall breath-hold performance for the entire therapeutic period (Intra_STB), averaging VD (Mean_VD), standard deviations of VD across all fractions (STD_VD), and coefficients of variation in VD (CV_VD). To derive these scores, two base values, in millimeters, were first computed from the records of the patient’s abdominal positions during each breath-hold time (Figure 2A): mean level (d) and end-to-end deviation (v), estimated using linear regression as follows:
where m is the slope of the regression line, and ∆t is the duration of the breath-hold.
Figure 2B shows the stepwise process used to derive metrics. Averages of d and v across the breath-holds were calculated, resulting in D and V values for each field of radiation therapy. RPD and STB for each fraction were computed using the following equations [13]:
where N_f_ is the number of fields per fraction, and RPD and STB represent the fraction-level consistency of breath-holds. Lower values indicate better performance.
In addition, the averages of D and V across the fields were calculated, yielding LVL and VD, respectively, for each fraction. Finally, the nine measures were derived for each patient. Mean_RPD, Mean_LVL, Mean_STB, and Mean_VD were obtained by averaging RPD, LVL, STB, and VD, respectively, whereas STD_LVL and STD_VD were obtained from the standard deviations of LVL and VD across all fractions. CV_LVL and CV_VD were calculated by taking the ratio of the standard deviations to the averages of LVL and VD. Furthermore, the intrapatient STB (Intra-STB), representing the patient’s overall breath-hold performance across the entire therapeutic period, was derived as follows:
where N_F_ is the number of fractions.
2.3. Statistical Analysis
Descriptive statistics (mean, standard deviation, and range) were calculated for all parameters. Linear regression was used to evaluate temporal trends across treatment fractions and to assess the relationship between RPD and STB. Statistical significance was set at p < 0.05. All analyses were performed using Python (v3.11).
3. Results
3.1. Overview of RPD and STB Evaluation
Table S1 summarizes the nine performance evaluation scores (Mean_RPD, Mean_LVL, STD_LVL, CV_LVL, Mean_STB, Intra_STB, Mean_VD, STD_VD, and CV_VD) obtained for all 20 patients, along with their group averages.
3.2. RPD of Breath-Hold Level
The patient-wise distribution of RPD is shown in Figure 3A. Most patients maintained a consistent breath-hold height with minor RPD variability. Across the 20 patients, the mean RPD of breath-hold level was 0.51 ± 0.22 mm (0.20–1.1 mm). However, several patients demonstrated broader interquartile ranges and outliers, indicating fluctuations in breath-hold control.
Linear trend analysis revealed that average RPD improved over time, accompanied by a steady decrease in variation across treatment sessions (Figure 3B). This suggests that patient adaptation and increased consistency can be achieved through repeated DIBH training under multimodal guidance. Repeated-measures analysis of variance confirmed significant improvements in RPD over time (RPD: F(19,361) = 2.41, p = 0.0009).
3.3. STB of Breath-Hold Across Fractions
Figure 4A illustrates the STB distribution. Although most patients exhibited tight and consistent distributions, the mean interfraction STB (Mean_STB) across patients was 0.77 ± 0.21 mm (0.50–1.16 mm). Certain individuals demonstrated greater variability, indicating challenges in maintaining a stable breath-hold positioning across sessions.
Linear trend analysis of STB (Figure 4B) revealed a downward trend, confirming that STB progressively improved over time, consistent with increased patient familiarity and adherence to the protocol. Repeated-measures ANOVA confirmed significant improvements in STB across treatment fractions (STB: F(19,361) = 1.64, p = 0.044).
To evaluate the relationship between breath-hold RPD and STB, linear regression was performed (Figure 5). A significant positive correlation was observed between Mean_RPD and intrafraction STB (Intra_STB) (r = 0.51, R^2^ = 0.26; p = 0.021), suggesting that patients with poor RPD tend to exhibit reduced STB within breath-hold sessions. In contrast, Mean_RPD showed a borderline correlation with Mean_STB (r = 0.42, R^2^ = 0.17; p = 0.068), indicating a possible trend toward reduced interfractional consistency in patients with less reproducible breath-holds. After applying Bonferroni correction for two prespecified comparisons, the association between Mean_RPD and Intra_STB remained significant (adjusted p = 0.042), whereas the correlation between Mean_RPD and Mean_STB did not (adjusted p = 0.136).
3.4. Case-Specific Analysis of Breath-Hold Level
Figure 6 compares breath-hold amplitudes between patients with the least and most variation. Patient 5 exhibited highly stable breath-hold levels with minimal deviation, whereas Patient 10 showed large fluctuations over the course of treatment, highlighting interindividual differences in breath-hold control.
3.5. Image-Guided Radiotherapy Results of All Treatment Sessions
Daily image-guided radiotherapy using cone beam computed tomography (CT) was performed for all treatment fractions. The mean isocenter shifts in the vertical, longitudinal, lateral, pitch, roll, and rotation directions were minimal, with median values (standard deviation) of −0.23 cm (0.23), 0.06 cm (0.37), −0.03 cm (0.15), 0.15° (1.08), 0.05° (0.56), and 0.00° (0.58), respectively (Table S2). These results indicate that patient setup variations were small and that using DIBH during radiotherapy maintained isocenter STB throughout treatment.
4. Discussion
We retrospectively analyzed 20 patients with left-sided breast cancer undergoing radiotherapy to quantitatively evaluate the impact of combined visual coaching and auditory instruction on DIBH performance. RPD was defined as the maximum difference in the average breath-hold level across 20 treatment fractions, whereas STB was defined as the maximum amplitude change between the initial and end points of each DIBH session. Based on real-time respiratory signal data from 400 breath-hold sessions, we found that most patients maintained highly consistent DIBH performance, with a mean RPD of 0.51 mm. RPD and STB improved progressively over the course of treatment, and a significant correlation was observed between poor RPD and reduced intrafraction STB.
Previous studies have demonstrated the beneficial effects of visual feedback (VF) systems in enhancing DIBH RPD [13,14,15,16,17]. For instance, Yamauchi et al. [16] reported that patients using VF exhibited significantly smaller chest-wall displacement (0.59 ± 3.64 mm) compared to that with auditory-only coaching (2.09 ± 4.96 mm; p < 0.001). Similarly, Hoshina et al. [14] utilized a laser-based sensor with VF to measure the chest wall–heart distance during DIBH and found interquartile range values approximately two to three times smaller than those without VF, indicating markedly improved interfraction RPD. Moreover, early implementation of surface imaging systems combined with visual coaching, as described by Cervino et al. [13], demonstrated improvements in RPD and STB during patient setup.
Several studies have also investigated the role of audio-based coaching in improving respiratory control during radiotherapy [18,19,20,21,22,23]. George et al. [18] demonstrated that audio-only instruction improved RPD compared to that with free breathing, although audiovisual feedback achieved superior control. Similarly, Sano et al. [22] compared visual and auditory guidance in patients with thoracic and abdominal tumors and found both approaches equally effective, with most patients preferring auditory cues because of their comfort and ease of use. Yu et al. [23] confirmed that audio-only feedback provides respiratory regularity comparable to audiovisual methods in cases of abdominal tumors. However, Nangia et al. [21] reported that, although audio-only coaching effectively regulated breathing frequency, it was less reliable in reproducing the amplitude of deep inspiration during DIBH for breast cancer. Collectively, these findings suggest that auditory guidance is a practical and patient-friendly strategy, but its effectiveness may vary depending on tumor site and is best complemented with VF to maximize breath-hold performance. Table S3 summarizes representative studies on DIBH with visual and/or auditory guidance.
Regardless of the specific feedback modality, progressive improvements in RPD and STB over the course of treatment suggested a clear learning effect. In our study, patients did not receive structured pre-training before radiotherapy. Therefore, the observed improvements likely reflect learning acquired during treatment fractions. Based on this finding, we speculate that introducing a structured pre-training session at CT simulation could enable patients to achieve stable and reproducible DIBHs from the first treatment fraction. This highlights the potential value of structured patient education and practice before treatment initiation. Supporting this, Kefeli et al. [24] demonstrated that patients who received dedicated coaching and instructional materials prior to CT simulation exhibited significantly shorter setup durations and lower LAD maximum doses during radiotherapy than did those who received standard technician-led instruction only on the day of simulation (29.5 Gy vs. 36.5 Gy, p = 0.02). These results indicate that in-room feedback systems and pretreatment preparation play complementary roles in optimizing breath-hold performance and, ultimately, clinical outcomes.
This study has some limitations. First, its retrospective design limits causal inference and introduces potential selection bias. Second, the small sample size may reduce generalizability despite the large number of breath-hold sessions analyzed. Third, the absence of a control group without audiovisual guidance prevents definitive attribution of performance improvements to the coaching system. Additionally, breath-hold quality was assessed using external surface displacement, which may not accurately reflect internal organ motion. Furthermore, no dosimetric data were analyzed, and variability in individual coaching experience was not controlled, which may have influenced the RPD and STB outcomes. Finally, this study was conducted exclusively in patients with early-stage breast cancer. Therefore, the generalizability of our findings to advanced-stage breast cancer involving regional nodal irradiation, or to other thoracic and abdominal tumors, remains uncertain. Further studies are warranted to validate the applicability of combined audiovisual coaching in these broader clinical contexts.
Although our study quantitatively assessed the effect of combined visual and auditory coaching on DIBH performance, future research should integrate real-time surface-guided radiation therapy (SGRT) systems to enhance motion management and setup accuracy. The large-scale SAVE-HEART study [25] demonstrated significant dosimetric advantages of surface-based DIBH with audiovisual feedback, including marked reductions in cardiac and pulmonary doses and projected cardiovascular risk. However, that study did not directly quantify intra- and interfractional breath-hold RPD using SGRT data. Future investigations combining quantitative breath-hold metrics with SGRT-based motion tracking are warranted to optimize patient selection, refine coaching protocols, and minimize geometric uncertainties during DIBH radiotherapy.
5. Conclusions
Although our study did not include single-modality control groups, our findings demonstrate that synchronized visual and auditory guidance in DIBH improves breath-hold RPD and STB over time, despite individual variations. Overall, multimodal coaching systems appear effective in enhancing respiratory consistency during radiotherapy.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Cheng Y.J. Nie X.Y. Ji C.C. Lin X.X. Liu L.J. Chen X.M. Yao H. Wu S.H. Long--term cardiovascular risk after radiotherapy in women with breast cancer J. Am. Heart Assoc.20176 e 00563310.1161/JAHA.117.00563328529208 PMC 5524103 · doi ↗ · pubmed ↗
- 2Nielsen K.M. Offersen B.V. Nielsen H.M. Vaage-Nilsen M. Yusuf S.W. Short and long term radiation induced cardiovascular disease in patients with cancer Clin. Cardiol.20174025526110.1002/clc.2263428139844 PMC 6589645 · doi ↗ · pubmed ↗
- 3van den Bogaard V.A.B. Spoor D.S. van der Schaaf A. van Dijk L.V. Schuit E. Sijtsema N.M. Langendijk J.A. Maduro J.H. Crijns A.P.G. The importance of radiation dose to the atherosclerotic plaque in the left anterior descending coronary artery for radiation-induced cardiac toxicity of breast cancer patients?Int. J. Radiat. Oncol. Biol. Phys.20211101350135910.1016/j.ijrobp.2021.03.00433713741 · doi ↗ · pubmed ↗
- 4Zureick A.H. Grzywacz V.P. Almahariq M.F. Silverman B.R. Vayntraub A. Chen P.Y. Gustafson G.S. Jawad M.S. Dilworth J.T. Dose to the left anterior descending artery correlates with cardiac events after irradiation for breast cancer Int. J. Radiat. Oncol. Biol. Phys.202211413013910.1016/j.ijrobp.2022.04.01935483540 · doi ↗ · pubmed ↗
- 5Darby S.C. Ewertz M. Mc Gale P. Bennet A.M. Blom-Goldman U. Brønnum D. Correa C. Cutter D. Gagliardi G. Gigante B. Risk of ischemic heart disease in women after radiotherapy for breast cancer N. Engl. J. Med.201336898799810.1056/NEJ Moa 120982523484825 · doi ↗ · pubmed ↗
- 6Remouchamps V.M. Letts N. Vicini F.A. Sharpe M.B. Kestin L.L. Chen P.Y. Martinez A.A. Wong J.W. Initial clinical experience with moderate deep-inspiration breath hold using an active breathing control device in the treatment of patients with left-sided breast cancer using external beam radiation therapy Int. J. Radiat. Oncol. Biol. Phys.20035670471510.1016/S 0360-3016(03)00010-512788176 · doi ↗ · pubmed ↗
- 7Vikström J. Hjelstuen M.H. Mjaaland I. Dybvik K.I. Cardiac and pulmonary dose reduction for tangentially irradiated breast cancer, utilizing deep inspiration breath-hold with audio-visual guidance, without compromising target coverage Acta Oncol.201150425010.3109/0284186 X.2010.51292320843181 · doi ↗ · pubmed ↗
- 8Hayden A.J. Rains M. Tiver K. Deep inspiration breath hold technique reduces heart dose from radiotherapy for left--sided breast cancer J. Med. Imag. Radiat. Oncol.20125646447210.1111/j.1754-9485.2012.02405.x 22883657 · doi ↗ · pubmed ↗
