From blueprint to practice: an innovative dynamic 9-box grid improves exercise participation and aerobic capacity in graduate students
Lei Chen, Aichun Li, Wenhao Chen, Hongzhuang Wang, Junlai Zhou, Mingyun Sun

TL;DR
A dynamic 9-box grid model with weekly feedback improves aerobic capacity and exercise participation in graduate students more effectively than monthly feedback.
Contribution
The study introduces a novel 9-box grid model with dual-axis feedback to enhance exercise behavior and aerobic capacity in graduate students.
Findings
Weekly feedback led to greater and sustained improvements in aerobic capacity and total exercise volume compared to monthly feedback.
Weekly feedback significantly increased self-efficacy for exercise and shifted participants toward healthier grid zones.
Both feedback groups improved exercise identity, but only the weekly group showed consistent behavior modification over time.
Abstract
Graduate students face considerable health risks due to sedentary behavior and academic stress, often demonstrating a marked intention–behavior gap in physical activity (PA). This study developed a two-dimensional 9-box grid model based on total exercise volume (TEV) and aerobic capacity (AC) to compare the efficacy of weekly versus monthly feedback in improving AC and PA participation among graduate students. A quasi-experimental trial was conducted over eight weeks. Thirty-two graduate students, stratified by baseline AC evaluated in a 12-minute run test, were randomized into two arms, weekly feedback (n = 16) and monthly feedback (n = 16). Feedback was delivered through a dynamic 9-box grid that integrated weekly TEV on the x-axis and periodic AC on the y-axis. The primary outcomes were changes in the 12-minute run performance and TEV. Secondary outcomes included scores from the…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6| Characteristic | Total sample ( | Weekly feedback group ( | Monthly feedback group ( | |
|---|---|---|---|---|
| Age (years) | 23.63 ± 1.58 | 23.73 ± 1.79 | 23.53 ± 1.41 | 0.74 |
| Sex, | 1.00 | |||
| Male | 22 (73.3) | 12 (75) | 12 (75) | |
| Female | 8 (26.7) | 4 (25) | 4 (25) | |
| BMI (kg/m2) | 22.49 ± 2.74 | 21.67 ± 2.03 | 23.31 ± 3.15 | 0.10 |
| Education level, | ||||
| Master’s student | 30 (100) | 15 (100) | 15 (100) | |
| Field of study, | ||||
| Humanities/social sciences | 30 (100) | 15 (100) | 15 (100) | |
| Chronic disease history, | 0 (0) | 0 (0) | 0 (0) | |
| Exercise contraindications, | 0 (0) | 0 (0) | 0 (0) | |
| Outcome | Time point | Weekly feedback group ( | Monthly feedback group ( | Time | Group | Time × group | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
| ||||
| 12-min run distance (m) | Baseline | 2139.33 ± 270.22 | 2168.00 ± 226.15 | 41.45 | <0.001*** | 0.754 | 0.64 | 0.429 | 0.022 | 4.29 | 0.024* | 0.241 |
| Week 4 | 2369.07 ± 222.96 | 2246.93 ± 209.96 | ||||||||||
| Week 8 | 2497.33 ± 189.49 | 2417.33 ± 160.20 | ||||||||||
| 12-min run score | Baseline | 42.67 ± 28.09 | 47.00 ± 24.19 | 37.64 | <0.001*** | 0.736 | 0.47 | 0.50 | 0.016 | 6.49 | 0.005** | 0.325 |
| Week 4 | 65.67 ± 19.07 | 52.33 ± 25.06 | ||||||||||
| Week 8 | 77.00 ± 12.93 | 71.00 ± 18.54 | ||||||||||
| Variable | Group | Comparison | Mean change | SD of difference | Hedges’ | ||
|---|---|---|---|---|---|---|---|
| 12-min run distance (m) | Weekly feedback | Baseline vs. mid-test | 229.73 | 151.29 | 6.28 | <0.001*** | 1.58 (0.81, 2.32) |
| Baseline vs. post-test | 358.00 | 249.78 | 7.10 | <0.001*** | 1.78 (0.95, 2.59) | ||
| Mid-test vs. post-test | 128.27 | 182.99 | 5.03 | <0.001*** | 1.26 (0.57, 1.93) | ||
| Monthly feedback | Baseline vs. mid-test | 78.93 | 148.66 | 2.06 | 0.059 | 0.52 (−0.02, 1.04) | |
| Baseline vs. post-test | 249.33 | 170.02 | 5.68 | <0.001*** | 1.43 (0.70, 2.13) | ||
| Mid-test vs. post-test | 170.40 | 131.75 | 5.01 | <0.001*** | 1.26 (0.57, 1.92) | ||
| 12-min run score | Weekly feedback | Baseline vs. mid-test | 23.00 | 14.37 | 6.20 | <0.001*** | 1.56 (0.79, 2.30) |
| Baseline vs. post-test | 34.33 | 20.95 | 6.35 | <0.001*** | 1.60 (0.82, 2.35) | ||
| Mid-test vs. post-test | 11.33 | 9.54 | 4.60 | <0.001*** | 1.16 (0.50, 1.79) | ||
| Monthly feedback | Baseline vs. mid-test | 5.33 | 14.33 | 1.44 | 0.171 | 0.36 (−0.15, 0.86) | |
| Baseline vs. post-test | 24.00 | 15.38 | 6.05 | <0.001*** | 1.52 (0.76, 2.25) | ||
| Mid-test vs. post-test | 18.67 | 14.57 | 4.96 | <0.001*** | 1.25 (0.56, 1.91) |
| Variable | Group | Pre-test | Post-test | Within-group comparison | Between-group comparison (post-test) | ||
|---|---|---|---|---|---|---|---|
| Hedges’ | Hedges’ | ||||||
| EIS score | Weekly feedback ( | 41.13 ± 8.98 | 55.67 ± 7.12 | 2.21 (4.66, 2.00) | −0.16 (−0.86, 0.54) | ||
| Monthly feedback ( | 43.47 ± 7.15 | 56.73 ± 5.86 | 2.63 (3.70, 1.54) | ||||
| SEE-C score | Weekly feedback ( | 48.00 ± 11.86 | 73.80 ± 8.49 | 3.32 (3.15, 1.25) | 0.77 (0.39, 1.49) | ||
| Monthly feedback ( | 48.33 ± 15.85 | 59.93 ± 23.29 | 0.57 (1.10, 0.03) | ||||
| Time point | Weekly feedback group ( | Monthly feedback group ( | Hedges’ | ||
|---|---|---|---|---|---|
| Baseline | 50.67 ± 31.95 | 47.47 ± 34.17 | 0.27 | 0.793 | 0.09 (−0.60, 0.79) |
| Week 1 | 64.27 ± 29.63 | 57.47 ± 28.47 | 0.64 | 0.527 | 0.23 (−0.47, 0.93) |
|
|
|
|
|
| |
|
|
|
|
|
| |
| Week 4 | 67.20 ± 16.37 | 54.47 ± 19.23 | 1.95 | 0.061 | 0.69 (−0.31, 1.41) |
|
|
|
|
|
| |
| Week 6 | 76.80 ± 14.36 | 66.67 ± 17.93 | 1.71 | 0.099 | 0.61 (−0.11, 1.32) |
|
|
|
|
|
| |
|
|
|
|
|
| |
| 6.059 | 4.017 | – | – | – | |
| – | – | – | |||
| 0.874 | 0.821 | – | – | – | |
| Between-group | |||||
| Within-group | |||||
| Group × time | |||||
| Group | Time point | Center of gravity | Change in center of gravity |
|---|---|---|---|
| Weekly feedback group | Baseline | 3.73 | −1.93 |
| Post-intervention | 1.80 | ||
| Monthly feedback group | Baseline | 3.93 | −1.47 |
| Post-intervention | 2.47 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPhysical Activity and Health · Behavioral Health and Interventions · Heart Rate Variability and Autonomic Control
Introduction
1
As the core cohort of knowledge workers, graduate students’ health capital accumulation directly impacts the sustainable development of academic productivity and innovative capacity (1). However, prevalent issues, including intensified research pressure and prolonged sedentary behavior, have led to marked increases in chronic diseases and psychological disorders among graduate students (2, 3). A national survey targeting Chinese graduate students revealed that only 21.2% maintained regular exercise participation, whereas 76.2% remained in “occasional participation” or “preparation stages” (4). This significant intention–behavior gap reflects deficiencies in behavioral maintenance mechanisms within conventional health intervention strategies (5, 6). Consequently, developing scientific intervention strategies to increase exercise motivation, optimize behavioral patterns, and improve physical fitness has become an urgent research priority.
Aerobic capacity (AC, also termed cardiopulmonary endurance or cardiorespiratory fitness) refers to the sustained ability to perform aerobic work, which integrally reflects oxygen uptake, transport, and utilization capacities. As a core component of physical fitness (7), AC serves as a crucial indicator of overall health status (8) and has been recognized as “the fifth clinical vital sign” (9). Empirical evidence indicates that prolonged sedentary behavior results in significantly lower AC levels in graduate students than those of age-matched population norms, triggering cascading effects, including impaired concentration and diminished academic efficiency (10). Current exercise interventions predominantly focus on prescription design but often lack systematic integration of theoretical constructs critical for behavioral maintenance phases, such as dynamic goal calibration and visual feedback mechanisms (11). This theoretical deficit has led to suboptimal exercise adherence and AC improvement outcomes in graduate student populations.
In recent years, performance management theory has shown unique potential in health behavior modification, facilitating behavioral change through goal setting, dynamic feedback mechanisms, and incentive systems (12, 13). Furthermore, the 9-box performance matrix has been widely used in talent management and leadership development (14). Developed by General Electric (GE) in the 1970s, this matrix categorizes employees into nine quadrants on the basis of their performance and potential, helping organizations identify key talents and develop targeted training programs. It evaluates talent through performance outcomes and developmental potential, providing a dual-axis framework for targeted developmental strategies. Empirical studies have shown that this model can significantly increase the proportion of high-performing talent, and its applicability has been verified in business management and healthcare services (15).
Crucially, the scientific value of the 9-box grid extends beyond classification functionality to its dynamic evolution mechanism. Through periodic assessments and data visualization dashboards, this system enables managers to map talent development trajectories, establishing a closed-loop management cycle of “evaluation-feedback-optimization.” This characteristic aligns with behavioral change theories in health management, where phased feedback enhances self-efficacy to drive the transition from “behavioral intention” to “action maintenance,” as per the Transtheoretical Model (16). However, existing research has focused predominantly on corporate contexts, with health promotion applications remaining largely theoretical (17). Critical gaps persist regarding dynamic multidimensional health competencies and comparative studies on feedback frequency optimization. Evidence-based research targeting special populations is particularly scarce, resulting in intervention strategies with insufficient alignment to target groups’ cognitive profiles (18).
Therefore, this study aims to develop and preliminarily validate a dynamic 9-box grid health management model to address the persistent intention-behavior gap and insufficient sustainability of feedback mechanisms among graduate students. This interdisciplinary framework incorporates wearable-monitored behavioral data (total exercise volume) and periodic aerobic capacity assessments via standardized running tests, forming a dual-axis evaluation system that enables personalized and tiered interventions. A central goal is to empirically compare the relative efficacy of weekly versus monthly feedback in enhancing exercise adherence, self-efficacy, and aerobic fitness. Ultimately, this research seeks to advance health promotion in higher education by innovatively adapting a corporate performance management tool for health behavior guidance.
Materials and methods
2
This study was conducted in accordance with the Declaration of Helsinki and was approved by the Ethics Committee of Anqing Normal University (Approval Number: AQNU2024110). All procedures, including participant recruitment and data collection, were carried out at Hainan Normal University with institutional support and collaboration. Prior to participation, all individuals provided written informed consent after being thoroughly informed of the study’s purpose, procedures, potential risks, and benefits. Participants were assured of their voluntary involvement, the right to withdraw at any time without penalty, and the confidentiality of their anonymized data.
Participants
2.1
This preliminary quasi-experimental study examined a novel dynamic 9-box grid management system. A priori power analysis (G*Power 3.1) indicated a minimum sample size of 27 participants (α = 0.05, 1−β = 0.80), based on previously reported standard deviations for aerobic capacity (19). Accounting for a 20% attrition rate, the target sample was set to 32.
Participants were recruited through convenience sampling from a natural class of postgraduate students at Hainan Normal University between September 12 and October 1, 2024, under the supervision of the principal investigator (Li). Eligible participants were healthy postgraduate students aged 18–30 years. Exclusion criteria comprised cardiovascular disease, any contraindications identified via the Physical Activity Readiness Questionnaire (PAR-Q) (20), lower-limb injuries within the past 3 months, or symptoms of chest pain. Following baseline testing, eligible participants were stratified according to their 12-min running performance and randomly assigned to either a weekly feedback group (n = 16) or a monthly feedback group (n = 16). Personnel responsible for outcome assessment and statistical analysis were blinded to group assignment. The screening and allocation process is illustrated in Figure 1.
Flowchart of study participation and follow-up.
Experimental design
2.2
A randomized, two-group, repeated-measures design was employed to evaluate the intervention. Aerobic capacity was assessed using the 12-min run test at baseline (week 0), mid-intervention (week 5), and post-intervention (week 9). Total exercise volume (TEV) was recorded weekly throughout the 8-week intervention. Exercise identity and self-efficacy were measured using validated scales at baseline and post-intervention. The overall experimental workflow is illustrated in Figure 2.
Experimental flow design.
Methods of intervention
2.3
Based on the integrated framework centered around the 9-box grid model (Figure 3), a comprehensive intervention was implemented. The stratification matrix was constructed according to established public health and exercise science standards, specifically based on ACSM guidelines (21) and Li’s aerobic capacity assessment framework (22). This model stratified participants by cross-referencing their TEV scores and 12-min run performance, both categorized into tertiles, which positioned each individual into one of nine distinct zones.
Aerobic capacity 9-grid management model integrating health behavior change and performance management theories.
The management strategies for these zones were designed as a dynamic and tiered system aligned with health behavior change principles (23). Individuals identified as Health Star in Grid I with high motivation and high capacity received recognition and opportunities for experience sharing to bolster their self-identity and serve as benchmarks for others. Those in the High Performers grids (II & III) were provided with personalized goal planning to maintain their ability and address specific weaknesses, fostering continued growth through structured outcome expectations.
Participants in the intermediate grids (IV, V, VI) underwent optimized training regimens incorporating appropriate intensity progression and coordination exercises, supported by ongoing monitoring to stabilize exercise habits. For individuals in grids with lower motivation or capacity (VII & VIII), strategies focused on building self-efficacy through supervised guidance and lightweight activities. The most at-risk group in Grid IX received intensive one-on-one coaching involving psychological motivation and internal calibration to address initiative barriers.
To evaluate the effect of feedback timing within this model, all participants were educated on the grid rules and assessment criteria prior to the study. Each week, researchers dynamically updated participant grid positions based on the latest TEV scores. Consistent with the random assignment, the weekly feedback group received updated position and strategy recommendations each week, whereas the monthly feedback group received consolidated feedback only once at the beginning of each four-week cycle. This approach operationalized the closed-loop performance management process (24), integrating continuous assessment and tailored feedback to create a responsive intervention system aimed at sustaining behavior change.
Measurements
2.4
Exercise volume measurement (x-axis)
2.4.1
Objective exercise data (intensity, duration, frequency) were collected via Huawei Band 6 (Model: FRA-B19), with raw datasets automatically exported and synchronized to a cloud-based intelligent management platform via the Huawei Health App. To ensure the reliability of the data, two trained researchers performed a single-blind check of exercise intensity, duration, and frequency on a predesigned exercise log sheet to eliminate recording bias through cross-validation. The validated data were transformed into weekly TEV scores every Monday morning via standardized algorithms, ensuring temporal validity and computational traceability through blockchain timestamping. The TEV metric was calculated as TEV = (intensity score) × (duration score) × (frequency score), where each dimension was normalized to a 0 ~ 100 scale via z-score transformation. TEV scores were stratified into tertiles: high (80 ~ 100), moderate (60 ~ 79), and low (60), with detailed scoring criteria per dimension provided in Supplementary Table S2.
Aerobic capacity assessment (y-axis)
2.4.2
This study used a standardized 12-min run test to assess participants’ AC. Measurements were administered outdoors on a standard 400-m track. The participants wore the Huawei Band 6 to monitor exercise heart rate and intensity zones in real time, whereas the Keep application was employed for timing and dynamic tracking of running distance, enabling synchronized multidimensional data collection of exercise intensity, duration, and distance. All the data were cross-validated between the two systems to ensure an objective and accurate assessment of aerobic capacity. Performance was stratified into tertiles according to the Peking University Institute of Sports Science criteria: high (80 ~ 100), moderate (60 ~ 79), and low (60), with detailed classification parameters in Supplementary Table S3.
Exercise identity evaluation
2.4.3
The Exercise Identity Scale (EIS), developed by Anderson and Cychosz was administered to evaluate exercise identity (25). This 9-item instrument employs a 7-point Likert scale (1 = “strongly disagree” to 7 = “strongly agree”), yielding summative scores from 9 to 63, with higher scores reflecting stronger exercise identity. The Cronbach’s alpha coefficient of the scale was 0.9, indicating excellent internal consistency reliability.
Exercise self-efficacy measurement
2.4.4
The Chinese Self-Efficacy for Exercise Scale (SEE-C), translated by Lingling Li was used to measure the self-efficacy of the participants for exercise (26). The scale consists of 9 items, each of which is rated from 0 (not very confident) to 10 (very confident), and the total score ranges from 0 to 90 points. The scale has a Cronbach’s alpha coefficient of 0.75, demonstrating good internal consistency, and is suitable for assessing changes in subjects’ exercise self-efficacy in different contexts.
Data analysis
2.5
Statistical analyses were performed using SPSS 27 for data analysis, and GraphPad Prism 9 and Origin 2022 for graphing. Continuous data are presented as mean ± standard deviation (SD). Repeated-measures ANOVA with Greenhouse–Geisser correction for non-sphericity was applied to evaluate group × time interactions for primary outcomes, including 12-min run distance and score. Within-group changes were analyzed using paired t-tests, and between-group differences in secondary outcomes such as EIS and SEE-C scores were compared using independent t-tests. For significant interactions, Bonferroni post-hoc tests were conducted. Effect sizes are reported as partial eta-squared for ANOVA and Hedges’ g for t-tests, with a two-sided alpha level of 0.05 defining statistical significance. Furthermore, a weighted center of gravity index was computed based on tier numbering from 1 to 5 to assess spatial distribution shifts within the nine-square grid before and after the intervention.
Results
3
Demographic characteristics
3.1
A total of 32 participants were initially recruited, but two withdrew (one from each group due to injury or voluntary withdrawal), leaving 30 participants included in the final analysis, comprising 22 males (73.3%) and 8 females (26.7%). The mean age was 23.63 ± 1.58 years, and the mean BMI was 22.49 ± 2.74 kg/m^2^. All participants were graduate students in humanities or social sciences, with no chronic diseases or exercise contraindications. Independent samples t-tests indicated no significant differences in baseline characteristics between the weekly and monthly reminder groups (Table 1).
Table 1: Demographic characteristics of the participants.
<table><thead><tr><th align="left" rowspan="1" colspan="1">Characteristic</th><th align="center" rowspan="1" colspan="1">Total sample (<italic>n</italic> = 30)</th><th align="center" rowspan="1" colspan="1">Weekly feedback group (<italic>n</italic> = 15)</th><th align="center" rowspan="1" colspan="1">Monthly feedback group (<italic>n</italic> = 15)</th><th align="center" rowspan="1" colspan="1"><italic>p</italic>-value</th></tr></thead><tbody><tr><td align="left" rowspan="1" colspan="1">Age (years)</td><td align="center" rowspan="1" colspan="1">23.63 ± 1.58</td><td align="center" rowspan="1" colspan="1">23.73 ± 1.79</td><td align="center" rowspan="1" colspan="1">23.53 ± 1.41</td><td align="center" rowspan="1" colspan="1">0.74</td></tr><tr><td align="left" rowspan="1" colspan="1">Sex, <italic>n</italic> (%)</td><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td rowspan="1" colspan="1"/><td align="center" rowspan="1" colspan="1">1.00</td></tr><tr><td align="left" rowspan="1" colspan="1"> Male</td><td align="center" rowspan="1" colspan="1">22 (73.3)</td><td align="center" rowspan="1" colspan="1">12 (75)</td><td align="center" rowspan="1" colspan="1">12 (75)</td><td rowspan="1" colspan="1"/></tr><tr><td align="left" rowspan="1" colspan="1"> Female</td><td align="center" rowspan="1" colspan="1">8 (26.7)</td><td align="center" rowspan="1" colspan="1">4 (25)</td><td align="center" rowspan="1" colspan="1">4 (25)</td><td rowspan="1" colspan="1"/></tr><tr><td align="left" rowspan="1" colspan="1">BMI (kg/m<sup>2</sup>)</td><td align="center" rowspan="1" colspan="1">22.49 ± 2.74</td><td align="center" rowspan="1" colspan="1">21.67 ± 2.03</td><td align="center" rowspan="1" colspan="1">23.31 ± 3.15</td><td align="center" rowspan="1" colspan="1">0.10</td></tr><tr><td align="left" colspan="5" rowspan="1">Education level, <italic>n</italic> (%)</td></tr><tr><td align="left" rowspan="1" colspan="1">Master’s student</td><td align="center" rowspan="1" colspan="1">30 (100)</td><td align="center" rowspan="1" colspan="1">15 (100)</td><td align="center" rowspan="1" colspan="1">15 (100)</td><td rowspan="1" colspan="1"/></tr><tr><td align="left" colspan="5" rowspan="1">Field of study, <italic>n</italic> (%)</td></tr><tr><td align="left" rowspan="1" colspan="1">Humanities/social sciences</td><td align="center" rowspan="1" colspan="1">30 (100)</td><td align="center" rowspan="1" colspan="1">15 (100)</td><td align="center" rowspan="1" colspan="1">15 (100)</td><td rowspan="1" colspan="1"/></tr><tr><td align="left" rowspan="1" colspan="1">Chronic disease history, <italic>n</italic> (%)</td><td align="center" rowspan="1" colspan="1">0 (0)</td><td align="center" rowspan="1" colspan="1">0 (0)</td><td align="center" rowspan="1" colspan="1">0 (0)</td><td rowspan="1" colspan="1"/></tr><tr><td align="left" rowspan="1" colspan="1">Exercise contraindications, <italic>n</italic> (%)</td><td align="center" rowspan="1" colspan="1">0 (0)</td><td align="center" rowspan="1" colspan="1">0 (0)</td><td align="center" rowspan="1" colspan="1">0 (0)</td><td rowspan="1" colspan="1"/></tr></tbody></table>Effects on 12-min run distance
3.2
Repeated-measures ANOVA revealed a significant main effect of time and a significant group × time interaction on 12-min run distance, but no main effect of group (Table 2). Analysis of simple effects demonstrated a clear differential pattern of improvement between the groups (Table 3). The weekly feedback group showed significant, large improvements between all consecutive timepoints (all p < 0.001, Hedges’ g ranging from 1.26 to 1.78). In contrast, the monthly feedback group significantly improved from baseline to post-test (p < 0.001, g = 1.43) and from mid-test to post-test (p < 0.001, g = 1.26), but not from baseline to mid-test (p = 0.059, g = 0.52).
Table 2: Repeated-measures ANOVA results for 12-min run distance and score between feedback groups.
<table><thead><tr><th align="left" rowspan="2" colspan="1">Outcome</th><th align="left" rowspan="2" colspan="1">Time point</th><th align="center" rowspan="2" colspan="1">Weekly feedback group (<italic>n</italic> = 15)</th><th align="center" rowspan="2" colspan="1">Monthly feedback group (<italic>n</italic> = 15)</th><th align="center" colspan="3" rowspan="1">Time</th><th align="center" colspan="3" rowspan="1">Group</th><th align="center" colspan="3" rowspan="1">Time × group</th></tr><tr><th align="center" rowspan="1" colspan="1"> <italic>F</italic> </th><th align="center" rowspan="1" colspan="1"> <italic>p</italic> </th><th align="center" rowspan="1" colspan="1"> <italic>η<sup>2</sup></italic> </th><th align="center" rowspan="1" colspan="1"> <italic>F</italic> </th><th align="center" rowspan="1" colspan="1"> <italic>p</italic> </th><th align="center" rowspan="1" colspan="1"> <italic>η<sup>2</sup></italic> </th><th align="center" rowspan="1" colspan="1"> <italic>F</italic> </th><th align="center" rowspan="1" colspan="1"> <italic>p</italic> </th><th align="center" rowspan="1" colspan="1"> <italic>η<sup>2</sup></italic> </th></tr></thead><tbody><tr><td align="left" rowspan="3" colspan="1">12-min run distance (m)</td><td align="left" rowspan="1" colspan="1">Baseline</td><td align="center" rowspan="1" colspan="1">2139.33 ± 270.22</td><td align="center" rowspan="1" colspan="1">2168.00 ± 226.15</td><td align="center" rowspan="3" colspan="1">41.45</td><td align="center" rowspan="3" colspan="1"><0.001<sup>***</sup></td><td align="center" rowspan="3" colspan="1">0.754</td><td align="center" rowspan="3" colspan="1">0.64</td><td align="center" rowspan="3" colspan="1">0.429</td><td align="center" rowspan="3" colspan="1">0.022</td><td align="center" rowspan="3" colspan="1">4.29</td><td align="center" rowspan="3" colspan="1">0.024<sup>*</sup></td><td align="center" rowspan="3" colspan="1">0.241</td></tr><tr><td align="left" rowspan="1" colspan="1">Week 4</td><td align="center" rowspan="1" colspan="1">2369.07 ± 222.96</td><td align="center" rowspan="1" colspan="1">2246.93 ± 209.96</td></tr><tr><td align="left" rowspan="1" colspan="1">Week 8</td><td align="center" rowspan="1" colspan="1">2497.33 ± 189.49</td><td align="center" rowspan="1" colspan="1">2417.33 ± 160.20</td></tr><tr><td align="left" rowspan="3" colspan="1">12-min run score</td><td align="left" rowspan="1" colspan="1">Baseline</td><td align="center" rowspan="1" colspan="1">42.67 ± 28.09</td><td align="center" rowspan="1" colspan="1">47.00 ± 24.19</td><td align="center" rowspan="3" colspan="1">37.64</td><td align="center" rowspan="3" colspan="1"><0.001<sup>***</sup></td><td align="center" rowspan="3" colspan="1">0.736</td><td align="center" rowspan="3" colspan="1">0.47</td><td align="center" rowspan="3" colspan="1">0.50</td><td align="center" rowspan="3" colspan="1">0.016</td><td align="center" rowspan="3" colspan="1">6.49</td><td align="center" rowspan="3" colspan="1">0.005<sup>**</sup></td><td align="center" rowspan="3" colspan="1">0.325</td></tr><tr><td align="left" rowspan="1" colspan="1">Week 4</td><td align="center" rowspan="1" colspan="1">65.67 ± 19.07</td><td align="center" rowspan="1" colspan="1">52.33 ± 25.06</td></tr><tr><td align="left" rowspan="1" colspan="1">Week 8</td><td align="center" rowspan="1" colspan="1">77.00 ± 12.93</td><td align="center" rowspan="1" colspan="1">71.00 ± 18.54</td></tr></tbody></table>Table 3: Post hoc pairwise comparisons of 12-min run performance between assessment time points.
<table><thead><tr><th align="left" rowspan="1" colspan="1">Variable</th><th align="left" rowspan="1" colspan="1">Group</th><th align="left" rowspan="1" colspan="1">Comparison</th><th align="center" rowspan="1" colspan="1">Mean change</th><th align="center" rowspan="1" colspan="1">SD of difference</th><th align="center" rowspan="1" colspan="1"><italic>t</italic>-value</th><th align="center" rowspan="1" colspan="1"><italic>p</italic>-value</th><th align="center" rowspan="1" colspan="1">Hedges’ <italic>g</italic> (95% CI)</th></tr></thead><tbody><tr><td align="left" rowspan="6" colspan="1">12-min run distance (m)</td><td align="left" rowspan="3" colspan="1">Weekly feedback</td><td align="left" rowspan="1" colspan="1">Baseline vs. mid-test</td><td align="center" rowspan="1" colspan="1">229.73</td><td align="center" rowspan="1" colspan="1">151.29</td><td align="center" rowspan="1" colspan="1">6.28</td><td align="center" rowspan="1" colspan="1"><0.001<sup>***</sup></td><td align="center" rowspan="1" colspan="1">1.58 (0.81, 2.32)</td></tr><tr><td align="left" rowspan="1" colspan="1">Baseline vs. post-test</td><td align="center" rowspan="1" colspan="1">358.00</td><td align="center" rowspan="1" colspan="1">249.78</td><td align="center" rowspan="1" colspan="1">7.10</td><td align="center" rowspan="1" colspan="1"><0.001<sup>***</sup></td><td align="center" rowspan="1" colspan="1">1.78 (0.95, 2.59)</td></tr><tr><td align="left" rowspan="1" colspan="1">Mid-test vs. post-test</td><td align="center" rowspan="1" colspan="1">128.27</td><td align="center" rowspan="1" colspan="1">182.99</td><td align="center" rowspan="1" colspan="1">5.03</td><td align="center" rowspan="1" colspan="1"><0.001<sup>***</sup></td><td align="center" rowspan="1" colspan="1">1.26 (0.57, 1.93)</td></tr><tr><td align="left" rowspan="3" colspan="1">Monthly feedback</td><td align="left" rowspan="1" colspan="1">Baseline vs. mid-test</td><td align="center" rowspan="1" colspan="1">78.93</td><td align="center" rowspan="1" colspan="1">148.66</td><td align="center" rowspan="1" colspan="1">2.06</td><td align="center" rowspan="1" colspan="1">0.059</td><td align="center" rowspan="1" colspan="1">0.52 (−0.02, 1.04)</td></tr><tr><td align="left" rowspan="1" colspan="1">Baseline vs. post-test</td><td align="center" rowspan="1" colspan="1">249.33</td><td align="center" rowspan="1" colspan="1">170.02</td><td align="center" rowspan="1" colspan="1">5.68</td><td align="center" rowspan="1" colspan="1"><0.001<sup>***</sup></td><td align="center" rowspan="1" colspan="1">1.43 (0.70, 2.13)</td></tr><tr><td align="left" rowspan="1" colspan="1">Mid-test vs. post-test</td><td align="center" rowspan="1" colspan="1">170.40</td><td align="center" rowspan="1" colspan="1">131.75</td><td align="center" rowspan="1" colspan="1">5.01</td><td align="center" rowspan="1" colspan="1"><0.001<sup>***</sup></td><td align="center" rowspan="1" colspan="1">1.26 (0.57, 1.92)</td></tr><tr><td align="left" rowspan="6" colspan="1">12-min run score</td><td align="left" rowspan="3" colspan="1">Weekly feedback</td><td align="left" rowspan="1" colspan="1">Baseline vs. mid-test</td><td align="center" rowspan="1" colspan="1">23.00</td><td align="center" rowspan="1" colspan="1">14.37</td><td align="center" rowspan="1" colspan="1">6.20</td><td align="center" rowspan="1" colspan="1"><0.001<sup>***</sup></td><td align="center" rowspan="1" colspan="1">1.56 (0.79, 2.30)</td></tr><tr><td align="left" rowspan="1" colspan="1">Baseline vs. post-test</td><td align="center" rowspan="1" colspan="1">34.33</td><td align="center" rowspan="1" colspan="1">20.95</td><td align="center" rowspan="1" colspan="1">6.35</td><td align="center" rowspan="1" colspan="1"><0.001<sup>***</sup></td><td align="center" rowspan="1" colspan="1">1.60 (0.82, 2.35)</td></tr><tr><td align="left" rowspan="1" colspan="1">Mid-test vs. post-test</td><td align="center" rowspan="1" colspan="1">11.33</td><td align="center" rowspan="1" colspan="1">9.54</td><td align="center" rowspan="1" colspan="1">4.60</td><td align="center" rowspan="1" colspan="1"><0.001<sup>***</sup></td><td align="center" rowspan="1" colspan="1">1.16 (0.50, 1.79)</td></tr><tr><td align="left" rowspan="3" colspan="1">Monthly feedback</td><td align="left" rowspan="1" colspan="1">Baseline vs. mid-test</td><td align="center" rowspan="1" colspan="1">5.33</td><td align="center" rowspan="1" colspan="1">14.33</td><td align="center" rowspan="1" colspan="1">1.44</td><td align="center" rowspan="1" colspan="1">0.171</td><td align="center" rowspan="1" colspan="1">0.36 (−0.15, 0.86)</td></tr><tr><td align="left" rowspan="1" colspan="1">Baseline vs. post-test</td><td align="center" rowspan="1" colspan="1">24.00</td><td align="center" rowspan="1" colspan="1">15.38</td><td align="center" rowspan="1" colspan="1">6.05</td><td align="center" rowspan="1" colspan="1"><0.001<sup>***</sup></td><td align="center" rowspan="1" colspan="1">1.52 (0.76, 2.25)</td></tr><tr><td align="left" rowspan="1" colspan="1">Mid-test vs. post-test</td><td align="center" rowspan="1" colspan="1">18.67</td><td align="center" rowspan="1" colspan="1">14.57</td><td align="center" rowspan="1" colspan="1">4.96</td><td align="center" rowspan="1" colspan="1"><0.001<sup>***</sup></td><td align="center" rowspan="1" colspan="1">1.25 (0.56, 1.91)</td></tr></tbody></table>Effects on 12-min run score
3.3
A similar pattern was observed for the 12-min run scores. Repeated-measures ANOVA indicated a significant main effect of time and a significant group × time interaction, with no main effect of group (Table 2). Simple effects analysis confirmed that the weekly feedback group significantly improved between all assessment points with large effects (all p < 0.001, Hedges’ g ranging from 1.16 to 1.60). The monthly feedback group, however, demonstrated significant score increases only from baseline to post-test (p < 0.001, g = 1.52) and from mid-test to post-test (p < 0.001, g = 1.25), with no significant change from baseline to mid-test (p = 0.171, g = 0.36). Detailed results are presented in Table 3.
Effects on exercise identity
3.4
Both groups demonstrated significant, large improvements in EIS scores after the 9-grid management (both p < 0.001). No significant difference was found between the groups at post-test (p = 0.455), indicating that the feedback frequency did not have differing effects (see Table 4; Figure 4A).
Table 4: Comparison of EIS scores and SEE-C scores between two intervention groups before and after management.
<table><thead><tr><th align="left" rowspan="2" colspan="1">Variable</th><th align="left" rowspan="2" colspan="1">Group</th><th align="center" rowspan="2" colspan="1">Pre-test</th><th align="center" rowspan="2" colspan="1">Post-test</th><th align="center" colspan="2" rowspan="1">Within-group comparison</th><th align="center" colspan="2" rowspan="1">Between-group comparison (post-test)</th></tr><tr><th align="center" rowspan="1" colspan="1"><italic>t</italic>-value, <italic>p</italic>-value</th><th align="center" rowspan="1" colspan="1">Hedges’ <italic>g</italic> (95% CI)</th><th align="center" rowspan="1" colspan="1"><italic>t</italic>-value, <italic>p</italic>-value</th><th align="center" rowspan="1" colspan="1">Hedges’ <italic>g</italic> (95% CI)</th></tr></thead><tbody><tr><td align="left" rowspan="2" colspan="1">EIS score</td><td align="left" rowspan="1" colspan="1">Weekly feedback (<italic>n</italic> = 15)</td><td align="center" rowspan="1" colspan="1">41.13 ± 8.98</td><td align="center" rowspan="1" colspan="1">55.67 ± 7.12</td><td align="center" rowspan="1" colspan="1"><italic>t</italic> = 13.22, <italic>p</italic> < 0.001<sup>***</sup></td><td align="center" rowspan="1" colspan="1">2.21 (4.66, 2.00)</td><td align="center" rowspan="2" colspan="1"><italic>t</italic> = −0.45, <italic>p</italic> = 0.455</td><td align="center" rowspan="2" colspan="1">−0.16 (−0.86, 0.54)</td></tr><tr><td align="left" rowspan="1" colspan="1">Monthly feedback (<italic>n</italic> = 15)</td><td align="center" rowspan="1" colspan="1">43.47 ± 7.15</td><td align="center" rowspan="1" colspan="1">56.73 ± 5.86</td><td align="center" rowspan="1" colspan="1"><italic>t</italic> = 10.47, <italic>p</italic> < 0.001<sup>***</sup></td><td align="center" rowspan="1" colspan="1">2.63 (3.70, 1.54)</td></tr><tr><td align="left" rowspan="2" colspan="1">SEE-C score</td><td align="left" rowspan="1" colspan="1">Weekly feedback (<italic>n</italic> = 15)</td><td align="center" rowspan="1" colspan="1">48.00 ± 11.86</td><td align="center" rowspan="1" colspan="1">73.80 ± 8.49</td><td align="center" rowspan="1" colspan="1"><italic>t</italic> = 8.80, <italic>p</italic> < 0.001<sup>***</sup></td><td align="center" rowspan="1" colspan="1">3.32 (3.15, 1.25)</td><td align="center" rowspan="2" colspan="1"><italic>t</italic> = 2.17, <italic>p</italic> = 0.044<sup>*</sup></td><td align="center" rowspan="2" colspan="1">0.77 (0.39, 1.49)</td></tr><tr><td align="left" rowspan="1" colspan="1">Monthly feedback (<italic>n</italic> = 15)</td><td align="center" rowspan="1" colspan="1">48.33 ± 15.85</td><td align="center" rowspan="1" colspan="1">59.93 ± 23.29</td><td align="center" rowspan="1" colspan="1"><italic>t</italic> = 2.28, <italic>p</italic> = 0.039<sup>*</sup></td><td align="center" rowspan="1" colspan="1">0.57 (1.10, 0.03)</td></tr></tbody></table>Effects of different feedback conditions on exercise identity and self-efficacy. (A): Exercise Identity Scale (EIS) scores; (B): Self-Efficacy for Exercise Scale (SEE-C) scores.
Effects on self-efficacy
3.5
Analysis of SEE-C scores revealed that while both groups improved after the 9-grid management, the weekly-feedback group exhibited a significantly greater increase than the monthly-feedback group at post-test (p = 0.044), with a large effect size (see Table 4; Figure 4B). This indicates that more frequent feedback is more effective in enhancing self-efficacy regarding exercise.
Temporal progression of TEV scores
3.6
A repeated-measures ANOVA revealed a significant group × time interaction (F = 2.96, p = 0.022, η^2^ = 0.530) in TEV scores. As illustrated in Figure 5, while both groups showed an increasing trend over time, the weekly feedback group exhibited a more substantial improvement. Post-hoc between-group comparisons confirmed that the weekly group achieved significantly higher TEV scores than the monthly feedback group at multiple assessments (see Table 5), with medium to large effect sizes (Hedges’ g ranging from 0.90 to 0.97).
Temporal trajectories of the TEV scores under the 9-box grid intervention. “a, b, c” indicate different levels of significance within the weekly reminder group (p < 0.05); “A, B, C, D” indicate different levels of significance within the monthly reminder group (p < 0.05); green indicates a significant difference between the two groups (p < 0.05).
Table 5: Comparison of TEV scores between weekly and monthly feedback groups across multiple time points.
<table><thead><tr><th align="left" rowspan="1" colspan="1">Time point</th><th align="center" rowspan="1" colspan="1">Weekly feedback group (<italic>n</italic> = 15)</th><th align="center" rowspan="1" colspan="1">Monthly feedback group (<italic>n</italic> = 15)</th><th align="center" rowspan="1" colspan="1"><italic>t</italic>-value</th><th align="center" rowspan="1" colspan="1"><italic>p</italic>-value</th><th align="center" rowspan="1" colspan="1">Hedges’ <italic>g</italic> (95% CI)</th></tr></thead><tbody><tr><td align="left" rowspan="1" colspan="1">Baseline</td><td align="center" rowspan="1" colspan="1">50.67 ± 31.95</td><td align="center" rowspan="1" colspan="1">47.47 ± 34.17</td><td align="center" rowspan="1" colspan="1">0.27</td><td align="center" rowspan="1" colspan="1">0.793</td><td align="center" rowspan="1" colspan="1">0.09 (−0.60, 0.79)</td></tr><tr><td align="left" rowspan="1" colspan="1">Week 1</td><td align="center" rowspan="1" colspan="1">64.27 ± 29.63</td><td align="center" rowspan="1" colspan="1">57.47 ± 28.47</td><td align="center" rowspan="1" colspan="1">0.64</td><td align="center" rowspan="1" colspan="1">0.527</td><td align="center" rowspan="1" colspan="1">0.23 (−0.47, 0.93)</td></tr><tr><td align="left" rowspan="1" colspan="1"> <bold>Week 2</bold> </td><td align="center" rowspan="1" colspan="1"> <bold>74.67 ± 17.67</bold> </td><td align="center" rowspan="1" colspan="1"> <bold>54.53 ± 23.81</bold> </td><td align="center" rowspan="1" colspan="1"> <bold>2.63</bold> </td><td align="center" rowspan="1" colspan="1"><bold>0.014</bold> <sup>*</sup> </td><td align="center" rowspan="1" colspan="1"> <bold>0.93 (0.19, 1.66)</bold> </td></tr><tr><td align="left" rowspan="1" colspan="1"> <bold>Week 3</bold> </td><td align="center" rowspan="1" colspan="1"> <bold>68.00 ± 16.77</bold> </td><td align="center" rowspan="1" colspan="1"> <bold>51.20 ± 19.22</bold> </td><td align="center" rowspan="1" colspan="1"> <bold>2.55</bold> </td><td align="center" rowspan="1" colspan="1"><bold>0.017</bold> <sup>*</sup> </td><td align="center" rowspan="1" colspan="1"> <bold>0.91 (0.16, 1.60)</bold> </td></tr><tr><td align="left" rowspan="1" colspan="1">Week 4</td><td align="center" rowspan="1" colspan="1">67.20 ± 16.37</td><td align="center" rowspan="1" colspan="1">54.47 ± 19.23</td><td align="center" rowspan="1" colspan="1">1.95</td><td align="center" rowspan="1" colspan="1">0.061</td><td align="center" rowspan="1" colspan="1">0.69 (−0.31, 1.41)</td></tr><tr><td align="left" rowspan="1" colspan="1"> <bold>Week 5</bold> </td><td align="center" rowspan="1" colspan="1"> <bold>82.07 ± 16.38</bold> </td><td align="center" rowspan="1" colspan="1"> <bold>67.07 ± 17.77</bold> </td><td align="center" rowspan="1" colspan="1"> <bold>2.61</bold> </td><td align="center" rowspan="1" colspan="1"><bold>0.014</bold> <sup>*</sup> </td><td align="center" rowspan="1" colspan="1"> <bold>0.93 (0.18, 1.66)</bold> </td></tr><tr><td align="left" rowspan="1" colspan="1">Week 6</td><td align="center" rowspan="1" colspan="1">76.80 ± 14.36</td><td align="center" rowspan="1" colspan="1">66.67 ± 17.93</td><td align="center" rowspan="1" colspan="1">1.71</td><td align="center" rowspan="1" colspan="1">0.099</td><td align="center" rowspan="1" colspan="1">0.61 (−0.11, 1.32)</td></tr><tr><td align="left" rowspan="1" colspan="1"> <bold>Week 7</bold> </td><td align="center" rowspan="1" colspan="1"> <bold>81.53 ± 15.43</bold> </td><td align="center" rowspan="1" colspan="1"> <bold>64.47 ± 18.65</bold> </td><td align="center" rowspan="1" colspan="1"> <bold>2.73</bold> </td><td align="center" rowspan="1" colspan="1"><bold>0.011</bold> <sup>*</sup> </td><td align="center" rowspan="1" colspan="1"> <bold>0.97 (0.22, 1.70)</bold> </td></tr><tr><td align="left" rowspan="1" colspan="1"> <bold>Week 8</bold> </td><td align="center" rowspan="1" colspan="1"> <bold>84.40 ± 13.27</bold> </td><td align="center" rowspan="1" colspan="1"> <bold>71.60 ± 14.42</bold> </td><td align="center" rowspan="1" colspan="1"> <bold>2.53</bold> </td><td align="center" rowspan="1" colspan="1"><bold>0.017</bold> <sup>*</sup> </td><td align="center" rowspan="1" colspan="1"> <bold>0.90 (0.16, 1.63)</bold> </td></tr><tr><td align="left" rowspan="1" colspan="1"><italic>F</italic> value</td><td align="center" rowspan="1" colspan="1">6.059</td><td align="center" rowspan="1" colspan="1">4.017</td><td align="center" rowspan="1" colspan="1">–</td><td align="center" rowspan="1" colspan="1">–</td><td align="center" rowspan="1" colspan="1">–</td></tr><tr><td align="left" rowspan="1" colspan="1"><italic>p</italic>-value</td><td align="center" rowspan="1" colspan="1"><italic>p</italic> = 0.014<sup>*</sup></td><td align="center" rowspan="1" colspan="1"><italic>p</italic> < 0.042<sup>*</sup></td><td align="center" rowspan="1" colspan="1">–</td><td align="center" rowspan="1" colspan="1">–</td><td align="center" rowspan="1" colspan="1">–</td></tr><tr><td align="left" rowspan="1" colspan="1"><italic>η</italic> <sup>2</sup> </td><td align="center" rowspan="1" colspan="1">0.874</td><td align="center" rowspan="1" colspan="1">0.821</td><td align="center" rowspan="1" colspan="1">–</td><td align="center" rowspan="1" colspan="1">–</td><td align="center" rowspan="1" colspan="1">–</td></tr><tr><td align="left" rowspan="1" colspan="1">Between-group</td><td align="center" colspan="5" rowspan="1"><italic>F</italic> = 4.262, <italic>p</italic> = 0.048<sup>*</sup>, <italic>η</italic><sup>2</sup> = 0.132</td></tr><tr><td align="left" rowspan="1" colspan="1">Within-group</td><td align="center" colspan="5" rowspan="1"><italic>F</italic> = 9.658, <italic>p</italic> < 0.001<sup>***</sup>, <italic>η</italic><sup>2</sup> = 0.786</td></tr><tr><td align="left" rowspan="1" colspan="1">Group × time</td><td align="center" colspan="5" rowspan="1"><italic>F</italic> = 2.959, <italic>p</italic> = 0.022<sup>*</sup>, <italic>η</italic><sup>2</sup> = 0.530</td></tr></tbody></table>Tier transition dynamics between promotion groups
3.7
The changes in the center of gravity of the two feedback groups in the 9-box grid before and after management were compared to determine whether there was a trend of distribution shift (Table 6). The center of gravity of the weekly feedback group in the 9-box grid decreased from 3.73 before management to 1.80 after management, with a change of −1.93. The distribution of the number of prompts shifted significantly from the IV/V Tier to the top three tiers. The monthly feedback group’s center of gravity decreased from 3.93 to 2.47, with a change of −1.47, also showing a migration trend but with a smaller amplitude. These results indicate that the high-frequency feedback mechanism has a better promotional effect on optimizing tiered levels, particularly demonstrating outstanding efficacy in the conversion of low-performing talent. See Figure 6 for changes in trends before and after management. The complete baseline to eight-week management trend changes in the nine-grid ladder are shown in Supplementary Figures S1–S9.
Table 6: Comparison of changes in the center of gravity in the nine-square grid before and after management of the two feedback groups.
<table><thead><tr><th align="left" rowspan="1" colspan="1">Group</th><th align="left" rowspan="1" colspan="1">Time point</th><th align="center" rowspan="1" colspan="1">Center of gravity</th><th align="center" rowspan="1" colspan="1">Change in center of gravity</th></tr></thead><tbody><tr><td align="left" rowspan="2" colspan="1">Weekly feedback group</td><td align="left" rowspan="1" colspan="1">Baseline</td><td align="center" rowspan="1" colspan="1">3.73</td><td align="center" rowspan="2" colspan="1">−1.93</td></tr><tr><td align="left" rowspan="1" colspan="1">Post-intervention</td><td align="center" rowspan="1" colspan="1">1.80</td></tr><tr><td align="left" rowspan="2" colspan="1">Monthly feedback group</td><td align="left" rowspan="1" colspan="1">Baseline</td><td align="center" rowspan="1" colspan="1">3.93</td><td align="center" rowspan="2" colspan="1">−1.47</td></tr><tr><td align="left" rowspan="1" colspan="1">Post-intervention</td><td align="center" rowspan="1" colspan="1">2.47</td></tr></tbody></table>Hierarchical comparison of the two feedback groups before and after management. (A): Before Management; (B): After Eight Weeks of Implementation.
Discussion
4
This study established a dual-dimensional 9-box grid management framework integrating exercise engagement and AC, examining the intervention effects of two feedback frequencies on graduate students’ AC and exercise behavior maintenance. The results demonstrated that the framework significantly enhanced AC and TEV, while concurrently reinforcing exercise identity and self-efficacy. Comparative analysis confirmed that weekly dynamic feedback produced superior beneficial effects on AC improvement and exercise habit formation relative to monthly feedback.
Regarding the mechanism of action, the 9-box grid management model facilitates precise regulation of AC adaptation through dynamic feedback. By integrating precise quantification of exercise volume (X-axis) and periodic assessment of AC (Y-axis), the model enables tailored adjustments of exercise plans and loads based on each subject’s position within the grid, thereby promoting more efficient stimulation of cardiopulmonary adaptive changes. Moreover, goal calibration based on matrix position changes addresses the static limitations of traditional exercise prescriptions (27). Previous studies have further confirmed that high-frequency feedback can improve the processing efficiency of the prefrontal cortex in responding to exercise reward signals and facilitate the adaptive remodeling of cardiopulmonary function (28). The more favorable progress trajectory observed in the weekly feedback group—as evidenced by greater improvements in the 12-min run distance and scores—further supports the practical utility of the dynamic 9-box grid feedback model in guiding participants to adjust training loads according to their grid position, thereby efficiently inducing cardiopulmonary adaptations.
In terms of exercise participation, the 9-box grid management model demonstrated unique effectiveness in promoting sustainable behavioral patterns. The steady growth trend in TEV and the significant shift in the hierarchical center of gravity index collectively indicate that this model effectively narrows the intention–behavior gap through dual psychological mechanisms. The significant improvement in EIS scores suggests that grid-based classification reinforces participants’ self-identification as “exercisers,” while the between-group differences in SEE-C scores underscore the critical role of feedback frequency. This finding aligns with the conclusion of Kuvaas et al. (29) that the perceived effectiveness of performance feedback is positively correlated with task performance—a correlation highly dependent on the timeliness and frequency of feedback. This principle is echoed in the medical field, where a lack of feedback has been shown to deter nurses from reporting errors, thereby hindering learning and improvement (30). In the present model, the weekly updates of grid positioning translate abstract health goals into concrete, hierarchical objectives. This allows participants to visually track their progress through changes in grid coordinates, thereby consistently activating intrinsic motivation. Furthermore, analysis of the temporal evolution of exercise persistence (TEV) revealed that the weekly feedback group established a stable upward trend after the fourth intervention week. This trend is consistent with the motivation–reward neural circuit theory proposed by Michaelsen and Esch (31), which posits that high-frequency positive reinforcement facilitates the formation of exercise habit circuits. This incentive mechanism, grounded in grid advancement, establishes a tight closed loop of “assessment–feedback–optimization,” continuously reinforcing participants’ successful experiences and bolstering their exercise confidence. Thus, it effectively translates performance commitment theory from corporate management into a sustainable driver for health behavior change.
Beyond the direct physical and behavioral outcomes, the observed improvements in exercise self-efficacy and identity suggest potential indirect benefits for mental well-being. The model’s structured feedback and achievement system may foster a sense of accomplishment and control—factors known to alleviate academic stress and enhance psychological resilience (32). Although mental health was not directly assessed in this study, the well-established link between physical activity and mental health suggests the potential of this model as a multifaceted intervention worthy of further investigation.
Compared with the traditional health management paradigm, the advantages of the 9-box grid management model lie in its efficiency in interdisciplinary integration. Through a systematic and modular structural design, it enables full-cycle and multidimensional intervention and support for managed subjects (33). Due to the lack of a dynamic calibration mechanism, exercise prescriptions in traditional models often lead to participant dropout because of suboptimal intervention effects or inadequate adaptability (34, 35). In contrast, the 9-box grid management model constructed in this study not only incorporates key components such as health assessment, behavioral intervention, and outcome feedback but also emphasizes management synergy, resource synergy, and information synergy, thereby significantly enhancing participant engagement and self-management capabilities. This dynamic adjustment mechanism compensates for the lagging response to changes in individual needs observed in traditional health management (36) and solves the bottleneck of difficulty in maintaining participation in long-term interventions caused by insufficient continuous feedback and flexible adaptation (37). Additionally, by virtue of multi-agent collaboration and data-driven decision support, the 9-box grid model not only improves the systematicness and sustainability of physical health management but also provides a scientifically grounded and efficient novel approach for the field of health management.
Limitations and future research directions
5
This study has several limitations. First, the sample size, though determined by a power analysis targeting a large effect, was relatively small and consisted primarily of male graduate students, which may somewhat limit the generalizability of the findings. The gender imbalance reflects the demographics of the source program; however, post-hoc analysis found no significant moderating effect of gender on the primary outcomes. Second, although the 8-week intervention duration was adequate for capturing initial changes, the lack of a no-intervention control group somewhat constrains the ability to establish definitive causal inferences. Nonetheless, the objective activity monitoring and the large effect sizes observed bolster the plausibility of an intervention-effect relationship. Finally, due to the lack of a comprehensive mental health assessment, it is not possible to determine whether the nine-grid management increases psychological stress.
Despite these limitations, this study offers encouraging initial evidence supporting the feasibility and efficacy of a dynamic 9-box grid model, originally adapted from corporate performance management, in promoting physical activity and fitness among graduate students. It emphasizes the importance of incorporating structured, feedback-driven frameworks into behavior change interventions and illustrates the promise of digital tools for personalized health management. Future research should feature randomized controlled trials with larger, more diverse populations, including a no-intervention control group. Such designs are crucial to firmly establish causality, further explore the differential effects of feedback timing, evaluate long-term sustainability, and include mental health assessments to thoroughly assess the intervention’s effectiveness.
Conclusion
6
This study innovatively developed a dynamic 9-box grid management model based on a two-dimensional assessment of total physical activity and aerobic capacity, and validated its effectiveness in graduate student health management through an 8-week intervention experiment. This 8-week proof-of-concept study demonstrates that the model effectively enhances aerobic capacity, exercise participation, and psychological resilience indicators through a closed-loop feedback mechanism, with weekly high-frequency feedback showing advantages in behavior maintenance. It provides a promising and replicable interdisciplinary practical framework for health management in higher education institutions. Future research should validate these findings in larger and more diverse populations over longer periods to establish long-term efficacy and explore integration with mental health support systems.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Han X Xu Q Xiao J Liu Z. Academic atmosphere and graduate students’ innovation ability: the role of scientific research self-efficacy and scientific engagement. Eur J Psychol Educ. (2024) 392:1027–44. doi: 10.1007/s 10212-023-00737-x · doi ↗
- 2Ting C Luying C Zhijie Z. Global prevalence and trend of anxiety among graduate students: a systematic review and meta-analysis. Brain Behav. (2023) 134:e 2909. doi: 10.1002/BRB 3.2909 PMC 1009709236852520 · doi ↗ · pubmed ↗
- 3Sweetman MM Tripathi N Danella K Hupp S Muse M Rothrock T. The lived experiences of health science graduate students with anxiety and depression. Learn Environ Res. (2022) 263:11–8. doi: 10.1007/S 10984-022-09448-4PMC 980023136597495 · doi ↗ · pubmed ↗
- 4Xia X Huang J Liu D. An empirical study on the physical exercise behavior of graduate students. J East China Norm Univ Educ Sci Ed. (2018) 365:17. doi: 10.16382/j.cnki.1000-5560.2018.05.011 · doi ↗
- 5Naomi W Jelena M Bdr R Amy R Maria G. The mental labyrinth of postgraduate research: a qualitative study of postgraduate mental health and wellbeing and the impact of the supervisory relationship. Higher Educ. (2023) 875:1211–26. doi: 10.1007/S 10734-023-01061-5 · doi ↗
- 6Yanhao Z Xiaoli L Lina Y Xue B Xiangyu L Wenfei L. Sources of stress and coping strategies among Chinese medical graduate students: a qualitative study. BMC Med Educ. (2024) 24:624. doi: 10.1186/S 12909-024-05603-Y 38840091 PMC 11155159 · doi ↗ · pubmed ↗
- 7Lee D-c Artero EG Sui X Blair SN. Review: mortality trends in the general population: the importance of cardiorespiratory fitness. J Psychopharmacol. (2010) 24:27–35. doi: 10.1177/1359786810382057 PMC 295158520923918 · doi ↗ · pubmed ↗
- 8Dainty KN Colquitt B Bhanji F Hunt EA Jefkins T Leary M. Understanding the importance of the lay responder experience in out-of-hospital cardiac arrest: a scientific statement from the American Heart Association. Circulation. (2022) 145:e 852–67. doi: 10.1161/CIR.0000000000001054, PMID: 35306832 · doi ↗ · pubmed ↗
