Aurora: a mobile-based cognitive behavioral therapy intervention for anxiety and depression in Mexico
Alejandro López-Tello, Adriana Pérez-Gómez, Christian Gabriel Toledo-Lozano, María del Pilar Callejas-Gómez, Raúl Durón-Figueroa, Ana Moreno-Coutiño, Antonio Ramirez-Treviño, Sebastián Nava, Diego Antonio Ocampo-Gutiérrez de Velasco, Edith Zárate, Oscar Arias-Carrión

Abstract
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5| Characteristic | Value |
|---|---|
| Age, mean (SD), y | 31.2 (6.4) |
| Age range, y | 18–45 |
| Female sex, | 28 (73.7) |
| Education, university or higher, | 30 (78.9) |
| Diagnosis, | Anxiety only: 14 (36.8) |
| Duration of pharmacological treatment, mean (SD), mo | 11.4 (7.8) |
| Psychotropic medication use, | 38 (100) |
| Number of medications, mean (range) | 1.4 (1–3) |
| Outcome | Baseline, mean (SD) | Post, mean (SD) | Mean Δ (SD) | Cohen’s | ||
|---|---|---|---|---|---|---|
| Anxiety score | 13.2 (5.6) | 10.6 (5.1) | −2.6 (3.5) | 3.81 (37) | 0.001 | 0.49 (0.22–0.76) |
| Depression score | 11.1 (5.2) | 8.4 (4.9) | −2.7 (3.4) | 4.75 (37) | <0.001 | 0.54 (0.28–0.80) |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Mental Health Interventions · Child and Adolescent Psychosocial and Emotional Development · Impact of Technology on Adolescents
Introduction
1
Mental health disorders are a leading contributor to the global burden of disease. Anxiety and depression together account for more than 10% of all years lived with disability worldwide, affecting over 600 million people annually (GBD 2019 Diseases and Injuries Collaborators, 2020). Despite the availability of evidence-based treatments such as cognitive behavioral therapy (CBT), more than 75% of individuals in low- and middle-income countries (LMICs) receive no adequate mental health care (Patel et al., 2018).
In Mexico, anxiety and depression are the most prevalent psychiatric disorders, with lifetime prevalence estimates of 14.3 and 9.2%, respectively (Medina-Mora et al., 2005). The national health system faces significant challenges: a shortage of fewer than three psychiatrists per 100,000 inhabitants, fragmented integration between psychological and pharmacological care, stigma, and high out-of-pocket costs (Berenzon Gorn et al., 2013). The COVID-19 pandemic further intensified unmet needs by increasing psychological distress and disrupting access to in-person treatment (Hernandez-Diaz et al., 2022). Although the World Bank classifies Mexico as an upper-middle-income country (UMIC), its health system continues to share critical structural limitations with LMICs, including resource scarcity, inequitable access, and digital divides. This contextual nuance highlights that findings from Mexico can inform broader Latin American strategies, as many middle-income health systems in the region face similar constraints.
Digital therapeutics represent a promising approach to bridge this gap. Mobile health (mHealth) applications can deliver validated psychological interventions at scale, overcoming barriers of geography, stigma, and limited specialist availability (Naslund et al., 2017). Robust evidence demonstrates that digital CBT reduces anxiety and depressive symptoms across diverse populations (Firth et al., 2017; Linardon et al., 2019). In Latin America, emerging studies have confirmed the feasibility and effectiveness of smartphone-based mental health interventions (Araya et al., 2021; Karyotaki et al., 2023; Kim et al., 2023). Nevertheless, most commercially available mental health apps lack clinical validation, cultural adaptation, and regulatory oversight (Larsen et al., 2016).
Aurora was developed to address these shortcomings. It is a Spanish-language, self-guided mobile application delivering structured CBT through eight interactive modules. Its content—behavioral activation, cognitive restructuring, and mindfulness-based practices—was selected based on evidence linking these elements to improved outcomes in digital CBT (Cuijpers et al., 2022; Furukawa et al., 2025). Development followed a co-creation framework, incorporating clinicians, patients, and usability experts to ensure both therapeutic fidelity and user-centered design. Importantly, engagement has emerged as a consistent mediator of digital CBT outcomes, with module completion predicting greater symptom reduction (Gan et al., 2021). Assessing usability and engagement was therefore a core objective of this feasibility study.
This pilot study aimed to evaluate the feasibility, usability, and preliminary clinical impact of Aurora in a sample of pharmacologically treated patients with anxiety and depression in Mexico. We hypothesized that Aurora would be associated with reductions in self-reported symptoms, demonstrate high usability, and show a graded relationship between engagement and clinical outcomes.
Methods
2
Study design and setting
2.1
We conducted a mixed-methods, pre–post feasibility study of Aurora, a Spanish-language, self-guided digital therapeutic mobile application based on CBT principles. The trial was carried out in Mexico between March and October 2024 and was not powered for definitive efficacy testing. We adhered to the CONSORT extension for pilot and feasibility trials (Eldridge et al., 2016) and simulation-based recommendations for pilot sample sizes (Teare et al., 2014; Whitehead et al., 2016). A sample of 38 completers was sufficient to estimate feasibility and usability parameters, as well as symptom change variance, to inform a future randomized controlled trial (RCT). The study followed an iterative human-centered design process incorporating co-design with expert stakeholders, clinical usability testing, and pre–post evaluation of anxiety and depression symptoms.
Ethical considerations
2.2
The study complied with the Declaration of Helsinki and the ethical standards of the Comisión Nacional de Bioética (CONBIOÉTICA) in Mexico. The protocol (INNOVA 2024-01) received approval from the Comité de Ética en Investigación (Folio CEI-000002) and the Comité de Investigación (Folio CI-000002) of Médica Sur, S.A.B. de C.V. Review covered all procedures, including consent forms, participant interaction materials, and data protection safeguards. Written informed consent was obtained from all participants. As the intervention was a non-invasive digital therapeutic without pharmacological components, no additional physical risk was posed.
Expert advisory and co-creation panels
2.3
A multidisciplinary advisory board, comprising psychiatrists, psychologists, digital health specialists, and UX designers, guided the development of the app. Three activities structured this process: (i) workshops with psychiatrists and psychologists (n = 20) to assess clinical validity and therapeutic coherence; (ii) heuristic and accessibility evaluation by UX experts based on Nielsen’s principles, which identified issues such as excessive cognitive load, unclear navigation, and insufficient accessibility; and (iii) structured focus groups with psychiatric patients (n = 12) to assess emotional needs, use contexts, and perceived value. These activities informed successive redesigns of the interface and content.
Participants and recruitment
2.4
Participants were recruited through referrals from psychiatrists and online outreach across metropolitan areas in Mexico. Inclusion criteria were: age 18–45 years; a psychiatrist-confirmed diagnosis of mild-to-moderate anxiety and/or depression; stable pharmacotherapy for at least 1 month; and access to a smartphone with internet. Exclusion criteria were severe psychiatric comorbidity (e.g., psychosis, active suicidality) or concurrent psychotherapy. Excluding psychotherapy reduced co-intervention bias and reflected prior digital CBT trials (Johansson et al., 2019; Kambeitz-Ilankovic et al., 2022). By contrast, stable pharmacotherapy was permitted to reflect real-world practice. Of the 46 participants who initiated app use, 38 completed both pre- and post-assessments and were included in the analyses. Demographics, treatment history, and engagement were collected via digital questionnaires.
Intervention: Aurora app
2.5
Aurora is a Spanish-language, self-guided digital therapeutic classified as a Software as a Medical Device (SaMD) designed to complement pharmacotherapy for anxiety and depression. Its development followed human-centered design principles, informed by clinical guidelines, heuristic evaluation, and patient co-creation. The app integrates CBT elements with established digital efficacy, including cognitive restructuring, behavioral activation, and mindfulness training, which are prioritized both by the advisory board and by evidence from meta-analyses.
Aurora comprises eight sequentially unlockable modules targeting emotional self-regulation and cognitive restructuring (Figure 1). Modules incorporate audio-guided breathing exercises, mindfulness training, psychoeducation, behavioral activation, and structured activities to promote reframing and emotional awareness (Figure 2). At module completion, users receive personalized reports summarizing activities and symptom trajectories. Activity logs and gamified progression reinforce adherence. The interface was iteratively refined using Nielsen’s heuristics, with accessibility features such as high-contrast design, simplified icons, captions, and text-to-audio conversion. The app is compatible with both Android and iOS devices and requires continuous internet access for synchronization and to receive updated content. All data were encrypted and managed in accordance with the Mexican Federal Law on the Protection of Personal Data Held by Private Parties.
Aurora app architecture and therapeutic features. Schematic overview of Aurora, a Spanish-language, self-guided mobile application delivering cognitive behavioral therapy (CBT) for depression and anxiety. The app includes eight sequentially unlockable modules integrating breathing exercises, mindfulness training, psychoeducation, cognitive restructuring, and behavioral activation. Core functions include: (1) self-assessment tools for daily mood and symptom monitoring, (2) multimedia psychoeducational videos, (3) guided audio relaxation and CBT exercises, (4) structured activity logs and reflections, and (5) progressive unlocking of modules to encourage sustained engagement. Accessibility features include high-contrast icons, text-to-audio options, and captioned multimedia content.
Aurora user interface and digital experience. Representative screenshots of the Aurora app, illustrating core features and user journey. Panels show: (1) the home interface with module progression, (2) breathing and mindfulness exercises, (3) psychoeducational content on cognitive restructuring, (4) self-assessment scales for anxiety, depression, and positive emotions, (5) graphical feedback of scores, (6) access to technical support, and (7) automatically generated personal and medical reports. Multimedia integration, gamified feedback, and encrypted data management support user engagement and adherence.
Outcome measures
2.6
Primary outcomes. Usability was assessed with the Spanish-validated System Usability Scale (SUS) (Sevilla-Gonzalez et al., 2020) and the user Mobile App Rating Scale (uMARS) (Martin-Payo et al., 2021). Qualitative feedback was thematically analyzed to identify barriers, benefits, and design improvements. Engagement was measured via in-app analytics of time, module completion, and frequency of use, and categorized into low (0–2), moderate (3–5), and high (6–7) engagement groups to test dose–response associations (Gan et al., 2021).
Secondary outcomes. Self-reported symptoms of anxiety and depression were measured using the Spanish Goldberg Anxiety and Depression Scale (GADS), comprising two nine-item subscales. Validated versions in Latin America were used (Espinosa et al., 2015; Martín Carbonell et al., 2016; Reivan-Ortiz et al., 2019). Internal consistency was assessed in our sample, with Cronbach’s α reported in the Results. Details of scale items, scoring, and validation are provided in Supplementary Table S1.
Statistical analysis
2.7
Analyses were conducted in Python and SPSS v27. Within-group changes in anxiety and depression were tested using paired two-tailed t-tests, with Cohen’s d and 95% CIs reported. Engagement–outcome associations were examined using Kruskal–Wallis and Jonckheere–Terpstra tests, Spearman’s ρ, and adjusted OLS regressions (controlling for baseline severity, age, sex). Regression models reported β coefficients, SEs, 95% CIs, and p-values. Exploratory subgroup analyses were conducted stratifying participants by age, sex, and baseline severity. Feasibility metrics included recruitment, retention, and usability scores.
Results
3
Participant characteristics
3.1
A total of 46 individuals initiated the intervention, of whom 38 (82.6%) completed both the pre- and post-assessments and were included in the final analysis. Participants were predominantly female (73.7%), aged 18–45 years (mean = 31.2 years; SD = 6.4), with the majority holding a university-level education (78.9%). All participants had been previously diagnosed with anxiety and/or depression by a psychiatrist and were undergoing pharmacological treatment during the study period (mean = 1.4 psychotropic medications; range: 1–3). No participant received concurrent psychotherapy, as this was not in line with the eligibility criteria. Baseline demographic and clinical characteristics are shown in Table 1.
The Goldberg Anxiety and Depression Scale (GADS) demonstrated acceptable internal consistency in our sample (Cronbach’s α = 0.82 for anxiety; 0.80 for depression). The item structure and scoring are summarized in Supplementary Table S1.
Usability and user experience
3.2
Usability testing with a subset of 15 users identified barriers, including excessive text density, limited interaction feedback, and inconsistencies in navigation. Iterative refinements addressed these issues.
Final usability metrics indicated high acceptability, with a mean System Usability Scale score of 78.5/100, exceeding the commonly accepted threshold for “good” usability. On the uMARS, users rated engagement, functionality, esthetics, and information quality above 4.0/5, consistent with validated benchmarks.
Participants reported that the app was easy to use, relevant to their daily lives, and aligned with their therapeutic needs.
Perceived impact
3.3
Thirty-one of 38 participants (81.6%) reported perceivable psychological improvements attributed to Aurora, particularly in emotional self-regulation, anxiety control, and integration of breathing and mindfulness exercises into daily routines. The highest perceived impact was observed among users with high engagement.
Qualitative responses reinforced quantitative findings. Representative themes included: “learning how to manage thoughts more effectively,” “feeling calmer during the day,” and “feeling supported between therapy sessions.” Common barriers included limited offline access and the desire for more personalized options.
Changes in anxiety and depression symptoms
3.4
Use of the Aurora app was associated with statistically significant and clinically meaningful reductions in self-reported symptoms of anxiety and depression (Table 2).
Anxiety symptoms: Mean scores decreased from 13.2 (SD = 5.6) at baseline to 10.6 (SD = 5.1) post-intervention (Δ = −2.6, SD = 3.5), t(37) = 3.81, p = 0.001, Cohen’s d = 0.49, 95% CI [0.22, 0.76]. Figure 3 illustrates pre–post distributions.
Reductions in anxiety symptoms following Aurora use. Distribution of anxiety scores (Goldberg Anxiety subscale, 0–21) at baseline (Pre) and post-intervention (Post). Mean anxiety scores decreased from 13.2 (SD = 5.6) to 10.6 (SD = 5.1), Δ = −2.6 (SD = 3.5), t(37) = 3.81, p = 0.001, Cohen’s d = 0.49 (95% CI [0.22, 0.76]). Boxplots display medians, interquartile ranges, and outliers. Symptom reductions were consistent across age and gender strata.
Depression symptoms: Mean scores decreased from 11.1 (SD = 5.2) to 8.4 (SD = 4.9) (Δ = −2.7, SD = 3.4), t(37) = 4.75, p < 0.001, Cohen’s d = 0.54, 95% CI [0.28, 0.80]. Figure 4 illustrates pre–post distributions.
Reductions in depression symptoms following Aurora use. Distribution of depression scores (Goldberg Depression subscale, 0–21) at baseline and post-intervention. Mean depression scores decreased from 11.1 (SD = 5.2) to 8.4 (SD = 4.9), Δ = −2.7 (SD = 3.4), t(37) = 4.75, p < 0.001, Cohen’s d = 0.54 (95% CI [0.28, 0.80]). Boxplots show distributions with consistent reductions across subgroups.
Symptom improvements were consistent across subgroups stratified by age and gender, suggesting the broad applicability of Aurora across various user profiles.
App engagement profiles
3.5
Based on the number of modules completed, participants were categorized into three engagement groups: low (0–2 modules, n = 9), moderate (3–5 modules, n = 12), and high engagement (6–7 modules, n = 17). The median total time dedicated to in-app activities was 98 min (range, 22–193 min).
Dose–response analysis demonstrated a graded association between engagement and clinical benefit. Compared with the low-engagement group, participants with high engagement reported significantly larger reductions in both anxiety (Kruskal–Wallis H = 7.82, p = 0.02) and depression (H = 9.14, p = 0.01). A Jonckheere–Terpstra test confirmed an ordered trend (p = 0.01 for anxiety; p = 0.008 for depression).
Spearman’s correlations indicated significant associations between modules completed and change scores (ρ = −0.41, p = 0.01 for anxiety; ρ = −0.44, p = 0.007 for depression). In adjusted OLS regression, each additional completed module predicted a mean reduction of 0.38 points in anxiety (β = −0.38, SE = 0.14, 95% CI [−0.66, −0.10], p = 0.009) and 0.41 points in depression (β = −0.41, SE = 0.15, 95% CI [−0.71, −0.12], p = 0.006). These results are detailed in Supplementary Tables S2–S3 and illustrated in Figure 5.
Dose–response association between engagement and symptom change. Scatterplots with regression lines illustrate the relationship between the number of modules completed (x-axis) and changes in anxiety and depression scores (y-axis). Higher engagement predicted greater symptom reduction: anxiety β = −0.38 (SE = 0.14), p = 0.009; depression β = −0.41 (SE = 0.15), p = 0.006. Shaded regions represent 95% confidence intervals. Group means (low = 0–2, moderate = 3–5, high = 6–7 modules) are overlaid for comparison. Results demonstrate a graded dose–response effect.
Discussion
4
This feasibility study provides preliminary evidence that Aurora, a culturally adapted, Spanish-language mobile application delivering CBT, is usable, acceptable, and clinically relevant for patients with anxiety and depression in Mexico. Among participants on stable pharmacotherapy, the app was associated with moderate reductions in self-reported symptoms, high usability ratings, and a precise dose–response between engagement and outcomes. These findings extend the literature on digital therapeutics, which has been predominantly shaped by evidence from high-income countries, to an underrepresented Latin American context.
A central strength of this work is the co-creation methodology, which integrated clinicians, patients, and usability experts from the outset. This process ensured therapeutic fidelity while addressing common shortcomings of commercial mental health apps, including excessive cognitive load and inadequate cultural adaptation. The observed usability scores (mean SUS = 78.5/100; uMARS = 4.2/5) compare favorably with benchmarks reported in digital CBT interventions from high-income countries (Firth et al., 2017; Linardon et al., 2019), underscoring the potential for deployment in Spanish-speaking health systems with resource limitations.
The dose–response association between module completion and symptom reduction strengthens the case for Aurora’s clinical relevance. Participants in the high-engagement group achieved reductions of nearly four points on both anxiety and depression subscales, compared with minimal change in the low-engagement group. Regression analyses confirmed that each additional module completed predicted incremental symptom improvement, independent of baseline severity, age, and sex. This finding is consistent with meta-analytic results indicating that the intensity of use is a key mediator of digital CBT outcomes (Furukawa et al., 2025; Gan et al., 2021). Engagement thus emerges as a modifiable determinant of clinical benefit, with implications for future app design and implementation strategies.
This study also contributes to the debate on the external validity of digital therapeutics in UMICs. Although Mexico is classified as a UMIC, its mental health system continues to share critical constraints with LMICs, including workforce shortages, stigma, fragmented care, and high out-of-pocket costs. By situating our findings within this broader context, we show how scalable digital solutions may help bridge structural gaps across Latin America. Evidence from pragmatic trials in Chile, Brazil, and other LMICs has similarly demonstrated the feasibility and acceptability of mobile CBT interventions in under-resourced environments (Araya et al., 2021; Karyotaki et al., 2023; Kim et al., 2023).
Several limitations should temper interpretation. First, the absence of a control group precludes causal inference; symptom reductions may partly reflect regression to the mean, expectancy effects, or concurrent pharmacotherapy. Second, although all participants were receiving medication (mean, 1.4 drugs; range, 1–3), outcomes were not stratified by pharmacological class. Exploratory sensitivity analyses suggest that engagement–outcome associations were robust to medication count, but larger studies are needed to evaluate interactions between pharmacotherapy and digital CBT. Third, data on illness chronicity were not collected. Chronicity may moderate treatment response, and future trials should include this variable (Cuijpers et al., 2022; Kambeitz-Ilankovic et al., 2022; Haller et al., 2023; Sextl-Plotz et al., 2024). Fourth, although the Spanish GADS has been validated in Mexico and other Latin American populations (Espinosa et al., 2015; Martín Carbonell et al., 2016; Reivan-Ortiz et al., 2019), the minor linguistic adaptations applied here were not formally re-validated. We confirmed good internal consistency in this sample; however, psychometric validation in Mexican populations remains necessary. Finally, participants were predominantly urban and highly educated, which may limit the generalizability of the findings to rural or underserved populations.
Despite these limitations, the findings carry important implications. First, the results demonstrate that a self-guided CBT app can deliver clinically meaningful benefits as an adjunct to pharmacotherapy, supporting its potential role in integrated care. Second, the graded dose–response highlights the importance of engagement, suggesting that strategies such as gamification, reminders, or integration with telepsychiatry platforms could enhance effectiveness. Third, the co-creation framework—balancing clinical fidelity with user-centered design—offers a scalable model for adapting digital therapeutics to diverse cultural contexts.
Future research should prioritize adequately powered, randomized controlled trials with active comparators, longer follow-up periods, and clinician-rated outcomes. A collection of chronicity, medication regimens, and sociodemographic moderators will help identify who benefits most. Broader linguistic and cultural adaptation will also be essential before Aurora can be scaled across Latin America, but its modular architecture facilitates such expansion.
In conclusion, Aurora represents a feasible, usable, and culturally tailored digital therapeutic for anxiety and depression in Mexico. By combining evidence-based CBT with rigorous co-design and usability testing, it offers a scalable strategy to reduce treatment gaps in Spanish-speaking middle-income settings. If validated in larger controlled trials, Aurora and similar interventions could contribute to addressing one of the most urgent challenges in global mental health: the persistent inequity in access to effective treatment for common psychiatric disorders.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Araya R.Menezes P. R.Claro H. G.Brandt L. R.Daley K. L.Quayle J.. (2021). Effect of a digital intervention on depressive symptoms in patients with comorbid hypertension or diabetes in Brazil and Peru: two randomized clinical trials. JAMA 325, 1852–1862. doi: 10.1001/jama.2021.4348, PMID: 33974019 PMC 8114139 · doi ↗ · pubmed ↗
- 2Berenzon Gorn S.Saavedra Solano N.Medina-Mora Icaza M. E.Aparicio Basauri V.Galvan Reyes J. (2013). Evaluation of the mental health system in Mexico: where is it headed? Rev. Panam. Salud Publica 33, 252–258. doi: 10.1590/S 1020-49892013000400003, PMID: 23698173 · doi ↗ · pubmed ↗
- 3Cuijpers P.Ciharova M.Quero S.Miguel C.Driessen E.Harrer M.. (2022). The contribution of "individual participant data" meta-analyses of psychotherapies for depression to the development of personalized treatments: a systematic review. J. Pers. Med. 12:93. doi: 10.3390/jpm 12010093, PMID: 35055408 PMC 8781368 · doi ↗ · pubmed ↗
- 4Eldridge S. M.Chan C. L.Campbell M. J.Bond C. M.Hopewell S.Thabane L.. (2016). CONSORT 2010 statement: extension to randomised pilot and feasibility trials. BMJ 355:i 5239. doi: 10.1136/bmj.i 5239, PMID: 27777223 PMC 5076380 · doi ↗ · pubmed ↗
- 5Espinosa M. C.Orozco L. A.Ybarra J. L. (2015). Síntomas de ansiedad, depresión y factores psicosociales en hombres que solicitan atención de salud en el primer nivel. Salud Mental 38, 201–208. doi: 10.17711/SM.0185-3325.2015.028 · doi ↗
- 6Firth J.Torous J.Nicholas J.Carney R.Pratap A.Rosenbaum S.. (2017). The efficacy of smartphone-based mental health interventions for depressive symptoms: a meta-analysis of randomized controlled trials. World Psychiatry 16, 287–298. doi: 10.1002/wps.20472, PMID: 28941113 PMC 5608852 · doi ↗ · pubmed ↗
- 7Furukawa T. A.Noma H.Tajika A.Toyomoto R.Sakata M.Luo Y.. (2025). Personalised & optimised therapy (POT) algorithm using five cognitive and behavioural skills for subthreshold depression. NPJ Digit. Med. 8:531. doi: 10.1038/s 41746-025-01906-6, PMID: 40835713 PMC 12368086 · doi ↗ · pubmed ↗
- 8Gan D. Z. Q.Mc Gillivray L.Han J.Christensen H.Torok M. (2021). Effect of engagement with digital interventions on mental health outcomes: a systematic review and meta-analysis. Front. Digit. Health 3:764079. doi: 10.3389/fdgth.2021.764079, PMID: 34806079 PMC 8599127 · doi ↗ · pubmed ↗
