Validity of an AI-Assisted Dietary Recording Application for Family-Based Nutritional Management in Young Patients with Anorexia Nervosa
Nao Shiraishi, Rieko Kawase, Haruka Ogawa, Tatsuo Akechi

TL;DR
This study tested an AI app for tracking food intake in young patients with anorexia nervosa and found it to be reasonably accurate when used by parents.
Contribution
The study provides evidence for the validity of an AI-assisted dietary recording app in family-based nutritional management of anorexia nervosa.
Findings
The app's estimated energy intake did not significantly differ from the reference method.
Moderate to high correlations were observed for energy and major nutrient intakes.
Excluding outliers improved agreement and correlation for energy intake.
Abstract
Background: Evidence regarding the validity of digital dietary recordings in the family-based nutritional management of anorexia nervosa (AN) remains limited. This study evaluated the validity of an AI-assisted dietary recording application (app) used by the parents of young patients with AN, with total energy intake as the primary outcome and major nutrient intake as the secondary outcome. Methods: During hospital leave prior to discharge, one day of dietary intake was recorded by parents using the app based on meal photographs. The energy and nutrient intakes estimated by the app were compared with a registered dietitian using visual estimation of the photographs. Differences were examined using the Wilcoxon signed-rank test, associations were assessed using Spearman’s correlation coefficients, and agreement was evaluated using Bland–Altman analysis. Sensitivity analyses excluding…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5- —a Grant-in-Aid for Scientific Research from the Japanese Ministry of Education, Science, and Technology
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEating Disorders and Behaviors · Digital Mental Health Interventions · Mobile Health and mHealth Applications
1. Introduction
Anorexia nervosa (AN) is a severe and potentially life-threatening eating disorder primarily affecting adolescents and young adults, particularly females. With an estimated lifetime prevalence of 0.8–6.3% in women [1], its incidence peaks during adolescence, as approximately 40% of cases occur in 15- to 19-year-old females [2]. In recent decades, the incidence and prevalence of eating disorders, including AN, have steadily increased worldwide, highlighting a growing public health concern [3,4]. AN often leads to extreme malnutrition and medical complications requiring intensive treatment, and has one of the highest mortality rates among mental disorders, with approximately 5% of patients dying from organ failure or suicide [5]. These epidemiological realities highlight the need for enhanced therapeutic strategies for AN, especially those supported by digital health technologies that facilitate nutritional rehabilitation for the affected youth.
Digital health interventions have been explored for individuals with AN, including smartphone applications (apps) and Internet-based programs aimed at supporting weight restoration and preventing relapses. A pilot trial of a smartphone aftercare app reported high patient adherence and acceptability but no significant improvements in body mass index (BMI) and eating disorder symptoms [6]. Internet-based relapse prevention demonstrated no significant increase in weight gain and a significant decrease in eating disorder symptoms among patients discharged from hospital for AN [7]. However, patient-focused digital approaches may be limited by their reliance on self-monitoring. This reliance is problematic given that patients with AN are prone to misjudging portion sizes and overreporting their food intake [8]. Moreover, existing digital interventions have focused almost exclusively on the patients themselves, with minimal integration with family management. Family members, particularly parents, play a pivotal role in the nutritional restoration of adolescents with AN. Family-Based Treatment (FBT) is currently considered the most effective evidence-based treatment for adolescent AN [9,10]. In FBT, parents are expected to take primary responsibility for nutritional restoration because the patient’s eating behavior is strongly influenced by illness. However, little is known about the extent to which family caregivers use digital health technologies to assist in their children’s nutritional restoration.
The present study was designed to address this gap by leveraging a dietary recording app for family-based nutritional management of AN. We selected the Asken app, a widely used Japanese AI-assisted dietary recording tool, because of its demonstrated validity in estimating nutrient intake compared to traditional weighed dietary records [11]. In evaluations of dietary recording apps, the Asken app also showed strong concordance with paper-based dietary records used as the reference method, with a median Spearman correlation coefficient of 0.80 for nutrient intake [12]. Notably, recent clinical trials have incorporated the Asken app into nutritional interventions for other medical conditions, including dietary therapy for type 2 diabetes [13] and LDL-cholesterol management for familial hypercholesterolaemia [14]. Accordingly, this study aimed to evaluate the validity of the Asken application as an AI-assisted dietary recording tool in the therapeutic context of AN. Given that energy intake constitutes a central component of nutritional management in AN, the primary objective was to examine the validity of the app’s estimated total energy intake compared to the reference method assessed by a registered dietitian. As a secondary objective, we further assessed the validity of major macronutrient intake (i.e., protein, fat, and carbohydrates). By focusing on these nutritional parameters, this study explored whether an AI-assisted dietary recording application could provide sufficiently accurate intake estimates to support family-based nutritional management in young patients with AN.
2. Materials and Methods
2.1. Participants
The study participants were patients with AN admitted to the psychiatric ward of Nagoya City University Hospital and their parents. The purpose of admission was to undergo nutritional therapy to restore the severely low body weight. The inclusion criteria were inpatients who (1) aged between 10 and 24 years; (2) diagnosed with AN according to the Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition [15]; (3) BMI of less than 17; (4) had pre-admission meals provided by their participating parents. This study excluded inpatients who were (1) male, (2) pregnant, or (3) requiring nutritional therapy for conditions other than AN. Male patients were excluded to maintain a more homogeneous study sample as sex-related differences in body composition and nutritional requirements introduce additional variability that might affect the validation of dietary intake estimates.
Eligible participants were recruited once inpatients had reached a BMI of 15.5, prior to consuming meals during hospital leave in preparation for discharge. The BMI threshold was set with reference to the manual for the physical management of AN [16], which defines a BMI of 16.0 as the minimum weight required for safe outpatient treatment. After receiving a thorough explanation of the research purpose and methods, written informed consent was obtained from all the participants. The study was approved by the Institutional Review Boards of Nagoya City University Graduate School of Medical Sciences and Nagoya City University Hospital (protocol code: 60-20-0060) and was conducted in accordance with the principles of the Declaration of Helsinki.
2.2. Dietary Record
During hospital leave, the patients’ parents were asked to record their dietary intake using the Asken app installed on an iPad provided by the hospital for research purposes. Before lending the iPad, parents received a brief 20 min orientation on app use, as outlined below.
Parents were instructed to prepare meals that were typical for their household, while also reflecting on what they considered necessary to support weight restoration after discharge. When they prepared a meal (breakfast, lunch, or dinner), each meal was photographed with the iPad before eating, ensuring that the patient did not participate in meal preparation. Parents were also instructed to photograph only a single serving and place a ruler alongside the dishes, allowing portion sizes to be accurately estimated using the photographs as a reference [17].The photographs were analyzed using the AI assessment system embedded in the app. The system automatically displayed the candidate food items for each dish (Figure 1). If the suggested items were correct, the parents confirmed them. If an item was incorrect, they selected the correct item from among the options displayed. When no appropriate option was available, the participants searched for foods by entering their names into the search box. The Asken app is supported by a comprehensive food database containing more than 150,000 menu items. Parents could search for and confirm foods using commercial products, common menus, or individual ingredient names.Upon returning from hospital leave, parents returned the iPad to the ward staff. The analysis results obtained from the app were then used to collect data on energy intake (kcal/day) and the following 13 nutrient intakes: protein (g/day), fat (g/day), saturated fatty acids (SFAs, g/day), carbohydrates (g/day), dietary fiber (g/day), calcium (mg/day), iron (mg/day), vitamin A (µg/day), vitamin E (mg/day), vitamin B1 (mg/day), vitamin B2 (mg/day), vitamin C (mg/day), and sodium (g/day).
2.3. Reference Method
A registered dietitian served as an independent evaluator. She had 13 years of professional experience as a registered dietitian, including 9 years of experience in the nutritional rehabilitation of patients with eating disorders, mainly AN. The dietitian was blinded to the results obtained from the app. She received encrypted meal photographs and analyzed them using a secure computer within the university hospital network. She estimated energy and 13 nutrient intakes through a visual assessment of food types and portion sizes using a ruler alongside dishes in the photographs and referencing the Standard Tables of Food Composition in Japan (2020 edition) [18].
2.4. Statistical Analysis
The required sample size was determined based on correlation analysis to evaluate the similarity in total energy intake between the app and the dietitian-assessed reference method. Referring to the study by Matsuzaki et al. [11], we assumed a correlation coefficient of 0.6, with a two-sided α of 0.05 and a power of 0.90. This calculation indicated that 25 participants were required, accounting for an anticipated 20% of the missing data. The final target sample size was set to 30. Missing data were to be excluded from subsequent analyses.
Descriptive statistics are presented as the median and interquartile range (IQR) for energy and nutrient intake estimated by the app and the reference method. The differences between the two methods were evaluated using the Wilcoxon signed-rank test (two-sided, α = 0.05). The association between the two methods was then quantified through Spearman’s rank correlation coefficient (ρ), with 95% confidence intervals (95% CIs) estimated by nonparametric bootstrap resampling (10,000 replicates). In addition to unadjusted correlations, energy-adjusted correlations were quantified using the residual (energy-adjustment) method: residuals from a linear regression of each variable on total energy intake were obtained for both methods, and Spearman’s ρ values with 95% CIs were estimated between the corresponding residuals [19]. The strength of Spearman’s ρ was interpreted as follows: values of 0–0.29 indicated negligible correlation, 0.30–0.49 low correlation, 0.50–0.69 moderate correlation, 0.70–0.89 high correlation, and 0.90–1.00 very high correlation [20]. The agreement between the two methods was further evaluated using Bland–Altman analysis [21]. For each variable, the mean difference (bias) between the two methods and the 95% limits of agreement (LOAs; mean difference ± 1.96 × standard deviation of the differences) were calculated. Bland–Altman plots were created to visualize the agreement and identify any systematic bias or trends across the range of measurements.
Sensitivity analysis was conducted in cases where participants showed large discrepancies in energy intake between the app and the reference method. For each participant, the difference in the total energy intake between the two methods was calculated and illustrated using a bar chart. A threshold of ±1000 kcal was applied to highlight cases with notably higher or lower estimates from the application compared with the reference method. Based on the manual for the physical treatment of AN [16], this threshold corresponds to the additional energy intake required to achieve a weight gain of approximately 1 kg/week. One thousand kcal is regarded as the upper limit of the acceptable physical burden per day during AN treatment. After excluding participants whose energy intake differences exceeded this threshold, the same statistical procedures as described above—namely, the Wilcoxon signed-rank test, Spearman’s rank correlation, and Bland–Altman analysis—were repeated to examine whether the exclusion of outliers affected the overall agreement between the two methods.
All statistical analyses were performed using Python (version 3.12.11; Python Software Foundation, Wilmington, DE, USA) on Google Colaboratory (Google LLC, Mountain View, CA, USA). A large language model (ChatGPT, version 5.2; OpenAI) was used for English language editing of the manuscript, including improvements to grammar, wording, and clarity.
3. Results
3.1. Participant Characteristics
A total of 31 female patients with AN and their mothers were enrolled between September 2020 and July 2025. No fathers were enrolled during the study period. One parent–child pair was excluded from the analyses because the mother photographed meals served to the entire family. All 30 participants provided complete dietary records. The sociodemographic and clinical characteristics are summarized in Table 1. All patients were students aged 10–22 years, with a mean illness duration of 12.3 months. The restrictive subtype accounted for the majority of the patients (86.7%). During the current admission, the mean BMI increased from 12.7 at admission to 16.4 at the time of dietary recording over an average hospital stay of 74.4 days. All involuntarily admitted patients (40%) were converted to voluntary admissions prior to dietary recording. The mothers were aged 38–56 years. Regarding education, 43.3% and 36.7% had completed a two-year college and university education, respectively. One-third of the mothers were regularly employed, whereas an additional one-third reported irregular employment. All mothers were married, and 93.3% had one to three children. Only 10% of the mothers received treatment for mental health problems.
3.2. Differences and Associations Between the App and the Reference Method
The energy and major nutrient intakes estimated by the app were generally comparable to those obtained by the reference method (Table 2). Total energy intake did not differ significantly between the two methods (median: 2462 vs. 2439 kcal/day; p = 0.903). No significant differences were observed in the protein or carbohydrate intake between the two methods (p = 0.452 and p = 0.061, respectively). In contrast, the app yielded significantly higher values for 11 nutrients: fat, saturated fatty acids, dietary fiber, calcium, iron, vitamins A, E, B1, B2, C, and sodium (all p < 0.05).
In the unadjusted analyses, Spearman’s correlation coefficients (ρ) indicated moderate to high associations between the app and the reference method across most nutrients (ρ = 0.56–0.79). After adjustment for energy intake, the associations remained moderate to high for 10 of the 13 nutrients (ρ = 0.50–0.72), except for SFAs and vitamins A and E.
3.3. Agreement Between the App and the Reference Method
Bland–Altman analysis was conducted to assess the agreement between the app and the reference method for energy and nutrient intake. As shown in Figure 2, the mean bias between the two methods was generally small for total energy, protein, fat, and carbohydrate intakes, with most of the data points falling within the 95% limits of agreement (LoAs). No systematic trends were evident across the range of measurements, indicating overall good agreement between the two methods for energy and major nutrients. For the remaining nutrients, SFAs, dietary fiber, calcium, iron, vitamins A, E, B1, B2, C, and sodium, the Bland–Altman plots revealed no apparent systematic bias, with most observations falling within the 95% LoAs (Supplementary Figure S1).
However, the 95% LoA was relatively wide for several nutrients—particularly for total energy (−822.03 to 1007.47 kcal/day)—suggesting the presence of considerable interindividual variability among participants. A few participants showed marked deviations from the mean bias, which may have affected overall agreement.
3.4. Individual Differences in Energy Intake
Figure 3 illustrates the individual differences in the total energy intake between the two methods. Two participants showed differences exceeding 1000 kcal/day, suggesting a potential overestimation by the app compared with the reference method. The following cases describe the main misclassifications of dish type and portion size for participants ID12 and ID23. In the former, lunch included a discrepancy, in which a plain rolled omelet was recorded in the app as an omelet rice dish, resulting in an overestimation of 522 kcal. For dinner, although “ramen noodles” was correctly selected, the portion size was entered in the app as a full serving instead of the actual smaller amount, leading to an additional overestimation of 261 kcal. Overall, the portion sizes entered into the app did not match those shown in the photographs. In the latter case, as a dinner dish, the app selected a type of meat with a higher estimated fat content than the reference method, resulting in an overestimation of 953 kcal. For lunch, “tenshinhan,” a Chinese-style rice dish with egg and sauce, was recorded in the app as a restaurant-style portion that contained larger amounts of egg and oil, producing a further overestimation of 759 kcal.
3.5. Sensitivity Analysis
As shown in Table 3, the exclusion of participants ID12 and ID23 improved the overall consistency between the app and the reference methods. The median energy intake estimated using the app remained comparable to that estimated using the reference method (2445 vs. 2439 kcal/day; p = 0.614), which is in line with the findings of the primary analysis. Moreover, the correlations between the two methods became stronger for most nutrients, with unadjusted Spearman’s ρ ranging from 0.59 to 0.84 and energy-adjusted ρ ranging from 0.53 to 0.71. In particular, for total energy—a central parameter in the treatment of AN—Spearman’s correlation coefficient increased from 0.62 (95% CI: 0.32–0.82) to 0.74 (95% CI: 0.49–0.89), indicating stronger consistency in ranking between the two methods. Correspondingly, Bland–Altman analysis revealed that the 95% LoA narrowed, with the lower limit improving from −822.03 kcal to −416.79 kcal and the upper limit from 1007.47 kcal to 379.71 kcal. Figure 4 and Supplementary Figure S2 show no evident systematic trend across the range of measurements, thereby confirming a stable agreement between the two methods after the exclusion of outliers.
4. Discussion
4.1. Summary of Main Findings
This study evaluated the validity of an AI-assisted dietary recording application in comparison with dietitian-assessed visual estimation in young inpatients with AN receiving nutritional therapy. The total energy intake estimated by the app did not significantly differ from that obtained by the reference method (median: 2462 vs. 2439 kcal/day) and showed a moderate correlation (ρ = 0.62). Energy-adjusted analyses also confirmed moderate-to-high correlations for 10 of the 13 nutrients. Although the app tended to overestimate energy and nutrient intakes, sensitivity analyses excluding outliers strengthened the validity of results, with an improved correlation for total energy intake (ρ = 0.74) and major nutrients. Consistently, Bland–Altman analysis demonstrated a narrower LoA for energy and major nutrients after the exclusion of outliers, indicating an enhanced agreement between the two methods. These findings suggest that the app can estimate the total energy and major nutrient intake with an accuracy comparable to that of a trained dietician.
4.2. Interpretation of Study Findings
Although evidence regarding the accuracy of parent-assessed dietary records for children with AN is limited, an overestimation trend has been documented in dietary self-reports among individuals with the illness. Previous studies [22,23] indicated that patients with AN tend to overestimate their energy intake. Hadigan et al. [22] reported that women with AN (n = 30) had a higher caloric intake in their diet history than that measured during the observation period (mean: 1602 vs. 1289 kcal/day; p < 0.05). Similarly, Schebendach et al. [23] found that patients with weight-restored AN (n = 12) overestimated their energy intake by approximately 16% during a controlled meal challenge. Despite the limited generalizability of these findings owing to small sample sizes and laboratory-based observations, overestimation in self-reports could be attributable to the clinical characteristics of individuals with AN, such as body image distortion, perfectionism, cognitive rigidity [24], and impaired interoceptive awareness [25]. These characteristics are known to bias patients’ perceptions of portion size, caloric content, and internal satiety cues [8]. Participating mothers were instructed to ensure that their daughters did not interfere with the patients’ meal preparation. Nevertheless, mother–daughter interactions may have allowed the patients’ biased perceptions to influence the estimation process, possibly contributing to the app’s overestimation of energy intake.
The overestimation of energy intake might partly reflect parents’ limited experience with tablet-based data entry or their low confidence in using the app. Moreover, family dynamics presumably played a role in the observed overestimation of energy intake, particularly through parental behavior. One hypothesis is that a subset of parents is engaged in accommodating eating disorders by modifying their behavior to reduce their child’s distress [26]. Anderson et al. [27] reported that family caregivers’ anxiety was associated with accommodating behaviors (e.g., bending rules to avoid conflict) in patients with eating disorders. In our study, such accommodating behaviors could partly explain the overreporting of food intake out of sympathy, particularly with the intention of hastening hospital discharge. Another hypothesis is that careless dietary logging results from parental indifference toward their child’s care. Caregiver burdens and burnout can lead to emotional disengagement as a coping mechanism. Parks et al. [28] found that avoidance-based coping styles were more likely to be adopted by mothers who described their caregiving experiences negatively. In our study, some parents may have entered the data inaccurately or without vigilance, potentially contributing to an unintentional overestimation of food intake.
4.3. Strengths and Limitations
The present study had several limitations. Firstly, the generalizability of our findings may be limited because the sample consisted exclusively of inpatients with AN undergoing weight restoration at a university hospital. Therefore, our findings may not extend to males, outpatients, patients treated in other clinical settings, or individuals at different stages of illness. Nevertheless, the clinical homogeneity of this sample allowed for a more controlled comparison between the app-based estimates and reference method. Secondly, the reference method relies on the visual estimation of meal photographs by an experienced dietitian. Although this approach has been widely used in dietary assessments [17], it is not the gold standard and is inherently subject to measurement errors, particularly in the estimation of portion size and food content. Such errors may be influenced by photographic conditions, such as angle, distance, and lighting. Therefore, the differences observed between the app and reference methods do not provide definitive evidence of systematic overestimation or underestimation by either method. Thirdly, the estimation of micronutrient intake from meal photographs was uncertain. Visual assessment cannot reliably capture vitamin and mineral contents without weight records or chemical composition data, even when performed by an experienced dietitian. Therefore, the micronutrient results should be considered exploratory. Fourthly, inter-rater reliability was not assessed as the reference evaluation was conducted by an experienced dietitian, which may limit the generalizability of the reference measurements. Fifthly, dietary intake was assessed for only a single day, which did not account for habitual intake or day-to-day variability within individuals [29]. This may limit the generalizability of intake values and their applicability to long-term dietary assessments. However, this study aimed to examine the agreement between the two assessment methods rather than to estimate the usual intake. If both methods assessed the same meals under comparable conditions, a single-day record was considered appropriate. Finally, the app occasionally misclassified dish types or portion sizes, leading to a substantial overestimation for a small number of participants. Nevertheless, sensitivity analysis that excluded these outliers demonstrated that the main findings remained robust.
5. Conclusions
This study provides preliminary evidence that AI-based dietary assessment can serve as an alternative to expert evaluation for the clinical management of young patients with AN. The overall level of agreement suggests that the app has potential clinical utility when used with appropriate attention to portion size inputs and dish selection. Further validation studies in broader clinical settings as well as refinement of the assessment system are needed.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Silén Y. Keski-Rahkonen A. Worldwide prevalence of DSM-5 eating disorders among young people Curr. Opin. Psychiatry 20223536237110.1097/YCO.000000000000081836125216 · doi ↗ · pubmed ↗
- 2Charrat J.P. Massoubre C. Germain N. Gay A. Galusca B. Systematic review of prospective studies assessing risk factors to predict anorexia nervosa onset J. Eat. Disord.20231116310.1186/s 40337-023-00882-037730675 PMC 10510169 · doi ↗ · pubmed ↗
- 3Galmiche M. Déchelotte P. Lambert G. Tavolacci M.P. Prevalence of eating disorders over the 2000–2018 period: A systematic literature review Am. J. Clin. Nutr.20191091402141310.1093/ajcn/nqy 34231051507 · doi ↗ · pubmed ↗
- 4GüzelÂ. Mutlu N.L. Molendijk M. COVID-19-related changes in eating disorder pathology, emotional and binge eating and need for care: A systematic review with frequentist and Bayesian meta-analyses Eat. Weight Disord.-Stud. Anorex. Bulim. Obes.2023281910.1007/s 40519-023-01547-2PMC 994124236805344 · doi ↗ · pubmed ↗
- 5Arcelus J. Mitchell A.J. Wales J. Nielsen S. Mortality rates in patients with anorexia nervosa and other eating disorders. A meta-analysis of 36 studies Arch. Gen. Psychiatry 20116872473110.1001/archgenpsychiatry.2011.7421727255 · doi ↗ · pubmed ↗
- 6Neumayr C. Voderholzer U. Tregarthen J. Schlegl S. Improving aftercare with technology for anorexia nervosa after intensive inpatient treatment: A pilot randomized controlled trial with a therapist-guided smartphone app Int. J. Eat. Disord.2019521191120110.1002/eat.2315231429974 · doi ↗ · pubmed ↗
- 7Fichter M.M. Quadflieg N. Nisslmüller K. Lindner S. Osen B. Huber T. Wünsch-Leiteritz W. Does internet-based prevention reduce the risk of relapse for anorexia nervosa?Behav. Res. Ther.20125018019010.1016/j.brat.2011.12.00322317754 · doi ↗ · pubmed ↗
- 8Milos G. Kuenzli C. Soelch C.M. Schumacher S. Moergeli H. Mueller-Pfeiffer C. How much should I eat? Estimation of meal portions in anorexia nervosa Appetite 201363424710.1016/j.appet.2012.12.01623276722 · doi ↗ · pubmed ↗
