Impact of lifestyle behaviors on the development of lifestyle diseases: A retrospective cohort study
Takafumi Okawa, Hikaru Negishi, Yuki Aoki, Mitsuo Uchida, Yumi Sato, Mai Ishikawa, Rie Matsui, Kaori Hotta, Takayuki Saitoh, Mulu Tiruneh, Mulu Tiruneh, Mulu Tiruneh, Mulu Tiruneh

TL;DR
This study shows that lifestyle behaviors like weight gain and smoking are strongly linked to the development of lifestyle diseases such as metabolic syndrome and hypertension.
Contribution
The study identifies specific lifestyle behaviors with the strongest longitudinal associations to lifestyle diseases using a large cohort.
Findings
Weight gain of ≥10 kg since age 20 showed the strongest association with metabolic syndrome (OR: 2.01).
Smoking and weight gain were common risk factors for metabolic syndrome, hypertension, and dyslipidemia.
Lifestyle behaviors assessed via questionnaires can predict future health risks effectively.
Abstract
Questionnaires are used to collect data on lifestyle behaviors during specific health checkups; however, the results cannot conclusively determine whether the behaviors influence the onset of lifestyle diseases. By analyzing data from a retrospective cohort, this study aimed to determine the specific lifestyle behaviors that most strongly contribute to the onset of lifestyle diseases, such as metabolic syndrome, hypertension, diabetes, and dyslipidemia. We administrated the data of 924,932 individuals insured under Gunma Prefecture’s National Health Insurance who underwent specific health checkups between 2011 and 2016. A logistic regression analysis was conducted to assess the association between the responses to 10 lifestyle questions and the future onset of lifestyle diseases. We examined 47,803 individuals who were not identified with lifestyle disorders at the initial checkup. In…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Fig 1- —http://dx.doi.org/10.13039/501100002241Japan Science and Technology Agency
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDietary Effects on Health · Nutritional Studies and Diet · Health Promotion and Cardiovascular Prevention
Introduction
Aging has recently become a prevalent issue in developed countries. Japan, particularly, is one of the leading super-aged societies worldwide. The increase in non-communicable diseases (NCDs) with the aging population has led to a rise in national healthcare expenditures, making cost containment a critical public health priority [1–3]. Specific health checkups (SHC) have emerged as initiatives aimed at curbing the rising national healthcare costs in Japan’s aging society [4].
The SHC program provides health screening and promotion services tailored to insured individuals aged 40–74 years and their dependents. Its primary goal is to prevent bedridden conditions in older adults through the early detection and intervention of metabolic syndrome (MetS). The SHC is a unique initiative in Japan, established by the Ministry of Health, Labour and Welfare (MHLW) in 2008 under the “Act on Assurance of Medical Care for Elderly People” [4]. Currently, approximately 25.4 million beneficiaries are covered by the municipal national health insurance, representing a significant effort to promote the health of older adults at the municipal level [5]. Notably, the MHLW’s “Guide to Creating a Data Health Plan, 3rd Revised Edition” emphasizes that utilizing SHC data can facilitate early detection of lifestyle diseases, thereby contributing to the reduction of national medical care expenditures [6]. To date, health promotion initiatives, utilizing the SHC data, are currently being implemented in Shizuoka and Osaka Prefectures [7,8].
The SHC utilizes the use of anthropometric measurements, laboratory values, and self-administered questionnaires designed to assess lifestyle behaviors. These questionnaires conform to the “standard questionnaire” established by the MHLW [9]. However, the evidence derived from the questionnaire results was not consistent in predicting the onset of lifestyle diseases [10–14]. For example, some studies highlight a correlation between skipping breakfast and the onset of diabetes [15,16], whereas others refute this connection [17]. This inconsistency has resulted in the inadequate utilization of data for formulating health management policies.
The discrepancy in conclusions stems from several factors, such as the inclusion of individuals with positive results, which could affect the accurate assessment of the effects of lifestyle behaviors [18], inadequate long-term evaluations [17], and insufficient consideration of the interaction effects of lifestyle behaviors on the onset of lifestyle diseases. Addressing these issues through the use of adequate sample size and the conduct of a longitudinal analysis could lead to more consistent conclusions regarding this relationship.
Using SHC data collected over 10 years from the National Health Insurance (NHI) database, our retrospective cohort study aimed to investigate lifestyle behaviors that are genuinely associated with the onset of lifestyle diseases such as MetS and its components such as hypertension, diabetes, and dyslipidemia.
Materials and methods
Database
We used SHC data from the Gunma Prefecture’s NHI database, covering a 10-year period from 2011 to 2020. The Gunma Prefecture has a population of approximately 2 million, with about 20% of individuals’ insurance data updated annually in the NHI database.
Study population
The selection process for the study population is outlined in Fig 1. The study included 924,932 total counts of individuals insured under Gunma Prefecture’s NHI who underwent SHC between 2011 and 2016 (150,610 in 2011, 154,579 in 2012, 156,454 in 2013, 156,531 in 2014, 155,690 in 2015, and 151,068 in 2016). The onset of lifestyle diseases was assessed 4 years after the initial checkup, based on previous research [11]. To ensure the suitability of the study samples, individuals with missing health checkup data 4 years post-initial checkup (N = 447,511); whose presence or absence of lifestyle diseases were not determined (N = 3,470); who were identified with MetS, diabetes, hypertension, or dyslipidemia at the initial checkup (N = 363,453); who reported a history of stroke, heart disease, or chronic kidney failure in the self-administered questionnaire during the initial health checkup (N = 3,946); and who underwent multiple checkups during the study period (N = 58,749) were excluded. Hence, 47,803 participants were included in the final analysis.
Study population.The selection process for the study population. Abbreviations: NHI, National Health Insurance; SHC, Specific health checkups; MetS, metabolic syndrome.
Variables
The following data were extracted from the SHC records: personal information (sex, date of birth, and date of SHC); anthropometric measurements (body mass index [BMI] and waist circumference [WC]); laboratory values (systolic blood pressure [SBP], diastolic blood pressure [DBP], hemoglobin A1c [HbA1c], triglyceride [TG], high-density lipoprotein cholesterol [HDL-C], low-density lipoprotein cholesterol [LDL-C], aspartate aminotransferase [AST], alanine aminotransferase [ALT], and γ-glutamyl transpeptidase [γ-GTP] levels). The results of self-administered questionnaire survey on lifestyle behaviors (smoking status, weight change, exercise habits, eating habits, drinking habits, and sleep habits), medications used (for hypertension, dyslipidemia, and diabetes), and disease history (stroke, heart disease, and kidney disease) were also extracted. Age, BMI, and laboratory values were categorized according to the medical guidelines, as detailed in S1 Table. Data on lifestyle behaviors at the initial checkup were extracted from the results of the “standard questionnaire” provided by the MHLW. From this questionnaire, 11 questions that have consistently provided valid responses over the years were selected (as shown in S2 Table). The presence or absence of lifestyle behavior risks is also presented in S2 Table. With regard to drinking habits, the presence or absence of risk was determined based on the frequency of drinking (Question 9) and amount of alcohol consumed (Question 10), following the criteria used in previous studies [19]. Individuals who reported drinking “every day” or “sometimes” and consuming “180 mL or more” of alcohol were classified as having a risk. Conversely, those who drank “hardly ever” or consumed “less than 180 mL” were classified as not having a risk. A serving of 180 mL of refined saké (rice wine) is equivalent to a medium bottle of beer (500 mL), 110 mL of shochu (25% alcohol content), a double shot of whiskey (60 mL), or two glasses of wine (240 mL).
In this study, we selected MetS and its components—hypertension, diabetes, and dyslipidemia—to investigate their association with lifestyle behaviors. MetS was defined according to the criteria of the National Health and Nutrition Survey (NHNS) [20], which requires the presence of visceral fat accumulation (WC ≥ 85 cm in men and ≥ 90 cm in women) and two or more of the following criteria: 1) an SBP > 130 mmHg, DBP of >85 mmHg, or the use of antihypertensive drugs; 2) an HDL-C level <40 mg/dL or the use of antihyperlipidemic drugs; and 3) an HbA1c level ≥6.0% or the use of antidiabetic drugs. Hypertension was defined as an SBP ≥ 140 mmHg, a DBP ≥ 90 mmHg, or the use of antihypertensive drugs. Diabetes was defined as an HbA1c level ≥6.5% or the use of antidiabetic drugs. Dyslipidemia was defined as a TG level ≥150 mg/dL, an HDL-C level <40 mg/dL, an LDL-C level ≥140 mg/dL, or the use of antihyperlipidemic drugs. Individuals who did not meet any of the above criteria were considered healthy.
Statistical analysis
Chi-square tests were performed to compare the participant characteristics. The odds ratios (OR) and corresponding 95% confidence intervals (CI) for the onset of lifestyle diseases were calculated using a logistic regression model. Initially, a simple regression model was employed, using 10 lifestyle behaviors as independent variables and MetS, hypertension, diabetes, and dyslipidemia as dependent variables. Two adjusted models were subsequently employed. Model 1 included the 10 lifestyle behaviors as independent variables and was adjusted for sex and age as covariates. Model 2 was adjusted for Model 1 and addition of BMI and laboratory tests as covariates. Correlation coefficients and variance inflation factors (VIFs) were calculated for each variable to assess multicollinearity among the independent variables, confirming that all VIF values were below 5. All data were analyzed using Python 3.10 and a p value of <0.05 was considered significant.
Ethical considerations
The study was approved by the Ethics Committee for Research Involving Humans of the Gunma University Faculty of Medical, Gunma, Japan (HS2023−137). This study involved a secondary analysis of data from the existing Kokuho database. As the data had already been anonymized at the time of acquisition, informed consent was not required.
Results
The characteristics of participants who developed lifestyle diseases and those who did not are presented in Table 1. The BMI and laboratory values were compared between the participants who developed lifestyle diseases and those who did not (S3 Table). The prevalence of MetS 4 years after the initial health checkup was 4.8%. Individuals with MetS were more likely to engage in risky lifestyle behaviors, such as smoking, weight gain ≥10 kg since the age of 20 years, decreased physical activity, slow walking, fast eating, eating before bedtime, skipping breakfast, and problematic drinking habits. Conversely, they were less likely to exhibit adopting poor sleeping habits. The prevalence of hypertension 4 years after the initial health checkup was 26.1%. Individuals with hypertension were more likely to exhibit risky lifestyle behaviors such as smoking, weight change ≥10 kg since the age of 20 years, eating before bedtime, and problematic drinking habits. Conversely, they were less likely to exhibit risky behaviors, including the lack of regular exercise, decreased physical activity, skipping breakfast, and adopting poor sleeping habits. The prevalence of diabetes 4 years after the initial health checkup was 2.5%. Individuals with diabetes were more likely to exhibit risky lifestyle behaviors such as smoking, weight change ≥10 kg since the age of 20 years, and fast eating. Conversely, they were less likely to engage in risk behaviors, such as the lack of regular exercise, skipping breakfast, and adopting poor sleeping habits. The prevalence of dyslipidemia 4 years after the initial health checkup was 33.2%. Individuals with dyslipidemia were more likely to exhibit risky lifestyle behaviors, such as smoking, weight change ≥10 kg since the age of 20 years, the lack of regular exercise, decreased physical activity, fast eating, skipping breakfast, and adopting poor sleeping habits. Conversely, they were less likely to exhibit eating behaviors before bedtime and adopt problematic drinking habits.
Table 1: Baseline characteristics of individuals who developed and did not develop lifestyle diseases.
Table 2 presents the relationship between lifestyle behaviors and lifestyle diseases, determined using a logistic regression analysis. The results of the simple regression model and Model 1 are presented in Table 2. The results of Model 2, adjusted for sex, age, BMI, and laboratory values, are presented below (see Supplementary S1 Fig). MetS demonstrated positive associations with smoking, weight gain ≥10 kg since the age of 20 years, slow walking, and fast eating (Table 2A). The highest OR was observed for weight gain ≥10 kg since the age of 20 years (OR: 2.01, CI: 1.79–2.25). Hypertension was positively associated with smoking, weight gain ≥10 kg since the age of 20 years, the lack of regular exercise, slow walking, and problematic drinking habits (Table 2B). Among these, the highest OR was observed for problematic drinking habits (OR: 1.22, CI: 1.15–1.30). Diabetes was positively associated with fast eating (OR: 1.17, CI: 1.01–1.36). Conversely, skipping breakfast showed a negative relationship with diabetes (OR: 0.70, CI: 0.51–0.95) (Table 2C). Dyslipidemia was positively associated with smoking, weight gain ≥10 kg since the age of 20 years, skipping breakfast, and insufficient sleep. Conversely, drinking habits showed a negative association with dyslipidemia (OR: 0.90, CI: 0.85–0.95) (Table 2D).
Table 2: Correlation between lifestyle behaviors and lifestyle diseases determined using a logistic regression model.
Discussion
This retrospective study analyzed 10 years of accumulated health checkup data to examine lifestyle behaviors associated with the development of MetS and its components—hypertension, diabetes, and dyslipidemia. To our knowledge, this study is the first to assess the impact of lifestyle behaviors on the development of lifestyle diseases using such an extensive dataset, which enhances the statistical power and reliability of the findings. The assessment accurately calculated the positive measurement test results and accounted for the interaction effects of lifestyle behaviors on the onset of lifestyle diseases. New insights were obtained using a two-stage multivariable logistic regression model, where Model 1 and Model 2 were sequentially adjusted to account for potential confounding variables.
First, we identified an association between diabetes and lifestyle behaviors. Smoking [21–23], weight gain of ≥10 kg since the age of 20 years [24,25], the lack of regular exercise [26,27], slow walking [23], fast eating [14,17,23,28,29], and adopting poor sleeping habits [23] have been reported as risk factors for diabetes. Our study showed that fast eating was the sole risk factor in Model 2, while three factors were identified in Model 1 (Table 2C). The remaining two factors, weight gain of ≥10 kg since the age of 20 years and problematic drinking habit, were significantly associated with measurement tests that influence the onset of diabetes and may serve as confounders in this relationship [30,31]. Notably, our study showed that skipping breakfast protects against the onset of diabetes. Previous studies have reported a positive correlation between skipping breakfast and diabetes onset [15,16], whereas others have reported no correlation [17]. Our research suggests a negative correlation between skipping breakfast and diabetes onset, which contradicts those of previous studies and presents new insights into this issue. A previous study has highlighted the importance of considering both energy intake and dietary content [26]. However, the unique dietary habits in the Gunma Prefecture may have influenced this result. According to the NHNS in 2016, despite similar daily energy intake, the residents of Gunma Prefecture did consume significantly more carbohydrates compared with the national average, potentially affecting glucose metabolism and insulin sensitivity. Additionally, meal timing and overall dietary patterns could play a role. A previous study suggest that prolonged fasting intervals may improve metabolic health by enhancing insulin sensitivity and glycemic control [32]. It is possible that individuals who skip breakfast in this population may compensate with well-balanced meals later in the day, mitigating the adverse effects typically associated with breakfast skipping. Nonetheless, alternative explanations should be considered. For example, reverse causation may partly explain this finding—for instance, individuals with preclinical diabetes may have modified their eating habits by skipping breakfast to manage symptoms or weight. In addition, since dietary habits were self-reported, reporting bias may have influenced the observed association. Furthermore, our study was limited to insured adults within Gunma Prefecture, which may restrict the generalizability of our findings. Finally, unmeasured confounders such as sleep patterns, work schedules, and socioeconomic status could also have affected the results. Taken together, these factors highlight the need for cautious interpretation of our findings and suggest that further studies using objective dietary assessments and longitudinal data are warranted to clarify these associations.
Second, we identified an association between MetS and lifestyle behaviors. Previous studies have reported smoking [33], weight gain ≥10 kg since the age of 20 years [34], slow walking [13], fast eating [35,36], skipping breakfast (for men) [37], problematic drinking habits [13,38,39], and adopting poor sleeping habits (for men) [37] as MetS risk factors. Our study showed that smoking, weight gain ≥10 kg since the age of 20 years, slow walking, and problematic drinking habits were risk factors in Model 2, while five factors were identified in Model 1 (Table 2A). The remaining factor, fast eating, might be associated with measurement tests that influence the onset of MetS and previous study have suggested the need for adjustments based on these test items [10].
Third, we identified an association between hypertension and lifestyle behaviors. Previous studies have reported smoking [23,40], weight gain ≥10 kg since the age of 20 years [34], the lack of regular exercise [26], slow walking [23], fast eating [23], problematic drinking habits [39–41], and adopting poor sleeping habits [23] as hypertension risk factors. Our study showed that smoking, weight gain of ≥10 kg since the age of 20 years, the lack of regular exercise, slow walking, and problematic drinking habits were risk factors in Model 2, while six factors were identified in Model 1 (Table 2B). The remaining factor, eating before bedtime, might be associated with measurement tests that influence the onset of hypertension and previous study have suggested the need for adjustments based on these test items [11].
Fourth, we identified an association between dyslipidemia and lifestyle behaviors. Previous studies have reported smoking [42], weight gain ≥10 kg since the age of 20 years [34], skipping breakfast [43], and adopting poor sleeping habits [44] as dyslipidemia risk factors. These lifestyle behaviors were similarly confirmed as risk factors in Model 2, while six factors were identified in Model 1 (Table 2D). The remaining two factors, fast eating and eating before bedtime might be associated with measurement tests that influence the onset of dyslipidemia and previous studies have suggested the need for adjustments based on these test items [11,36]. Additionally, our study showed that drinking habits are protective against the onset of dyslipidemia, which aligns with previous research finding [39].
Thus, we showed that utilizing the results of self-administered questionnaires from SHC is effective in preventing lifestyle diseases. Notably, smoking and weight gain ≥10 kg since the age of 20 years were identified as common risk factors for MetS, hypertension, and dyslipidemia. The findings of this study provide essential evidence to support and strengthen targeted public health interventions, such as smoking cessation programs and weight management strategies based on the Health Promotion Act. Addressing these two factors through such interventions could play a crucial role in disease prevention and risk reduction.
The impact of lifestyle factors such as smoking, weight management, physical activity, and alcohol consumption on cardiovascular disease and life expectancy has been widely recognized across different populations [45,46]. However, cultural and dietary differences may influence these associations. Our study, which adjusted for 10 detailed lifestyle habits, offers a broader perspective than previous studies. Further research in diverse populations is needed to confirm these findings.
This study has several limitations. First, we did not account for any changes in lifestyle behaviors among individuals who underwent SHCs during the 4-year period following the initial health checkup. Longitudinal studies often struggle to incorporate the changes in background factors into their models; therefore, future research incorporating longitudinal tracking of lifestyle behaviors to account for these changes is necessary. Second, the lifestyle assessment data collected using self-administered questionnaires lack objectivity and are subject to potential biases, such as recall bias and social desirability bias. However, this method has been validated in studies by Takeuchi et al. [47] and Fukasawa et al. [19], demonstrating its statistical validity. To mitigate potential biases, future studies should consider complementing self-reported data with objective measures, such as wearable devices for physical activity monitoring. Third, the database used comprised information of individuals insured under the Gunma Prefecture’s NHI. Consequently, it remains unclear whether these findings can be generalized to other regions or insurance groups. Regional characteristics, economic indicators, and the social determinants of health may influence the outcomes. Future studies in diverse populations and healthcare systems are needed to validate these findings and ensure broader applicability.
In conclusion, this study elucidated the time-line changes associations between lifestyle behaviors and MetS, hypertension, diabetes, and dyslipidemia using SHC data accumulated over 10 years. Additionally, the correlations between lifestyle-related diseases and specific lifestyle habits were evaluated. These results suggest that self-administered questionnaires assessing lifestyle behaviors are useful for predicting future health issues. Notably, smoking and weight gain emerged as common risk factors for these four diseases, highlighting the need for targeted interventions to address these issues.
Supporting information
S1 TableCutoff values.(XLSX)
S2 TableList of “standard questionnaire”.(XLSX)
S3 TableBMI, and laboratory values of individuals who developed and did not develop lifestyle diseases.Supplemental data for Table 1.(XLSX)
S1 FigCorrelation between lifestyle behaviors and lifestyle diseases determined using a multivariable logistic regression model (Supplementary to Table 2).The odds ratio (black bars) with corresponding 95% confidence intervals (error bars) is shown for each category of lifestyle diseases. *P < 0.05, **P < 0.01, ***P < 0.001 compared with the reference category.(TIFF)
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Curtis LH, Hammill BG, Bethel MA, Anstrom KJ, Gottdiener JS, Schulman KA. Costs of the metabolic syndrome in elderly individuals: findings from the Cardiovascular Health Study. Diabetes Care. 2007;30(10):2553–8. doi: 10.2337/dc 07-0460 17623825 · doi ↗ · pubmed ↗
- 2Muka T, Imo D, Jaspers L, Colpani V, Chaker L, van der Lee SJ, et al. The global impact of non-communicable diseases on healthcare spending and national income: a systematic review. Eur J Epidemiol. 2015;30(4):251–77. doi: 10.1007/s 10654-014-9984-2 25595318 · doi ↗ · pubmed ↗
- 3Nishida Y, Anzai T, Takahashi K, Kozuma T, Kanda E, Yamauchi K, et al. Multimorbidity patterns in the working age population with the top 10% medical cost from exhaustive insurance claims data of Japan Health Insurance Association. P Lo S One. 2023;18(9):e 0291554. doi: 10.1371/journal.pone.0291554 37768909 PMC 10538783 · doi ↗ · pubmed ↗
- 4Ministry of Health La W. Specific health checkups and specific health guidance [24.10.15]. Available from: https://www.mhlw.go.jp/stf/seisakunitsuite/bunya/kenkou_iryou/kenkou/seikatsu/index.html
- 5Ministry of Health La W. Data on specific health checkups and specific health guidance [2024.10.15]. Available from: https://www.mhlw.go.jp/stf/newpage_03092.html
- 6Ministry of Health La W. Guide to Creating a Data Health Plan 3rd revised Edition [2024.10.15]. Available from: https://www.mhlw.go.jp/stf/seisakunitsuite/bunya/0000061273.html
- 7Nakatani E, Tabara Y, Sato Y, Tsuchiya A, Miyachi Y. Data Resource Profile of Shizuoka Kokuho Database (SKDB) Using Integrated Health- and Care-insurance Claims and Health Checkups: The Shizuoka Study. J Epidemiol. 2022;32(8):391–400. doi: 10.2188/jea.JE 20200480 33518592 PMC 9263618 · doi ↗ · pubmed ↗
- 8Seto H, Oyama A, Kitora S, Toki H, Yamamoto R, Kotoku J, et al. Gradient boosting decision tree becomes more reliable than logistic regression in predicting probability for diabetes with big data. Sci Rep. 2022;12(1):15889. doi: 10.1038/s 41598-022-20149-z 36220875 PMC 9553945 · doi ↗ · pubmed ↗
