Body Mass Index and Spinopelvic Alignment as Predictors of Incident Knee Osteoarthritis: An 8-Year Longitudinal Study from the TOEI Cohort of Older Japanese Women
Yuki Murakami, Mitsuru Hanada, Kazuki Nomoto, Kensuke Hotta, Yuki Yamagishi, Yukihiro Matsuyama

TL;DR
This study finds that higher body mass index and certain spinal alignment issues predict the development of knee osteoarthritis in older Japanese women over 8 years.
Contribution
The study identifies BMI and spinopelvic alignment as novel predictors of incident knee osteoarthritis in older women.
Findings
Higher BMI was the strongest predictor of incident knee osteoarthritis.
Pelvic tilt and PI–LL mismatch showed limited standalone prediction but improved accuracy when combined with BMI.
Results suggest weight management and spinopelvic parameters can refine risk stratification for knee osteoarthritis.
Abstract
Background/Objectives: Knee osteoarthritis (KOA) is multifactorial, and longitudinal evidence isolating early predictors remains limited. We investigated predictors of incident KOA in community-dwelling older adult Japanese women. Methods: We analyzed 191 knees from 105 women aged ≥50 years (baseline Kellgren–Lawrence (KL) grade 0–1) and followed them for 8 years. Incident KOA was defined as KL ≥ 2 at the 8-year follow-up. Baseline measures included body mass index (BMI), physical function (one-leg stance, functional reach), Geriatric Locomotive Function Scale (GLFS-25), EuroQol 5-Dimension (EQ-5D) questionnaire, standing lateral whole-spine radiographs (sagittal spinopelvic parameters), and standing full-length anteroposterior (AP) lower-limb radiographs (coronal alignment parameters). Incident KOA was defined as KL ≥ 2 at follow-up. Group comparisons, multivariable logistic…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLower Extremity Biomechanics and Pathologies · Osteoarthritis Treatment and Mechanisms · Hip disorders and treatments
1. Introduction
Knee osteoarthritis (KOA) is a leading cause of pain, disability, and reduced quality of life (QOL) in older adults, particularly women [1,2,3]. With the increase in global aging, the burden of KOA is expected to rise dramatically, nearly doubling by 2050 [4,5]. Therefore, identifying modifiable risk factors—ideally prior to disease onset to enable primary prevention and risk-stratified screening—is essential for early intervention and disease prevention.
Previous epidemiological studies identified several risk factors for KOA, including older age, higher body mass index (BMI), female sex, history of knee injury, and impaired musculoskeletal function [6,7,8,9,10]. However, many reports have examined mixed outcomes, often conflating incident KOA with disease progression; focusing on incident KOA may help clarify the highly modifiable targets most relevant for prevention. Recently, attention has shifted toward biomechanical factors beyond the knee joint. Notably, spinal and pelvic alignment may influence knee joint loading [11,12,13]. For example, anterior trunk tilt and decreased lumbar lordosis may increase mechanical stress on the knees and contribute to the development of KOA [14,15,16]. In Asian populations, varus alignments of the proximal tibia may increase medial compartment loading and elevate the risk for KOA [17,18].
However, whether factors such as musculoskeletal decline and spinal alignment abnormalities contribute to incident KOA, or instead reflect secondary changes accompanying KOA progression, remains unclear. Additionally, these factors are frequently interrelated, complicating the isolation of their individual effects. Clarifying their predictive value requires longitudinal studies in individuals without advanced KOA. However, such cohort studies are limited. In particular, longitudinal studies controlling for sex and injury history are lacking.
To minimize the confounding effect of sex differences in joint morphology and KOA risk, this study focused exclusively on women. Therefore, this study aimed to identify the predictors of incident KOA through an 8-year longitudinal analysis of community-dwelling women aged ≥50 years in rural Japan.
2. Materials and Methods
This study was approved by our institutional ethics committee. Written informed consent was obtained from all participants, and the study adhered to the principles of the Declaration of Helsinki (1964).
This study was part of the TOEI study, a community health project conducted in Toei Town, Aichi, Japan. This ongoing biennial survey, initiated in 2012, targets healthy participants aged ≥50 years and includes physical function tests and radiographic evaluations. Participants were residents who were recruited through Toei Hospital or public bulletins. Reports from the TOEI cohort have been published previously [18,19,20].
Of the 225 individuals who participated in the 2012 and 2020 surveys, 148 women were included in the present analysis. The inclusion criteria were age ≥ 50 years and no history of spinal surgery. The exclusion criteria were incomplete questionnaire data (3 participants, 6 knees) and poor-quality radiographs or history of lower-limb surgery (8 participants, 15 knees). A total of 11 participants (21 knees) were excluded. Formal power analysis was not performed owing to the fixed cohort size.
The final analysis included 138 participants (275 knees), of which 191 knees from 105 participants were classified as Kellgren–Lawrence (KL) grade 0 or 1 in 2012 and included in the analysis. In cases where only one knee met this condition, the contralateral knee was classified as KL grade 2.
All participants provided their demographic data in 2012, including age, height, weight, and BMI, and underwent the following assessments. Data regarding occupational and sports history were not collected (a limitation of this study).
Physical Function Evaluation
Physical function was evaluated using the one-leg standing time (with eyes open) and the Functional Reach Test (FRT). For the one-leg standing test, each leg was tested twice, and the longest standing time (up to 60 s) was recorded. For the FRT, participants extended one arm forward and leaned their upper body forward without moving their feet, and the maximum distance reached by the fingertips was measured in two trials. Although knee symptoms were not formally assessed, all participants completed the functional tests without difficulty.
Questionnaires
Musculoskeletal function was evaluated using the self-administered 25-item Geriatric Locomotive Function Scale (GLFS-25) and the EuroQol 5-Dimension (EQ-5D) questionnaire. The GLFS-25 contains 25 items addressing mobility, activities of daily living, social function, and mental health domains, each scored 0–4 (total score 0–100, with higher scores indicating worse dysfunction) [21,22]. QOL was assessed using the EQ-5D, a standardized health status measure [23]. Higher GLFS-25 and lower EQ-5D index scores indicate greater musculoskeletal impairment.
Radiographic Evaluation
Full-spine and full-length lower-limb radiographs were obtained. Standing lateral whole-spine radiographs were obtained with participants looking straight ahead and hands on the clavicles. Standing full-length anteroposterior (AP) lower-limb radiographs were acquired with the knees fully extended and the patellae facing forward. The X-ray tube was positioned 1.5 m from the participant. Images were analyzed using Surgimap Spine software (version 2.3.2.1; Nemaris, New York, NY, USA).
The radiographic parameters (see Figure 1) measured in this study were as follows:
Spinal and pelvic parameters:
- Sagittal vertical axis (SVA): The horizontal distance between the vertical plumb line dropped from the center of the C7 vertebral body and the posterosuperior corner of the sacrum.
- Thoracic kyphosis (TK): The angle between the superior endplate of T1 and the inferior endplate of T12.
- Lumbar lordosis (LL): The angle between the superior endplate of L1 and the superior endplate of S1.
- Sacral slope (SS): The angle between the superior endplate of S1 and a horizontal reference line.
- Pelvic tilt (PT): The angle between the line connecting the midpoint of the S1 superior endplate to the femoral head center and a vertical reference line
- Pelvic incidence (PI): The angle between a line perpendicular to the S1 superior endplate at its midpoint and the line connecting this midpoint to the femoral head center. By definition, PI = SS + PT.
- PI–LL mismatch: Difference between PI and LL (i.e., PI minus LL).
- Femoral inclination (FI): The angle between the femoral axis and the vertical reference line. Mean values from both sides were used.
Evaluated radiographic parameters. Abbreviations: SVA, sagittal vertical axis; TK, thoracic kyphosis; LL, lumbar lordosis; SS, sacral slope; PT, pelvic tilt; PI, pelvic incidence; PI–LL, PI minus LL; FI, femoral inclination; LDFA, lateral distal femoral angle; MPTA, medial proximal tibial angle; HKA, hip–knee–ankle angle; FCI, femoral condyle inclination; TPI, tibial plateau inclination.
Lower limb parameters:
- Lateral distal femoral angle (LDFA): The lateral angle between the femoral mechanical axis and distal joint line.
- Medial proximal tibial angle (MPTA): The medial angle between the tibial mechanical axis and proximal joint line.
- Arithmetic hip–knee–ankle angle (aHKA): Defined as MPTA minus LDFA, representing overall lower-limb alignment. Negative values indicate varus alignment.
- Joint line obliquity (JLO): Calculated as the sum of the MPTA and LDFA, reflecting the inclination of the joint line relative to the floor in a bipedal standing position. Values less than 180° indicate the apex distal configuration.
- Hip-knee-ankle angle (HKA): The angle between the mechanical axes of the femur and tibia. A value of 0° represents neutral alignment, and positive values indicate varus alignment.
- Joint line convergence angle (JLCA): The angle formed by the intersection of the distal femoral and proximal tibial joint lines. Positive values indicate a lateral opening.
- Femoral condyle inclination (FCI): The inclination of the femoral joint surface relative to the horizontal plane. Negative values indicate lateral distal inclination.
- Tibial plateau inclination (TPI): inclination of the tibial joint surface relative to the horizontal plane. Negative values indicate medial-distal inclination.
Radiographic analyses were conducted by two board-certified orthopedic surgeons. The intraobserver and interobserver reliability for all alignment parameters and KL grade was high with intraclass correlation coefficients (ICCs) >0.75 (range: 0.792–0.988).
Group classification and analysis
Participants with KOA classified as KL grades 0 or 1 in 2012 were included in this study. Those with knees progressing to KL grade ≥2 by 2020 were classified as incident KOA (I group), while those remaining at KL grade 0 or 1 were classified as non-incident KOA (NI group). Mean values of each outcome measure were compared between groups I and NI. Multivariate analysis was performed to identify potential predictors and assess multicollinearity among the outcome measures. For the predictors that remained significant in the multivariate analysis, the optimal cut-off points were determined, and the corresponding cut-off values were calculated.
Statistical Analysis
Continuous variables are expressed as mean ± standard deviation, and t-tests were used for group comparisons. To identify the predictive factors, multivariate analysis was performed using binary logistic regression. Stepwise selection was performed using the Wald test, with a significance level of p < 0.01. Receiver operating characteristic (ROC) curve analysis was used to assess diagnostic accuracy. Collinearity was assessed using variance inflation factors (VIFs), and Spearman’s rank correlation was used for correlation analysis. Scatter plots were generated for each variable identified as an independent predictor, with predicted probabilities from the logistic regression model overlaid. A p-value <0.05 was considered statistically significant. To enhance readability, in-text statistics are typically rounded to two decimals, whereas tables report values to four decimals. All performance indices and hypothesis tests were computed using full-precision estimates.
Internal validation and performance
Harrell’s bootstrap optimism correction (1000 resamples) was used to estimate and remove overfitting bias (“optimism”) from apparent performance. We report the Brier score (mean squared error of predicted probabilities) and the Brier Skill Score (BSS = 1 − Brier_model/Brier_null), which reflects improvement over a null model that predicts the cohort incidence only. Calibration was summarized by the slope (ideal ≈ 1) and intercept (ideal ≈ 0) from regressing the outcome on the model’s logit predictions.
All computations for statistical testing, data processing, and figure generation were performed in MATLAB R2025a (MathWorks, Natick, MA, USA), employing built-in functions and add-on toolboxes.
3. Results
The mean age of the participants was 69.3 ± 6.1 years (52–82). Of 191 knees, 133 were non-incident (NI group) and 58 incident (I group). No cases of medial femoral condyle osteonecrosis or rheumatoid arthritis were observed in the I group. Incident knees showed higher weight and BMI (23.8 vs. 21.1 kg/m^2^), higher GLFS-25 scores and lower EQ-5D index values, greater PT and PI–LL mismatch (11.5° vs. 5.3°), and smaller MPTA and JLO (Table 1).
A binary logistic regression analysis was conducted using age, BMI, GLFS-25, EQ-5D, SVA, TK, SS, PT, PI–LL mismatch, MPTA, and LDFA as explanatory variables. BMI and PI–LL mismatch were selected as significant predictors of incident KOA, and the following base model was constructed:
With an interaction term, the extended model was
(Table 2).
BMI correlated modestly with MPTA and showed the lowest VIF. PI–LL mismatch correlated with multiple sagittal parameters (SVA, TK, SS, PT) and had the highest VIF (Figure 2 and Figure 3).
In ROC analyses, the AUCs of BMI alone, PI–LL mismatch alone, and the combined model (BMI + PI–LL mismatch) were 0.753, 0.596, and 0.803, respectively (Table 3, Figure 4). The ROC-derived BMI cutoff was 22.07 kg/m^2^. The interaction model’s decision boundary was −0.4066 < −11.445 + 0.45543 × BMI + 0.28648 × PI–LL mismatch − 0.011004 × BMI × PI–LL mismatch.
A BMI–PI–LL mismatch heat map visually depicts the lower risk at jointly low values (Figure 5).
Within the multivariable framework, BMI remained the strongest single predictor of incident KOA. Adding a BMI × PI–LL mismatch interaction modestly improved discrimination over the BMI-only model (AUC 0.803 vs. 0.753; Table 3, Figure 4). The predicted-risk surface showed the lowest risk, where both BMI and PI–LL mismatch were low, and higher risk along two complementary bands—(i) higher PI–LL mismatch at lower BMI and (ii) higher BMI at lower PI–LL mismatch—consistent with a negative BMI × PI–LL mismatch interaction (β < 0). The upper-right region (high BMI and high PI–LL mismatch) was sparsely populated and did not exhibit the highest predicted risk (Figure 5). Thus, PI–LL mismatch operates as a complementary risk marker that improves risk stratification beyond BMI.
To address potential overfitting, we applied Harrell’s bootstrap optimism correction (1000 resamples) to the final interaction model. After correction, performance remained robust: AUC = 0.792; Brier score = 0.166; Brier Skill Score = 0.221; calibration slope = 0.925; calibration intercept = 4.586 × 10^−4^. These bootstrap-corrected metrics support the stability of the BMI × PI–LL interaction and reinforce its incremental value for risk stratification beyond BMI alone.
4. Discussion
This 8-year cohort study identified BMI as the strongest single predictor of incident KOA, with PI–LL mismatch contributing complementary value despite limited standalone discrimination. To reduce the confounding influence of two well-known risk factors—sex and prior joint injury [7,9]—only women were included, and participants with a history of lower limb surgery were excluded. Univariate differences (BMI, GLFS-25, EQ-5D, PT, PI–LL, MPTA, JLO) aligned with those of prior literature [6,7,10,17,24], whereas age was not a significant predictor, possibly reflecting KOA onset in midlife [25].
BMI is a well-established risk factor for KOA [6,7,10,24,26,27]. Even within a relatively lean cohort (mean BMI 21.9 kg/m^2^) [28], BMI retained strong predictive value, with a ROC-derived cutoff of 22.07 kg/m^2^. This heightened susceptibility to load-related stress may be attributed not only to the relatively advanced age of the participants but also to population-specific skeletal morphology and culturally specific lifestyle habits, such as frequent squatting and kneeling, which may predispose Japanese individuals to KOA compared with populations in other countries [17,29,30].
Prior studies have shown that greater PI–LL mismatch is observed in more advanced KOA and is linked to compensatory knee flexion [13,14,16,20,31]. In contrast, our cohort comprised knees with KL 0–1 at baseline and did not exhibit sagittal malalignment that was severe enough to necessitate compensation, indicating a different clinical context. Within this early-stage setting, PI–LL alone showed limited discrimination (AUC 0.596) and correlated with multiple sagittal parameters with the highest VIF, suggesting shared variance with global alignment. Thus, we interpreted these results to indicate that PI–LL is not a strong solo predictor or a direct interventional target. Rather, PI–LL serves as a complementary marker that improves risk stratification when considered in conjunction with BMI.
In keeping with this early-stage context, femoral inclination—a proxy for compensatory knee flexion—was not significantly associated with incident KOA in our data, implying that sagittal imbalance had not yet translated into measurable lower-limb compensation at baseline. Conceptually, sagittal spinopelvic malalignment may begin with a reduction in lumbar lordosis [32]; accordingly, monitoring PI–LL may facilitate early detection of emerging global sagittal imbalance and potential increases in knee joint loading.
Incorporating a BMI × PI–LL interaction improved discrimination versus the BMI-only model (AUC 0.803 vs. 0.753; Table 3, Figure 4). However, the risk surface did not peak where both BMI and PI–LL were high. Instead, risk increased along complementary patterns—higher PI–LL at lower BMI and higher BMI at lower PI–LL—consistent with a negative (antagonistic) interaction on the logit scale (β for BMI × PI–LL = −0.011004), whereby the effect of one predictor attenuates as the other increases.
Clinically, these results suggest that primary prevention should prioritize weight management [26]. Spinopelvic assessment (by PI–LL) can refine BMI-based screening, highlighting risk among lean individuals with sagittal malalignment and among individuals with higher BMI even when PI–LL is low. Trunk and lower-limb muscle weakness may contribute to PI–LL mismatch [33,34], and targeted exercise may improve spinopelvic alignment [34]. Whether modifying these features reduces incident KOA, however, warrants prospective evaluation.
Proximal tibial varus alignment has been implicated as an important biomechanical factor in KOA development [17,35]. In our cohort, the between-group difference in MPTA was <1°, which may reflect a selection effect whereby restricting enrollment to baseline early KOA (KL 0–1) probably underrepresented advanced cases with smaller MPTA, thereby attenuating the apparent contrast. The weak correlation of MPTA with BMI suggests that chronic load may contribute to varus remodeling. However, the clinical impact at baseline appeared small in our study population.
Although this study provides important evidence on predictors of incident KOA in older Japanese women, several limitations warrant mention. First, the two-wave design (2012 and 2020) prevents precise ascertainment of onset, so estimates may be conservative. Second, selection and attrition (e.g., exclusions due to image quality or incomplete questionnaires, and loss to follow-up) could introduce bias; detailed flow is provided, but residual selection bias cannot be excluded. Third, radiographic alignment measures (e.g., PI–LL) are subject to posture-related measurement error, which likely attenuates effects. Fourth, while BMI emerged as a dominant predictor, limited body-composition and muscle-function data constrained our ability to disentangle adiposity from sarcopenic components. Fifth, generalizability is limited to community-dwelling Japanese women aged ≥50 years in a rural setting; external validation is needed. Finally, although logistic regression does not assume normality of predictors, unmeasured or residual confounding (e.g., baseline knee symptoms, lifestyle and activity patterns, comorbidities) may still influence the estimates.
In summary, BMI primarily drives incident KOA risk, and PI–LL complements BMI to improve discrimination, supporting prevention strategies that prioritize weight while using spinopelvic metrics for risk refinement.
5. Conclusions
BMI was the primary predictor of incident KOA, even in a relatively lean population. PI–LL mismatch alone had limited discrimination but complemented BMI to improve prediction, consistent with sagittal-alignment-related mechanical stress. Prevention should prioritize weight management, using spinopelvic parameters to refine individual risk.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Hunter D.J. Bierma-Zeinstra S. Osteoarthritis Lancet 20193931745175910.1016/S 0140-6736(19)30417-931034380 · doi ↗ · pubmed ↗
- 2Li E. Tan J. Xu K. Pan Y. Xu P. Global burden and socioeconomic impact of knee osteoarthritis: A comprehensive analysis Front. Med.202411132309110.3389/fmed.2024.1323091 PMC 1113724238818397 · doi ↗ · pubmed ↗
- 3Summanen M. Ukkola-Vuoti L. Kurki S. Tuominen S. Madanat R. The burden of hip and knee osteoarthritis in Finnish occupational healthcare BMC Musculoskelet. Disord.20212250110.1186/s 12891-021-04372-934051771 PMC 8164770 · doi ↗ · pubmed ↗
- 4GBD 2021 Osteoarthritis Collaborators Global, regional, and national burden of osteoarthritis, 1990–2020 and projections to 2050: A systematic analysis for the Global Burden of Disease Study 2021 Lancet Rheumatol.20235 e 508e 52210.1016/S 2665-9913(23)00163-737675071 PMC 10477960 · doi ↗ · pubmed ↗
- 5Wallace I.J. Worthington S. Felson D.T. Jurmain R.D. Wren K.T. Maijanen H. Woods R.J. Lieberman D.E. Knee osteoarthritis has doubled in prevalence since the mid-20th century Proc. Natl. Acad. Sci. USA 20171149332933610.1073/pnas.170385611428808025 PMC 5584421 · doi ↗ · pubmed ↗
- 6Dong Y. Yan Y. Zhou J. Zhou Q. Wei H. Evidence on risk factors for knee osteoarthritis in middle-older aged: A systematic review and meta analysis J. Orthop. Surg. Res.20231863410.1186/s 13018-023-04089-637641050 PMC 10464102 · doi ↗ · pubmed ↗
- 7Duong V. Shaheed C.A. Ferreira M.L. Narayan S.W. Venkatesha V. Hunter D.J. Zhu J. Atukorala I. Kobayashi S. Goh S.L. Risk factors for the development of knee osteoarthritis across the lifespan: A systematic review and meta-analysis Osteoarthr. Cart.2025311162117910.1016/j.joca.2025.03.00340174718 · doi ↗ · pubmed ↗
- 8Nishimura A. Hasegawa M. Kato K. Yamada T. Uchida A. Sudo A. Risk factors for the incidence and progression of radiographic knee osteoarthritis of the knee among Japanese Int. Orthop.20113583984310.1007/s 00264-010-1073-x 20559829 PMC 3103966 · doi ↗ · pubmed ↗
