Post-translational modifications of collagen type I in osteogenesis imperfecta: Systematic review and meta-analysis
Priyesh Patel, Sirion Aksornthong, Svetlana V. Komarova

TL;DR
This study reviews how collagen type I in osteogenesis imperfecta is chemically altered, finding increased modifications in certain tissues and severe cases.
Contribution
The study systematically quantifies post-translational modifications in collagen-I from osteogenesis imperfecta patients, revealing tissue-specific and severity-related patterns.
Findings
Hydroxylysine and glycosylation levels are increased in OI collagen-I, while hydroxyproline remains unchanged.
Collagen-I modifications are more pronounced in bone-derived samples compared to skin or dentine.
Severe OI cases (type 2) show the highest levels of hydroxylysine and glycosylation.
Abstract
Osteogenesis imperfecta (OI) is a rare genetic disorder most often caused by mutations in genes that encode collagen type I. OI collagen-I differs from healthy collagen-I due to the underlying mutation and altered post-translational modifications (PTMs). The objective of this study was to use knowledge synthesis to quantify the levels of selected PTMs, hydroxylysine (HYL), hydroxyproline (HYP) and glycosylation (GLY) in OI collagen-I. A systematic search in Medline, Ovid and Web of Science, identified 701 studies reporting on PTM outcomes for OI patients with collagen-I mutation. We excluded animal studies, and reports for OI patients with mutations other than in collagen-I. After screening, we included 36 qualitative studies and 25 quantitative studies for meta-analysis. All qualitative studies reported that OI collagen-I was overmodified. Meta-analysis of studies with quantitative…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsConnective tissue disorders research · Dermatological and Skeletal Disorders · TGF-β signaling in diseases
Introduction
1
Osteogenesis imperfecta (OI) is a rare metabolic bone disorder affecting 1 in every 15,000 births. Most OI cases (85 %) are caused by a mutation in one of the two genes that encode for type 1 collagen, COL1A1 and COL1A2 (Rauch and Glorieux, 2004; Tournis and Dede, 2018; Nijhuis et al., 2019; Tauer et al., 2019). Type 1 Collagen (collagen-I) is the most prevalent protein in bone tissue. OI is synonymous with brittle bone disease, as the main characteristic of OI is bone fragility. Additionally, phenotypic presentation includes skeletal deformities, scoliosis, low bone mineral density, and dentinogenesis imperfecta (Tournis and Dede, 2018). Presentation of OI with collagen-I mutations is classified under four categories, OI type I-IV, of which OI type I is mild; OI type II is the most severe and associated with perinatal or neonatal lethal cases, OI type III is severe and OI type IV is moderate-severe (Van Dijk et al., 2010).
The essential characteristic of collagen-I is that it is made of 2 COL1A1 chains and 1 COL1A2 chain interwoven into a triple helix (Buss et al., 2022). Each chain has a repetitive amino acid sequence G-X-Y, where G is glycine, X is likely proline, and Y is likely hydroxyproline. Collagen-I undergoes several post-translational modifications (PTM) during the triple helix formation. Both proline and lysine are modified (Gjaltema and Bank RA, 2017; Naomi et al., 2021). Proline is hydroxylated to form hydroxyproline (HYP); the reaction is catalyzed primarily by prolyl 4-hydroxylase (P4H). The hydroxylation by P4H causes a conformational change, which helps with the assembly of the triple helix (Chow et al., 2018). Lysyl hydroxylase converts lysine to 5-hydroxylysine (HYL). Glycosyltransferase 25 domain-containing 1 catalyzes the addition of galactose to HYL to form galactosyl hydroxylysine (GHYL). Glycosyltransferase 25 domain containing 2 catalyzes the addition of glucose to GHYL to form glucosyl galactosyl hydroxylysine (GGHYL) (De Giorgi et al., 2021). OI collagen-I differs from healthy collagen-I due to the underlying mutation and the alterations in PTM levels (Lehmann et al., 1995). There is currently no quantitative consensus on the PTM levels in OI.
We performed a systematic review and meta-analysis on OI collagen-I post-translational modification. The primary objective was to quantify HYL, HYP, and glycosylation levels in OI collagen-I compared to healthy controls. The secondary objective was to examine the association between PTM levels and disease severity.
Materials and methods
2
Information sources, search strategy, eligibility criteria, and screening
2.1
This study complies with the Preferred Reporting Items for Systematic Review and Meta-Analysis (PRISMA) statement (Supplementary Table 1) (Page et al., 2021). The search strategy used combined keywords and Medical Subject Headings (MESH) terms related to OI and terms related to collagen post-translational modification (Supplementary Methods 1). The search was performed in Medline, Embase, and Web of Science in January 2023 and updated in January 2025. The systematic review was not registered in PROSPERO. The title/abstract screening was performed independently by two reviewers (PP and SA) using the Rayyan Systematic Review Screening Software (Ouzzani et al., 2016). Disagreements between screeners were resolved by discussion. The full text screening was performed by PP with the inclusion criteria of studies experimentally assessing collagen-I PTM in samples from OI patients. The included studies were grouped as i) studies containing quantitative assessment of lysine hydroxylation, proline hydroxylation, and hydroxylysine glycosylation suitable for the meta-analysis; ii) studies reporting quantitative data on collagen-I PTM that were incompatible with meta-analysis due to unit/method differences; iii) studies containing qualitative data only. The exclusion criteria were studies describing OI types that are not due to collagen-I mutations, animal studies, conference abstracts, reviews, and editorial commentary. The full list of included studies is in the Supplementary bibliography.
Data extraction
2.2
Publication information and patient parameters were extracted from studies in all three groups (meta-analysis, quantitative and qualitative). It includes authors, publication year, data type presentation (individual or aggregate), number of patients/controls, patient OI type (collagen-I mutation was noted), age and sex of participants. Some studies did not explicitly state the OI type of the patient; however, the disease severity could be determined by the description of the patient (such as perinatal lethal OI type II). For qualitative data, the presence of α,α-dipyridyl was noted. For the meta-analysis, we extracted the PTM levels with variance measures in patient and control samples, the collagen processing technique, the units of measurement for PTM (Supplementary Table 2), and the origin of the collagen samples (bone, fibroblast, or dentine).
Data conversion
2.3
The common outcome units for HYL and HYP were percentage hydroxylysine and percentage hydroxyproline, expressed as . When the outcome was presented as amino acid count, this formula was used. When the outcome was presented as a ratio ( or ), it was converted to percentage with . When the amino acid count was given by cyanogen bromide peptide, the amino acid counts for each peptide were added up, and then percentage was calculated. When the percentage hydroxylation was given per chain of collagen, a weighted average was calculated (66 % for A1 chains and 33 % for A2 chains). The ratio of was another unit of interest and it was calculated using the amino acid counts, except when given directly. For the glycosylation outcomes, the common outcome presentation was as the ratio of diglycosylated/monoglycosylated (GGHYL/GHYL) hydroxylysine.
Study level outcomes
2.4
The selected effect size was the normalized mean difference, calculated as , where were means for the patient and control populations in study i (Mikolajewicz and Komarova, 2019). For studies reporting data for fibroblast and bone samples, the effect size was calculated for these samples independently and then the average of both samples was used for meta-analysis. For studies reporting aggregated data by OI type, the effect size was calculated for each group, followed by calculating a weighted average based on the number of patients in each group.
To estimate the standard errors, we used the following considerations. For one study reporting aggregate data, we use the reported standard deviation (SD), for patient and control populations. For studies that reported individual patient data (IPD) for 4 or more patients or controls per outcome (4 of 17 studies for HYL dataset, 2 of 14 studies for HYP dataset, and 1 of 5 studies for Gly dataset) the standard deviations for patients and controls were calculated as , where represents individual datapoints, and is the mean of individual datapoints. To estimate the SD for samples in studies with fewer than 4 patients or controls, first, we calculated the combined SD for all data points in small-sized studies, treating data for patients and controls separately. Then, the combined SD for patients was imputed to patient or patient groups in each study with 4 or less patients, while the combined standard deviation for controls was assigned to control or control groups in each study with 4 or less controls (Furukawa et al., 2006; Seo et al., 2021). The study-level standard error was calculated as , where are the SD from the pooled control and patient data, is the number controls and patients within the study.
Meta-analysis
2.5
Meta-analysis was performed using a random effects model (Riley et al., 2011). The global effect size ( and its standard error was calculated using the Hunter-Schmidt estimator with small sample size correction (Brannick et al., 2019). Confidence intervals (CI) were calculated as . Heterogeneity analysis was performed, and the percentage of variation across studies due to heterogeneity, I^2^, and between-study variance, τau^2^, were reported. Assumption of the methodologies described is presented in Supplementary Method 3.
To evaluate the covariate effect, the individual effect size was calculated by comparing the individual patient value with the average control value. Then, the individual patient effect sizes were combined by their covariates (Lambert et al., 2002). Only studies in which the analyzed covariates were explicitly reported were included in this analysis.
Assessment of bias
2.6
Studies selected for meta-analysis were evaluated for bias using a quality assessment tool that includes questions to assess case studies. The assessment consists of 10 questions, with a maximum obtainable score of 11 points (Supplementary Methods 2). A funnel plot was made for each outcome to visualize study-level reporting bias (Viechtbauer and Cheung, 2010).
Software
2.7
EndNote 20 and Rayyan were used for reference management (Ouzzani et al., 2016). Webplot digitizer was used for data extraction from figures. Microsoft Excel version 2507 build 16.0.19029 was used for data management, effect size and standard error calculations. Python with the matplotlib package was used to make some figures (Hunter, 2007). R version 4.2.2 was used in Visual Studio Code version 1.103.1 with the metafor package for meta-analysis (forest plot, heterogeneity analysis, funnel plot), and the tidyplots package for figure preparation (Engler, 2025; Viechtbauer, 2010).
Results
3
A systematic search was conducted across three databases: Medline (n = 660), Embase (n = 858), and Web of Science (n = 801), which after duplicate removal, identified 701 unique studies. These studies were screened by title and abstract, and 207 were selected for full-text evaluation (Fig. 1A). After meeting the inclusion criteria of describing PTM of collagen-I in OI patients, 70 studies primarily published in the early to late 90s (Fig. 1B) were included in the systematic review. Of these, 36 studies only provided the qualitative data, 9 reported quantitative outcomes which could not be combined with other studies, and 25 studies were included for meta-analysis (complete reference lists are in Supplemental bibliography). The meta-analysis included studies assessing PTMs in bone, fibroblast and dentine-derived collagen-I, including 17 studies of hydroxylysine (HYL) levels in 88 OI patients, 14 studies of hydroxyproline (HYP) levels in 79 patients, and 5 studies of glycosylation ratios (GLY) in 17 patients (Fig. 1C). Additionally, three studies reported PTMs in dentine collagen-I in 62 patients. A total of 269 patients were described. The OI type distribution was balanced across all four types and for 69 patients, the OI type was not reported (NR) (Fig. 1D). The age was not reported for the majority of patients. When reported, it was mainly for the pediatric population (Fig. 1E). The patient sex was only reported for 13 of 269 patients (Fig. 1F).Fig. 1Systematic review and meta-analysis information flow. (A) The PRISMA diagram indicates the flow of assessment in each step of the process. (B) The publication year distribution for all included studies. (C) Bar plots for number of publications (blue, left y-axis) and patients (orange, right y-axis) for meta-analysis of collagen modifications for hydroxylysine (HYL), hydroxyproline (HYP) and glycosylation (GLY). (D, E, F) Pie charts visualizing reporting of patient parameters including OI type (D), age (E), and sex (F). NR: not reported.Fig. 1
The qualitative dataset comprised 36 studies, which described 58 patients (Table 1). These studies report overmodified collagen-I by isolating the collagen-I from OI patients and using gel electrophoresis to compare the thickness of the collagen-I band to that of the control, where the thicker bands indicate overmodified collagen (Barsh and Byers, 1981). In addition, some studies used the iron chelator α,α-dipyridyl to inhibit lysyl oxidase, which reduced the band thickness to the control levels, confirming that the increase in thickness is related to an increase in PTM levels (Ikeda et al., 1992). All studies in the qualitative dataset reported overmodified collagen-I in OI patients (Table 1).Table 1Overview of studies included in qualitative analysis. Shown are the number of patients (NP) and controls (NC), OI type or reported severity, patients' age (in years, y), and sex (male (M), females (F), α,α-dipyridyl compound used as control for post-translational modification, Overmodified in OI based information reporting. Abbreviation: NR, not reported.Table 1. Author and titleNPNCOI typeAge (y)Sexα,α- DipyridylOvermodified in OIBarsh, 198111II0.5NRYesYesSippola, 198411Mild5MYesYesDe Vries, 198611NR59FYesYesWenstrup, 198611III8MYesYesWenstrup, 198611IV1MYesYesVogel, 198711II2MYesYesBateman, 198711IIPerinatal lethalFYesYesBateman, 198711IINRFNoYesWilling, 198811IIPerinatal lethalNRYesYesByers, 198811IIPerinatal lethalNRYesYesTenni, 198831III2,9,10MYesYesBateman, 198811IIINRNRYesYesSuperti-Furga, 198811IIINRMNoYesWenstrup, 198811I42FYesYesMarini, 198911IV4FYesYesConstantinos, 198911IIPerinatal lethalFYesYesBaldwin, 198911IIPerinatal lethalFYesYesStarman, 198931I21FYesYesWallis, 199011IINRNRYesYesGrange, 199011IIPerinatal lethalFYesYesValli, R. 199021I4,5M,FYesYesBateman, 199111IVNRFNoYesTenni, 199111I2MYesYesHawkins, 199111IINRNRNoYesNicholls, 199111IIINRNRNoYesWallis, 199211IIPerinatal lethalMNoYesBateman, 199211IIPerinatal lethalNRNoYesFertala, 199321IIPerinatal lethalNRNoYesValli, 199311II18 monthsFNoYesMarini, 199321IV4,8FYesYesLightfoot, 199411IV2MNoYesSarafova, 199831IV9,5,15M,FYesYesPace, 200111I-IVNRNRNoYesPace, 200111INRMNoYesPace, 200211IIPerinatal lethalFNoYesCabral, 200341II & NRNRNRNoYes
Nine quantitative studies describing 51 patients were not included in the meta-analysis (Table 2). Six of 9 studies reported outcomes related to glycosylation and 5 studies reported outcomes related to lysine hydroxylation. Glycosylation outcomes reported were percentage glycosylation; however, it was unclear what this percentage specifically designates (diglycosylated/all lysine residues, diglycosylated/hydroxylated lysine residues, diglycosylated/monoglycosylated + hydroxylated lysine residues), preventing their inclusion in meta-analysis. The two studies by Taga and colleagues (Taga et al., 2013; Takeyari et al., 2021) demonstrated site-specific hydroxylation and glycosylation of lysine, indicating that each site had its own distribution of hydroxylation and glycosylation levels in OI patients. These studies show that there is significant heterogeneity with regard to PTM levels (LYS, HYL, GHYL and GGHYL) at each site. All the studies with quantitative data on glycosylation reported a higher glycosylation levels in OI collagen-I compared to controls.Table 2Overview of studies containing quantitative data excluded from the meta-analysis. Shown are number of patients (NP) and controls (NC), OI type, patients' age (in years, y), and sex (male (M), females (F)), reported quantitative data for outcomes (collagen modification of hydroxylysine (HYL), hydroxyproline (HYP), glycosylation (GLY)), Units represents the units that was used for data reporting. Abbreviation: NR, not reported.Table 2. Author and yearNPNCOI TypeAge (y)SexOutcomeUnitsTrelstad, 197711IIPerinatal lethalFGLYPercentage of glucose, galactoseKirsch, 198111II0.25MGLYRelative amount of glycosylated hydroxylysine and Percentage of the sum of glycosylated hydroxylatedBateman, 198472IIPerinatal lethalNRHYL, GLYPercentage Hydroxylation per Chain (missing data)Bateman, 198711IIPerinatal lethalFHYLHYL residues per CNBR peptide of A1 ChainKirsch, 198742IIPerinatal lethalNRGLYPercentage GlycosylatedTajima, 199413IV12MGLYPercentage GlycosylatedBank, 20002861III,IVNRNRHYLHYL/triple helixTaga, 201221II & NRNRNRHYL, GLYsite specific hydroxylation and glycosylation levelsTakeyari, 202061I & IIINRNRHYL,GLYsite specific hydroxylation and glycosylation levels
From 25 quantitative studies included in meta-analysis, 13 were case reports with 1 patient each, 4 were case series, with 2–4 patients per study, and 8 were cohort studies with 13–59 patients per study (Table 3). The number of control samples was generally similar to that of patients, except for three studies, which reported 3 to 7 control samples for 22–30 patient samples (Brenner et al., 1989; Brenner et al., 1993; Brenner et al., 1988). The study level effects were calculated as a normalized mean difference between PTM levels in patient and control samples, with values higher than 0 indicating higher levels in OI patients. For studies reporting aggregate data or IPD for more than 4 patients and controls, we used the reported or calculated mean and SD to estimate the study level effect size and its variance. For studies reporting 1–3 patient or control samples, we calculated the global variance for both patients and controls by combining the SD from all the studies reporting the same PTM outcome. The global variance was then imputed to the patients and controls in relevant studies, followed by calculation of the effect size and its variance (standard error). For statistical independence, when data for the same outcome assessed in different samples of the same patient were reported, the effect size was calculated for each sample independently and then averaged.Table 3Overview of studies included in meta-analysis. Shown are number of patients (NP) and controls (NC), OI type, patients' age (in years, y), and sex (male (M), females (F)), reported outcomes (collagen modification of hydroxylysine (HYL), hydroxyproline (HYP), ratio of hydroxylysine to hydroxyproline (HYL/HYP), glycosylation (GLY)), and quality scores (QS; out of 11 maximum). Abbreviation: NR, not reported; PL, perinatal lethal; B, bone; F, fibroblast; D, dentine.Table 3. Author and yearNPNCOI typeAge (y)SexSample originOutcomesQuality score (n/11)Bleckmann, 197111NR3MBHYL, HYP, HYL/HYP5Eastoe, 197311IIPLMB, DHYL, HYP, HYL/HYP5Meigel, 197511NRNRNRFHYP4Trelstad, 197711IIPLFB, FHYL7Takagi, 198023Tarda and cogenita4, 13NRDHYL4.5Kirsch, 198112IIPLMB, FHYL, HYP, HYL/HYP8Herbage, 198241Moderate/severe and severeRange 4 to 16 yearM,FB, FHYL, HYL/HYP6.5Kirsch, 198311IIPLNRBHYL, HYP, HYL/HYP6Cetta, 19831312Mild and severeNRNRFGLY5.5Steinmann, 198413IINRMFHYL, HYP, GLY8Deak, 198511NRNRNRFHYL, HYP, HYL/HYP3Stoss, 198611IIIPLMBHYL, HYP, HYL/HYP8Kirsch, 19872310IINRNRFHYL6.5Tenni, 198812I to IV0 to 20NRFHYL, GLY9Brenner, 1988296II5MBHYL/HYP2Maroteaux, 198833I to IVNRNRFHYL, HYP, GLY3Gage, 19885933NR3 months, 15, 27NRDHYL, HYP5Rao, 198911NRNRNRFHYL,HYP7.5Brenner, 1989307IINRMBHYL/HYP8Brenner, 1993223I to IVNRNRBHYL/HYP4Tajima, 1994.11III & IVNRNRFHYL, HYP, GLY8Lehmann, 19953914IV12MFHYL, HYP7Bank, 20002851I to IVNRNRBHYP6Barnes, 200611IIINRNRFHYL7Makareeva, 201841NRNRNRFHYL4.5
A meta-analysis of the change in HYL in OI collagen-I demonstrated a significant overall increase in hydroxylysine in OI collagen, with a global effect size of 0.33 [95 % CI: 0.21, 0.45] (Fig. 2A). Heterogeneity was minimal, with τau^2^ estimated at 0.02 and I^2^ at 41 %. When considering subgroup analysis, the bone and fibroblast had an effect size of 0.37 [95 % CI: 0.26, 0.49], and the dentine subgroup had an effect of 0.09 [95 % CI:- 0.01, 0.19]; the difference was significant, with a Q_M_ value of 7.88. To evaluate the covariate effect on PTM levels, we analyzed the individual patient data for which the covariate was explicitly reported. Bone-derived collagen-I demonstrated significantly higher HYL levels compared to fibroblast-derived and dentine-derived collagen-I (Fig. 2B). Since only individual patient data were used for this analysis, there were only 3 individual patients represented from the 62 total patients in the dentine group. We also identified three publications in which data for HYL in fibroblast-derived and bone-derived collagen-I were reported for the same individuals. While in OI patients, the HYL levels were consistently higher in bone-derived collagen-I compared to fibroblasts-derived, in control individuals HYL levels were lower in bone-derived samples (Supplementary Fig. S1). Next, the IPD for patients with reported OI type was analyzed for the association of HYL levels with OI severity. The most severe OI type II exhibited the highest HYL levels, while the least severe OI type I showed the lowest HYL levels (Fig. 2C). OI type II collagen-I HYL was significantly different from OI type I and type III. It is important to note that the bone-derived collagen-I was more commonly obtained from OI type II patients; therefore, it is difficult to interpret whether the OI severity or the tissue origin of OI collagen is the main driver of high HYL levels.Fig. 2HYL percentage in collagen of OI patients compared to controls. (A) Forest plot of percentage difference in HYL in OI collagen, positive values indicate higher levels in patients. Squares with lines depict the study-level effect, normalized mean difference with 95 % Cls; the size of the square is proportional to the study weight. The diamond represents the global effect size. Heterogeneity statistics I^2^ and τau^2^ are reported. (B) HYL percentage in individual patient data sampled from fibroblasts, bone and dentine. (C) HYL percentage in individual patient data separated by OI type, ordered from least to most severe (I,IV,III,II). Unknown are samples in which OI type is unclear. B, C: shown are individual data points and their mean ± SEM; *p < 0.05 and ** p < 0.01 indicate significant differences by 1-way ANOVA with Tukey's post hoc test.Fig. 2
A meta-analysis of the change in HYP in OI collagen-I demonstrated similar levels of HYP in OI and control collagen, with a global effect size of 0.00 [95 % CI: −0.02, 0.02] (Fig. 3A). Heterogeneity was minimal, with τau^2^ estimated at 0.02 and I^2^ at 10.5 %. The OI collagen-I HYP levels were similar in bone- and dentine-samples, and higher in fibroblast-derived samples; nevertheless, overall effect sizes remained very small (Fig. 3B). HYP levels were not associated with OI severity (Fig. 3C). The HYL to HYP amino acid ratio was directly reported in a number of studies (Table 3) or it was calculated from the studies reporting both HYL and HYP. The HYL/HYP ratio was significantly higher in OI samples compared to control, with a global effect size of 0.25 [95 % CI: 0.23, 0.28] (Fig. 3D). Heterogeneity was minimal with a τau^2^ of 0.00 and an I^2^ of 0.0. These data are consistent with higher levels of HYL and similar levels of HYP in OI collagen-I compared to control.Fig. 3HYP percentage in collagen of OI patients compared to control. (A) Forest plot of the percentage difference in HYP in OI collagen compared to the control, with positive values indicating higher levels in patients. (B) HYP percentage in individual patient data sampled from fibroblasts, bone and dentine. (C) HYP percentage in individual patient data separated by OI type, ordered from least to most severe (I,IV,III,II). Unknown are samples in which OI type is unclear. (D) Forest plot of percentage difference in the ratio of HLY to HYP in OI compared to control collagen. For A and D, squares with lines depict the study level effect, normalized mean difference with 95 % Cls; the size of the square is proportional to the study weight. The diamond represents the global effect size. Heterogeneity statistics I^2^ and τau^2^ are reported. B, C: shown are individual data points and their mean ± SEM; ***p < 0.0001 indicate significant differences by 1-way ANOVA with Tukey's post hoc test.Fig. 3
A meta-analysis of the GLY ratio in OI collagen-I demonstrated a significantly higher GLY ratio in OI samples compared to controls, with a global effect size of 0.55 [95 % CI: 0.38, 0.71] (Fig. 4A). The heterogeneity was zero with a τau^2^ of 0.00, and I^2^ of 0.0 %. GLY ratio was significantly higher in the most severe OI type II compared to OI type I and III (Fig. 4B).Fig. 4Change in glycosylation ratio in collagen of OI patients compared to controls. (A) Forest plot of the percentage difference in glycosylation ratio (diglycosylated/monoglycosylated) in OI collagen compared to the control, with positive values indicating higher levels in patients. Squares with lines depict the study-level effect, normalized mean difference with 95 % Cls; the size of the square is proportional to the study weight. The diamond represents the global effect size. Heterogeneity statistics I^2^ and τau^2^ are reported. (B) Glycosylation percentage in individual patient data separated by OI type, ordered from least to most severe (I,IV,III,II). Unknown are samples in which OI type is unclear. Shown are individual data points and their mean ± SEM; *p < 0.05 indicates significant differences by 1-way ANOVA with Tukey's post hoc test.Fig. 4
To analyze the publication bias, we generated the funnel plots for HYL, HYP and GLY datasets. The studies tended to cluster around the global effect size; the plots were generally symmetrical, although some asymmetry can be noted for the HYL dataset (Supplementary Fig. S2A—C). We also examined whether there is an association between the reported effect size and the quality score of the study. We observed no evidence of over-reporting in lower quality studies (Supplementary Fig. S2D—F).
Discussion
4
This systematic review and meta-analysis compiled and quantified all available biological data on the post-translational modification of collagen-I in individuals with OI due to mutations in the collagen-I gene. Using the knowledge synthesis methodology allowed us to combine the data for 269 patients, a substantial sample size for a rare disease. We identified significantly higher levels of HYL and GLY in OI patients compared to control individuals; however, the HYP level remained unchanged. We have found that patients with the most severe form of OI (type II) had the highest levels of HYL and GLY, suggesting a potential role of high collagen-I PTM in mediating disease severity. We also identified that collagen-I derived from bone samples was more susceptible to over-modification, compared to fibroblast-derived and dentine-derived collagen, which is consistent with the knowledge that bone physiology is strongly affected in OI, even though collagen-I is expressed in many tissues. Taken together, our study provides new insights into collagen-I physiology and pathophysiology in OI. It generates new hypotheses regarding the role of PTM in mediating disease presentation in different tissues and overall severity.
We demonstrated that OI collagen-I is significantly post-translationally overmodified, specifically on its lysine residues, while proline residues remain unaffected. The PTM proteins are known to only bind to the individual chains and not to the assembled triple helix (Brownell and Veis, 1976). In the individual chain, hydroxylation of prolines at the Y position of the G-X-Y residues motif is specifically mediated by P4H and is a prerequisite for the triple helix formation (Salo et al., 2024). Thus, in control collagen-I, all proline residues in the Y position are likely hydroxylated, and proline hydroxylation in other positions is harder to achieve, suggesting a saturation for proline hydroxylation, which is maintained in OI collagen. In contrast, lysine hydroxylation by LH occurs on any lysine residue, and hydroxylated lysine can be further glycosylated. Over-modification of lysine supports the notion that OI collagen-I spends longer time in unfolded state (Raghunath et al., 1994). In keeping, it was shown that delaying collagen-I folding by increasing temperature leads to increased HYL levels while HYP levels remained the same (Torre-Blanco et al., 1992; Fujii et al., 2022). The estimates for the degree and variability of PTM in control and OI collagen support future molecular modeling studies to uncover how PTM affects the structure and packing of the collagen-I fibril.
We identified significant differences in the degree of PTM of collagen-I derived from bone, dentine and fibroblasts in OI. This suggests that factors beyond the intrinsic delayed folding of collagen-I have an effect on the PTMs levels. Collagen-I is part of distinct structures in different tissues. In bone, collagen-I fibrils form a foundation for a complex hierarchical organization resulting in a composite material, which is constantly remodeled through an intricate crosstalk between osteoblasts and osteoclasts (Garnero, 2015; Kim et al., 2020). In OI, bone is the site of the main clinical presentation. Dentine is avascular and non-remodeling. Dentinogenesis imperfecta occurs in 50 % of OI patients, leading to various dental issues such as discolored enamel, missing teeth, unerupted teeth, and malocclusion (Taqi et al., 2021a; Taqi et al., 2021b). The skin extracellular matrix is a fibrous network with lower matrix turnover (Mayorca-Guiliani et al., 2025). Individuals with OI do not report clinical issues related to skin, despite having mild alterations to skin elasticity (Hansen and Jemec, 2002). It is possible that tissue-specific environments require different levels of collagen-I PTM to allow for building structures that have vastly different functions (Bank et al., 1999; Eyre et al., 2019). If osteoblasts and fibroblasts have different levels of PTM enzymes or substrates, then delayed triple helix formation can lead to higher levels of over-modification in cells with higher levels of LH. Alternatively, the delay in folding of the triple helix may also be cell-type specific. It also should be noted that collagen from fibroblast is extracted after the cell culture step, whereas bone or dentine collagen is extracted directly using chemical treatment. To date, there have been no comparative studies on PTM protein gene expression and protein quantity across different tissues. Our analysis suggests that a high degree of PTM in OI bone collagen-I may explain why bone is so much more affected than other collagen-I containing tissues in OI.
Structural alterations due to high PTM in collagen-I likely contribute to bone fragility in OI. The increase in glycosylation, presented as the addition of two sugars residues along collagen-I would lead to a significant structural alteration compared to its healthy counterpart (Tvaroška, 2024; Schegg et al., 2009). In silico modeling of single tropocollagen demonstrated that GGHYL reduces the interfacial energy between collagen-I and the surrounding medium (Tang et al., 2020). The increased hydroxylation was shown to alter the cross-linking patterns, resulting in stiffer collagen-I fibrils (Lin and Gu, 2015). These molecular defects disrupt higher-order structures, likely leading to kinked, poorly packed tropocollagen and altered fibril diameter (Cassella et al., 1994). The hierarchical organization of bone amplifies structural defects of collagen-I fibrils, negatively affecting the mechanical properties of the entire bone matrix (Alcântara et al., 2022). The overall pathology of OI is complex and includes abnormal collagen-I structure, decreased collagen-I secretion, changes in cross-linking patterns, hyper-mineralized bone matrix, lower bone mineral density and higher osteoclast activity (Nijhuis et al., 2019; Tauer et al., 2019; Imbert et al., 2014). This complexity needs a multiscale approach to disease understanding, from the molecular level to the clinical presentation.
This study has several limitations. First, many of the included publications date back to a time when diagnostic tools for OI were less developed and less prevalent in the medical space. As a result, disease classification may have been inconsistent, potentially affecting the severity analysis. Additionally, non-collagen-I mutation forms of OI were not well-characterized until more recently, which may have led to improper inclusion of some studies. Even when genetic diagnosis was performed, the patient's genotype (or even if it was a base change mutation or a frameshift mutation) was severely under-reported, preventing genotype-PTM association studies. Another major limitation was the frequent lack of demographic information, such as age and sex, preventing exploration of these variables as potential modifiers of PTM levels. The studies that report age were skewed toward pediatric populations, pointing out a flaw in the research field of OI. Finally, the sample size for certain outcomes remains small; most notably, common glycosylation outcome data were available for only 17 patients across five studies. Expanding the number of studies and patients would help solidify trends that are not observable with the current sample size.
Our systematic review and meta-analysis provide the most comprehensive synthesis to date of biological data on collagen-I PTMs in individuals with OI due to collagen-I mutations. By combining data from 269 patients, we achieved a substantial sample size for this rare disease, providing novel insights into the pathophysiology of OI. Our findings provide a foundation for future work investigating the mechanistic link between collagen-I biochemistry and skeletal fragility in OI and have several translational implications. PTM profiles could be developed into biomarkers for early diagnosis, disease stratification, and monitoring of therapeutic response. They may also inform the design of targeted therapies that modulate collagen-I modification pathways, as well as the development of tissue-specific interventions and biomaterials for bone repair and regeneration. By providing a unified framework and benchmark dataset, this study bridges fundamental biology and clinical practice by supporting a multi-scale approach to understand OI pathophysiology, and laying the groundwork for integrating molecular data into personalized care strategies.
Authors contributions
PP developed the search strategy, performed screening and data extraction, participated in development of data analysis protocols and performed the data analysis, wrote the first draft of the manuscript. SA performed screening, participated in development of data analysis protocols and edited the manuscript. SVK conceived the study, obtained funding, developed the data analysis protocols, participated in data analysis and edited the manuscript. All authors read and approved the final manuscript.
CRediT authorship contribution statement
Priyesh Patel: Writing – review & editing, Writing – original draft, Visualization, Validation, Software, Methodology, Formal analysis, Data curation, Conceptualization. Sirion Aksornthong: Writing – review & editing, Validation, Data curation. Svetlana V. Komarova: Writing – review & editing, Supervision, Project administration, Formal analysis, Conceptualization.
Funding
This work was supported by the Natural Sciences and Engineering Research Council (NSERC, RGPIN-288253) and Developmental Grant DGBR87210CAN24 from the Shriners of North America.
Declaration of competing interest
The authors declare that they have no competing interests.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Alcântara A.C.S.Felix L.C.Galvão D.S.Sollero P.Skaf M.S.Devising bone molecular models at the nanoscale: from usual mineralized collagen fibrils to the first bone fibers including hydroxyapatite in the extra-fibrillar volume Materials 156202210.3390/ma 15062274 PMC 895516935329726 · doi ↗ · pubmed ↗
- 2Bank R.A.Robins S.P.Wijmenga C.Defective collagen crosslinking in bone, but not in ligament or cartilage, in Bruck syndrome: indications for a bone-specific telopeptide lysyl hydroxylase on chromosome 17Proc. Natl. Acad. Sci. USA 96319991054105810.1073/pnas.96.3.10541999/02/029927692 PMC 15349 · doi ↗ · pubmed ↗
- 3Barsh G.S.Byers P.H.Reduced secretion of structurally abnormal type I procollagen in a form of osteogenesis imperfecta. Research support, non-U.S. gov’t research support, U.S. gov’t, P.H.S Proc. Natl. Acad. Sci. USA 788Aug 19815142514610.1073/pnas.78.8.51426946461 PMC 320349 · doi ↗ · pubmed ↗
- 4Brannick M.T.Potter S.Teng Y.Quantifying uncertainty in the meta-analytic lower bound estimate Psychol. Methods 246Dec 201975477310.1037/met 000021731094545 · doi ↗ · pubmed ↗
- 5Brenner R.E.Vetter U.Worsdorfer O.Nerlich A.Teller W.M.Muller P.K.Biochemical analysis of bone compacta in osteogenesis imperfecta Ann. N. Y. Acad. Sci.543198810610810.1111/j.1749-6632.1988.tb 55322.x 3214048 · doi ↗ · pubmed ↗
- 6Brenner RE, Vetter U, Nerlich A, Wörsdorfer O, Teller WM, Müller PK. Osteogenesis imperfecta: insufficient collagen synthesis in early childhood as evidenced by analysis of compact bone and fibroblast cultures. Eur. J. Clin. Investig. 1989/04/01 1989;19(2):159–166. doi:10.1111/j.1365-2362.1989.tb 00211.x.2499474 · doi ↗ · pubmed ↗
- 7Brenner R.E.Vetter U.Stoss H.Muller P.K.Teller W.M.Defective collagen fibril formation and mineralization in osteogenesis imperfecta with congenital joint contractures (Bruck syndrome). Case reports research support, non-U.S. gov’t Eur. J. Pediatr.1526 Jun 199350550810.1007/BF 019550608335019 · doi ↗ · pubmed ↗
- 8Brownell A.G.Veis A.Intracellular location of triple helix formation of collagen. Enzyme probe studies J. Biol. Chem.25122 Nov 25 197671377143186454 · pubmed ↗
