The Construction and Performance Evaluation of a Risk Prediction Model for Nonalcoholic Steatohepatitis Based on Serological Markers
Dongbo Huang, Wanqin Zhang, Ying Fang, Taotao Liu, Da Zhou

TL;DR
This paper develops a noninvasive model to predict nonalcoholic steatohepatitis (NASH) using blood markers, reducing the need for liver biopsies.
Contribution
A new risk prediction model for NASH using serological markers is developed and validated.
Findings
AST and ceruloplasmin are independent risk factors for NASH.
The CHART2 model shows good accuracy and clinical utility for NASH prediction.
Serological markers like HDL, TSH, and TBA are included in the predictive model.
Abstract
To develop a noninvasive clinical diagnostic model based on serological markers for nonalcoholic steatohepatitis (NASH) and to verify its predictive efficacy. A total of 82 biopsy-proven patients with nonalcoholic fatty liver disease (NAFLD) were included in the study. Patients were classified into nonalcoholic fatty liver (NAFL) and NASH groups based on the results of liver biopsies. The study utilized the LASSO regression model for variable selection, followed by logistic regression analysis to create a prediction model. A nomogram was then developed to illustrate this model. To validate the model, bootstrapping was applied for internal validation, and the accuracy, consistency, and clinical utility of the prediction model were evaluated. The NASH group had significantly higher levels of red blood cell count, lactate dehydrogenase (LDH), aspartate aminotransferase (AST), and alanine…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLiver Disease Diagnosis and Treatment · Diet, Metabolism, and Disease · Hepatitis C virus research
1. Introduction
Nonalcoholic steatohepatitis (NASH) is a significant inflammatory subtype of nonalcoholic fatty liver disease (NAFLD). Compared to patients with nonalcoholic fatty liver (NAFL), those with NASH exhibit faster disease progression and worse prognoses. Over 20% of NASH patients eventually develop cirrhosis. Studies show that NASH patients are more likely to experience metabolic abnormalities, including obesity, Type 2 diabetes, hyperlipidemia, and hypertension, when compared to the general NAFLD population. Moreover, NASH patients have a higher incidence of advanced liver fibrosis and are at a significantly increased risk of developing hepatocellular carcinoma (HCC) compared to the broader NAFLD population [1]. Research suggests that NASH has become the leading indication for liver transplantation in American women and is projected to surpass alcoholic liver disease as the primary cause for liver transplantation in patients with end-stage liver disease [2]. Additionally, NASH patients are more prone to extrahepatic complications such as atherosclerotic cardiovascular disease, cerebrovascular disease, renal disease, and certain malignancies. Due to both intrahepatic and extrahepatic complications, the survival rate of NASH patients is significantly lower than that of the general population. Therefore, early diagnosis, effective prevention, and treatment are essential for improving prognosis and quality of life in NASH patients.
Currently, liver biopsy is the gold standard for diagnosing NAFLD and NASH in clinical settings. However, as an invasive procedure, it is limited by sampling errors and complications, including bleeding, infection, bile leakage, and even death. As a result, there is an urgent need for noninvasive diagnostic methods for NASH. Although alanine aminotransferase (ALT) is frequently used as a serological marker, it has a poor correlation with NASH and cannot be relied upon for diagnosis [2]. Cytokeratin-18 (CK-18) is recognized as one of the most validated blood biomarkers for NASH, yet its clinical accuracy remains insufficient [3]. While some diagnostic models for NASH have been proposed, they have not gained widespread use due to factors such as limited population representativeness and complex calculations.
Thus, the development of universally accepted, reliable noninvasive diagnostic methods for NASH remains a critical unmet need. This study is aimed at establishing a noninvasive clinical diagnostic model for NASH based on serological markers and presenting it using a nomogram.
2. Methods
2.1. Study Population
This retrospective study included 127 patients who underwent liver biopsy between September 2021 and November 2023. The inclusion criteria were as follows: (1) age between 18 and 75 years, (2) evidence of hepatic steatosis or fatty liver as indicated by ultrasound, and (3) availability of complete data for statistical analysis. The exclusion criteria were as follows: (1) excessive alcohol consumption (> 30 g/day for men or > 20 g/day for women), (2) presence of other liver diseases, such as Wilson's disease, hepatitis B or C infection, or drug-induced liver injury, and (3) liver biopsy showing steatosis in less than 5% of hepatocytes. The study adhered to the guidelines of the Declaration of Helsinki and was approved by the Medical Ethics Committee of Zhongshan Hospital of Fudan University (Approval No. B2020-085R). Informed consent was obtained from all patients prior to liver biopsy.
2.2. Data Collection and Laboratory Assessment
Demographic characteristics and medical histories were retrieved from the hospital's electronic medical record system. Venous blood samples were collected on the same day as the liver biopsy, following an overnight fast of at least 8 h. Laboratory assessments for all patients included complete blood count, thyroid function markers, liver and kidney function markers, glucose metabolism markers, lipid metabolism markers, iron and copper metabolism markers, alpha-fetoprotein, C-reactive protein, and other relevant parameters.
2.3. Vibration-Controlled Transient Elastography (VCTE) Examination
All enrolled patients underwent a VCTE examination using the FibroScan 502 device (Echosens). Liver stiffness measurements (LSMs) were recorded following the examination.
2.4. Histopathology Evaluation
Liver samples were analyzed independently and blinded by two experienced pathologists using the Steatosis-Activity-Fibrosis (SAF) scoring algorithm. The diagnosis of NAFLD was based on the following criteria: (1) the presence of steatosis in at least 5% of hepatocytes on liver biopsy and (2) exclusion of other liver diseases. NASH was defined by a histological score of at least 1 point in each of the following features: hepatic steatosis, hepatocellular ballooning, and lobular inflammation [2].
2.5. Statistical Analysis
Statistical analysis was performed using the SPSS 27.0 software (SPSS Inc., Chicago, Illinois) and R statistical software (Foundation for Statistical Computing, Vienna, Austria). Categorical variables were presented as frequencies and percentages and analyzed using the chi-square test or Fisher's exact test. The normality of continuous variables was assessed using the Shapiro–Wilk test. Data following normal distribution were expressed as , while nonnormally distributed data were presented as median and interquartile range. Intergroup comparisons were conducted using the t-test or Mann–Whitney U test when appropriate. All statistical analyses were two-tailed, with a 95% confidence interval (CI), and statistical significance was defined as p < 0.05. LASSO regression was applied for variable selection. Multivariable logistic regression was used to analyze the risk factors associated with NASH and to develop a prediction model for distinguishing NASH from NAFLD. The nomogram was constructed to estimate the probability of NASH. The diagnostic performance of the model was evaluated using receiver operating characteristic (ROC) curve analysis.
3. Results
3.1. General Characteristics
A total of 82 adult patients with biopsy-proven NAFLD were enrolled in this study. Of these, 60 cases were classified into the NASH group and 22 into the NAFL group, based on liver biopsy results. The patient enrollment flowchart is shown in Figure 1, and the general characteristics of the study population are summarized in Table 1. Compared to patients in the NAFL group, those in the NASH group had significantly higher levels of red blood cell counts, ALT, AST, and LDH, while HDL levels were significantly lower (p < 0.05). No significant differences were observed in other parameters, including LSMs (p > 0.05 for all comparisons).
3.2. Construction of the CHART2 Model for Discriminating NASH
LASSO regression analysis identified eight potential predictive factors: red blood cell count, total bile acid (TBA), ceruloplasmin, thyroid-stimulating hormone (TSH), ALT, AST, HDL, and uric acid (UA). These eight variables were then included in a binary logistic regression. The analysis revealed that six variables—red blood cell count, AST, TSH, HDL, TBA, and ceruloplasmin—were incorporated into the regression model. Among them, AST (OR: 1.027, 95% CI: 1.004–1.055, p = 0.035) and ceruloplasmin (OR: 1.022, 95% CI: 1.002–1.044, p = 0.038) were independent risk predictors for NASH (Table 2). The regression formula for predicting a patient's risk of developing NASH was as follows: P = e^logit (NASH)^/1 + e^logit (NASH)^; logit(NASH) = −9.941 + 1.312 × red blood cell count (10^12^/L) + 0.026 × AST (U/L) − 2.239 × HDL (mmol/L) + 0.499 × TSH (μIU/mL) + 0.125 × TBA (μmol/L) + 0.021 × ceruloplasmin (mg/L). Subsequently, these six variables were further analyzed using R statistical software to construct a personalized nomogram for NASH risk (see Figure 2).
3.3. Performance of the CHART2 Model in Discriminating NASH From NAFLD Patients
3.3.1. Discriminative Ability of the Model
To assess the discriminative ability of the CHART2 model, a ROC curve analysis was conducted. The area under the ROC curve (AUROC) for the model was 0.813 (95% CI: 0.706–0.920), demonstrating a high level of accuracy in distinguishing NASH from NAFLD patients (see Figure 3).
3.3.2. Calibration of the Model
The calibration of the CHART2 model was assessed using the Hosmer–Lemeshow test, which yielded a p value of 0.627, indicating a strong agreement between the model's predictions and actual observations. Additionally, bootstrapping was performed for internal validation, and a calibration curve was plotted. The calibration curve closely aligned with the reference line, further confirming the model's accuracy and consistency (see Figure 4).
3.3.3. Clinical Utility of the Model
The clinical utility of the CHART2 model was evaluated through a decision curve analysis (DCA). The results showed that when the risk threshold probability exceeds 25%, the net benefit of using the CHART2 model surpasses that of the two extreme curves. This indicates that within this risk threshold range, the model provides significant clinical benefit in identifying NASH patients. Furthermore, the wide range of applicable risk thresholds underscores the model's robust clinical utility (see Figure 5).
4. Discussion
In this study, we conducted a retrospective analysis of NAFLD cases diagnosed via liver biopsy to identify potential predictive factors for NASH and developed the CHART2 model for distinguishing NASH from NAFLD. We utilized LASSO regression to select optimal predictive variables and subsequently incorporated six variables—red blood cell count, AST, TSH, HDL, TBA, and ceruloplasmin—into the predictive model, which was presented as a nomogram. The model demonstrated an AUROC of 0.813, and both the calibration curve and the Hosmer–Lemeshow test indicated strong predictive consistency. DCA further confirmed the model's significant clinical utility. The CHART2 model's advantage lies in its use of readily available clinical biochemical markers and straightforward calculation methods.
NASH has a significantly lower survival rate compared to the general population, and its rising prevalence poses a substantial burden on public health and socioeconomic systems. Early and noninvasive diagnosis of NASH is crucial for improving patient prognosis and quality of life. Although liver biopsy remains the gold standard for diagnosing NASH, its invasive nature limits widespread clinical application. Consequently, there is a pressing need for noninvasive diagnostic methods. Several noninvasive models, including those based on serological biomarkers, imaging indicators, or algorithms, have been developed. However, many of these models face challenges such as limited applicability or lack of extensive validation. For instance, the Gholam model, which includes serum AST levels and the presence of Type 2 diabetes, has an AUROC of 0.82 but is limited by its study population, which was severely obese (BMI: 55 ± 12 kg/m^2^) and predominantly female [4]. Similarly, the model by Palekar et al., incorporating variables like age, gender, AST, BMI, AST/ALT ratio, and serum hyaluronic acid, has an AUROC of 0.76, with lower sensitivity and specificity [5]. The NashTest scoring model, developed by Poynard et al., with an AUROC of 0.79, has high specificity but low sensitivity and is complex due to its 13 variables and inclusion of less commonly tested markers [6]. In contrast, the CHART2 model, with an AUROC of 0.813 and sensitivity and specificity of 80% and 77.3%, respectively, uses more accessible biomarkers and simpler calculations, making it more suitable for routine clinical use. Wu et al.'s acNASH index model, with an AUROC of 0.805–0.818, identifies NASH through serum creatinine and AST levels but has not performed well in other studies [7]. Thus, our predictive model offers a novel, noninvasive approach for identifying NASH patients, potentially reducing reliance on liver biopsies.
Our study identified elevated AST as an independent risk factor for NASH, consistent with findings from Younossi et al. and Feldstein et al. [8, 9]. AST, a mitochondrial enzyme released during hepatocyte damage, may reflect mitochondrial injury in NASH patients. Elevated serum AST levels are commonly included in NASH predictive models, emphasizing its predictive significance. However, the optimal timing for using serum AST levels in clinical practice must be established, as it can be influenced by hepatoprotective agents.
Additionally, ceruloplasmin emerged as an independent predictor for NASH. As a copper-containing protein secreted by the liver, ceruloplasmin plays a key role in iron homeostasis [10]. Studies have found that serum ceruloplasmin levels are negatively correlated with liver fat content [11]. There is also evidence that ceruloplasmin with enzymatic activity can increase iron efflux from the liver [12], leading some researchers to suggest that reduced ceruloplasmin may result in iron deposition in the liver, which is closely associated with the development of NAFLD/NASH [13]. Iron accumulated in hepatocytes can increase the production of reactive oxygen species through the Fenton reaction, thereby inducing inflammation, fibrosis, and apoptosis. Some studies have found that lower serum ceruloplasmin levels are significantly associated with NASH and can be used to predict the onset of NASH [14]. Conversely, other studies have reported increased levels of serum ceruloplasmin in patients with diabetes, metabolic syndrome, and NAFLD [15–17]. Given these ongoing controversies, we systematically evaluated the impact of ceruloplasmin exclusion in our modeling approach. Our analyses revealed that the exclusion of ceruloplasmin resulted in a complete failure to derive a valid logistic regression equation. Consequently, the predictive model could not be properly constructed. Therefore, we ultimately included this variable in our predictive model. The regression equation generated in this study when constructing the predictive model appears to support a higher ceruloplasmin level (OR 1.022, 95% CI: 1.002–1.044) being associated with NASH. A possible reason is that ceruloplasmin, as an acute-phase reactant protein, may have its serum concentration increased during inflammation, infection, and trauma due to cytokine-mediated hepatocyte gene transcription [18]. To date, the role of ceruloplasmin in the pathogenesis of NASH remains incompletely elucidated, and further research is needed.
This study has limitations, including a single-center design and a relatively small sample size, which restrict generalizability and necessitate confirmation through larger, external validation cohorts. Additionally, the retrospective nature of the study and the selection of clinical tests may have introduced data bias and omitted valuable predictive variables.
In conclusion, the nomogram based on serological markers could be used as a valuable tool for noninvasive risk prediction of NASH, facilitating the differentiation of NASH from NAFLD in clinical practice.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Younossi Z. M. Koenig A. B. Abdelatif D. Fazel Y. Henry L. Wymer M. Global Epidemiology of Nonalcoholic Fatty Liver Disease-Meta-Analytic Assessment of Prevalence, Incidence, and Outcomes Hepatology 2016641738410.1002/hep.284312-s 2.0-8497647913026707365 · doi ↗ · pubmed ↗
- 2Sheka A. C. Adeyi O. Thompson J. Hameed B. Crawford P. A. Ikramuddin S. Nonalcoholic Steatohepatitis: A Review JAMA 2020323 p. 1210.1001/jama.2020.229832207804 · doi ↗ · pubmed ↗
- 3Kwok R. Tse Y. K. Wong G. L. H. Systematic Review With Meta-Analysis: Non-Invasive Assessment of Non-Alcoholic Fatty Liver Disease-the Role of Transient Elastography and Plasma Cytokeratin-18 Fragments Alimentary Pharmacology & Therapeutics 201439325426910.1111/apt.125692-s 2.0-8489188985524308774 · doi ↗ · pubmed ↗
- 4Gholam P. M. Flancbaum L. Machan J. T. Charney D. A. Kotler D. P. Nonalcoholic Fatty Liver Disease in Severely Obese Subjects American Journal of Gastroenterology 2007102239940810.1111/j.1572-0241.2006.01041.x 2-s 2.0-3384647361217311652 · doi ↗ · pubmed ↗
- 5Palekar N. A. Naus R. Larson S. P. Ward J. Harrison S. A. Clinical Model for Distinguishing Nonalcoholic Steatohepatitis From Simple Steatosis in Patients With Nonalcoholic Fatty Liver Disease Liver International 200626215115610.1111/j.1478-3231.2005.01209.x 2-s 2.0-3364601390416448452 · doi ↗ · pubmed ↗
- 6Poynard T. Ratziu V. Charlotte F. Diagnostic Value of Biochemical Markers (Nash Test) for the Prediction of non alcoholo steato hepatitis in Patients With Non-Alcoholic Fatty Liver Disease BMC Gastroenterology 20066110.1186/1471-230X-6-342-s 2.0-3384520709617096854 PMC 1657015 · doi ↗ · pubmed ↗
- 7Wang Q. Zhou D. Wang M. A Novel Non-Invasive Approach Based on Serum Ceruloplasmin for Identifying Non-Alcoholic Steatohepatitis Patients in the Non-Diabetic Population Frontiers in Medicine 2022990079410.3389/fmed.2022.900794 PMC 925251835795637 · doi ↗ · pubmed ↗
- 8Younossi Z. M. Jarrar M. Nugent C. A Novel Diagnostic Biomarker Panel for Obesity-Related Nonalcoholic Steatohepatitis (NASH) Obesity Surgery 200818111430143710.1007/s 11695-008-9506-y 2-s 2.0-5384911464318500507 · doi ↗ · pubmed ↗
