# Broken adaptive ridge method for variable selection in generalized partly linear models with application to the coronary artery disease data

**Authors:** Christian Chan, Xiaotian Dai, Thierry Chekouo, Quan Long, Xuewen Lu

PMC · DOI: 10.1016/j.jcmds.2025.100127 · Journal of Computational Mathematics and Data Science · 2026-01-31

## TL;DR

This paper introduces a new statistical method for analyzing high-dimensional data in generalized partly linear models, with an application to coronary artery disease.

## Contribution

The broken adaptive ridge (BAR) estimator is proposed as a novel method for variable selection and parameter estimation in generalized partly linear models.

## Key findings

- The BAR estimator outperforms other penalty-based variable selection methods in simulations.
- The method was applied to coronary artery disease data, revealing new insights in both genetic and non-genetic covariates.

## Abstract

Motivated by the CATHGEN data, we develop a new statistical method for simultaneous variable selection and parameter estimation in the context of generalized partly linear models for data with high-dimensional covariates. The method is referred to as the broken adaptive ridge (BAR) estimator, which is an approximation of the L0-penalized regression by iteratively performing reweighted squared L2-penalized regression. The generalized partly linear model extends the generalized linear model by incorporating a non-parametric component, allowing for the construction of a flexible model to capture various types of covariate effects. We employ the Bernstein polynomials as the sieve space to approximate the non-parametric functions so that our method can be implemented easily using the existing R packages. Extensive simulation studies suggest that the proposed method performs better than other commonly used penalty-based variable selection methods. We apply the method to the CATHGEN data with a binary response from a coronary artery disease study, which motivated our research, and obtained new findings in both high-dimensional genetic and low-dimensional non-genetic covariates.

## Linked entities

- **Diseases:** coronary artery disease (MONDO:0005010)

## Full-text entities

- **Genes:** F10 (coagulation factor X) [NCBI Gene 2159] {aka FX, FXA}, RBFOX1 (RNA binding fox-1 homolog 1) [NCBI Gene 54715] {aka 2BP1, A2BP1, FOX-1, FOX1, HRNBP1}, APOE (apolipoprotein E) [NCBI Gene 348] {aka AD2, APO-E, ApoE4, LDLCQ5, LPG}, GABRG3 (gamma-aminobutyric acid type A receptor subunit gamma3) [NCBI Gene 2567], ABCA1 (ATP binding cassette subfamily A member 1) [NCBI Gene 19] {aka ABC-1, ABC1, CERP, HDLCQTL13, HDLDT1, HPALP1}, IL1B (interleukin 1 beta) [NCBI Gene 3553] {aka IL-1, IL1-BETA, IL1F2, IL1beta}, NR1H4 (nuclear receptor subfamily 1 group H member 4) [NCBI Gene 9971] {aka BAR, FXR, HRR-1, HRR1, PFIC5, RIP14}, CDH13 (cadherin 13) [NCBI Gene 1012] {aka CDHH, P105}
- **Diseases:** obesity (MESH:D009765), hypertension (MESH:D006973), ischemic heart disease (MESH:D017202), CAD (MESH:D003324), myocardial infarction (MESH:D009203), hypercholesterolemia (MESH:D006937), GPLMs (MESH:D004195), diabetes (MESH:D003920), heart failure (MESH:D006333), cardiovascular disease (MESH:D002318), death (MESH:D003643), coronary heart disease (MESH:D003327)
- **Chemicals:** blood cholesterol (-), dodecanedioic acid (MESH:C036836), lipid (MESH:D008055), blood sugar (MESH:D001786)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12857884/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12857884/full.md

## References

45 references — full list in the complete paper: https://tomesphere.com/paper/PMC12857884/full.md

---
Source: https://tomesphere.com/paper/PMC12857884