# Identification of SORD as A Key Gene Mediating Osimertinib Primary Resistance in EGFR-Mutant Lung Adenocarcinoma via Machine Learning and Methylomics

**Authors:** Guowei LIANG, Hongfeng WU, Chaoyi JIA, Penghu GAO, Zhanrui ZHANG, Chen CHEN, Yongwen LI, Hongyu LIU, Jun CHEN

PMC · DOI: 10.3779/j.issn.1009-3419.2025.102.46 · Chinese Journal of Lung Cancer · 2025-12-20

## TL;DR

This study identifies SORD as a key gene linked to primary resistance to the drug osimertinib in lung adenocarcinoma patients with EGFR mutations, using machine learning and methylation data.

## Contribution

The study introduces SORD as a novel biomarker for predicting osimertinib resistance and proposes targeting the SORD pathway as a new strategy to overcome resistance.

## Key findings

- SORD methylation levels are negatively correlated with its mRNA expression in EGFR-mutant lung adenocarcinoma.
- Modulating SORD expression reverses osimertinib resistance in resistant and sensitive cell lines.
- Targeting SORD methylation could serve as a potential therapeutic strategy to overcome drug resistance.

## Abstract

肺腺癌（lung adenocarcinoma, LUAD）是非小细胞肺癌中最常见的亚型，而表皮生长因子受体（epidermal growth factor receptor, EGFR）突变是其主要的分子驱动事件。尽管第三代酪氨酸激酶抑制剂（trrosine kinase inhibitors, TKIs）药物奥希替尼已成为该类患者的标准一线疗法，但耐药性的产生严重限制了患者的长期生存获益。证据表明，表观遗传重塑是导致耐药的重要非遗传机制。其中，DNA启动子区的高甲基化可通过沉默关键抑癌基因或代谢调节基因，协助肿瘤细胞逃避药物杀伤。本研究旨在通过机器学习和高通量筛选，挖掘调控奥希替尼敏感性的关键基因，并解析甲基化修饰在其调控奥希替尼原发性耐药中的作用。

基于癌症基因组图谱（The Cancer Genome Atlas, TCGA）-LUAD EGFR突变队列，应用oncoPredict模型预测奥希替尼的半数抑制浓度（half inhibitory concentration, IC50）。通过多组学关联分析，分别筛选与IC50显著相关的转录组与甲基化组差异基因。根据“高甲基化-低表达”负相关模式对两组学数据取交集，并应用LASSO回归与Bootstrap验证筛选出最稳健的核心基因。最后，在EGFR突变LUAD细胞系中，通过药物敏感性测定、去甲基化处理及基因过表达/敲低实验，验证核心基因对奥希替尼耐药、细胞增殖及凋亡的影响。

多组学分析与机器学习算法鉴定出SORD为核心候选基因，其关键位点（cg06424894）甲基化水平与mRNA表达呈显著负相关。体外实验显示，奥希替尼耐药株H1650呈SORD高甲基化与低表达特征，去甲基化药物处理可显著恢复其表达；敏感株则呈相反趋势。功能实验证实，在H1650耐药细胞中过表达SORD可逆转耐药，抑制细胞增殖并促进凋亡。相反，在H1975和PC9敏感细胞中敲低SORD，则显著诱导耐药，促进增殖并抑制凋亡。

SORD的甲基化状态或表达水平可望成为预测奥希替尼疗效的潜在生物标志物，靶向SORD相关通路可能为克服奥希替尼耐药提供新策略。

## Linked entities

- **Genes:** SORD (sorbitol dehydrogenase) [NCBI Gene 6652]
- **Chemicals:** osimertinib (PubChem CID 71496458)
- **Diseases:** lung adenocarcinoma (MONDO:0005061)

## Full-text entities

- **Genes:** METTL3 (methyltransferase 3, N6-adenosine-methyltransferase complex catalytic subunit) [NCBI Gene 56339] {aka IME4, M6A, MT-A70, Spo8, hMETTL3}, ZNF280B (zinc finger protein 280B) [NCBI Gene 140883] {aka 5'OY11.1, D87009.C22.3, SUHW2, ZNF279, ZNF632}, LANCL2 (LanC like glutathione S-transferase 2) [NCBI Gene 55915] {aka GPR69B, TASP}, POM121 (POM121 transmembrane nucleoporin) [NCBI Gene 9883] {aka P145, POM121A}, FIGN (fidgetin, microtubule severing factor) [NCBI Gene 55137], RAC1 (Rac family small GTPase 1) [NCBI Gene 5879] {aka MIG5, MRD48, Rac-1, TC-25, p21-Rac1}, RNF220 (ring finger protein 220) [NCBI Gene 55182] {aka C1orf164, HLD23}, TP53 (tumor protein p53) [NCBI Gene 7157] {aka BCC7, BMFS5, LFS1, P53, TRP53}, SORD (sorbitol dehydrogenase) [NCBI Gene 6652] {aka HEL-S-95n, HMNR8, RDH, SDH, SORD1, SORDD}, TIAM1 (TIAM Rac1 associated GEF 1) [NCBI Gene 7074] {aka NEDLDS, TIAM-1}, ZNF418 (zinc finger protein 418) [NCBI Gene 147686] {aka ZFP418}, LYPD6 (LY6/PLAUR domain containing 6) [NCBI Gene 130574], GAPDH (glyceraldehyde-3-phosphate dehydrogenase) [NCBI Gene 2597] {aka G3PD, GAPD, HEL-S-162eP}, AKR1B1 (aldo-keto reductase family 1 member B) [NCBI Gene 231] {aka ADR, ALDR1, ALR2, AR}, TXK (TXK tyrosine kinase) [NCBI Gene 7294] {aka BTKL, PSCTK5, PTK4, RLK, TKL}, EGFR (epidermal growth factor receptor) [NCBI Gene 1956] {aka ERBB, ERBB1, ERRP, HER1, NISBD2, NNCIS}
- **Diseases:** cytotoxicity (MESH:D064420), Cancer (MESH:D009369), NSCLC (MESH:D002289), LUAD (MESH:D000077192)
- **Chemicals:** Lipofectamine 2000 (MESH:C086724), Osimertinib (MESH:C000596361), Annexin V PI (-), Decitabine (MESH:D000077209), PBS (MESH:D007854), CO2 (MESH:D002245), CCK-8 (MESH:D012844)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Mutations:** H1975 P, I759del, K754I 759del      L858R E746_A750del, D0068S, A750del, H773dup, G719A/C, H1975, T751del, T790M, L833F, S768I, C797S, L861Q, V769dup, L858R
- **Cell lines:** PC9 — Homo sapiens (Human), Lung adenocarcinoma, Cancer cell line (CVCL_B260), PM150210 — Homo sapiens (Human), Hybridoma (CVCL_XI22), H1975 — Homo sapiens (Human), Lung adenocarcinoma, Cancer cell line (CVCL_1511), H1650 — Homo sapiens (Human), Minimally invasive lung adenocarcinoma, Cancer cell line (CVCL_1483), HCC827 — Homo sapiens (Human), Lung adenocarcinoma, Cancer cell line (CVCL_2063)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12972575/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12972575/full.md

## References

18 references — full list in the complete paper: https://tomesphere.com/paper/PMC12972575/full.md

---
Source: https://tomesphere.com/paper/PMC12972575