# Multiomics assessment of lung adenocarcinoma subtypes defined through tumor purity-adjusted DNA methylation

**Authors:** Deborah F. Nacer, Elsa Arbajian, Srinivas Veerla, Mattias Aine, Mats Jönsson, Frida Rosengren, Anna Karlsson, Annette Salomonsson, Sofi Isaksson, Maria Planck, Johan Staaf

PMC · DOI: 10.1186/s13073-026-01609-x · Genome Medicine · 2026-02-14

## TL;DR

This study identifies four distinct lung adenocarcinoma subtypes based on DNA methylation patterns, revealing their unique molecular and clinical features.

## Contribution

A novel multiomics approach using tumor purity-adjusted DNA methylation to define LUAD subtypes with distinct biological and clinical characteristics.

## Key findings

- Four epigenetic subtypes (M1-M4) were identified, each with distinct methylation and expression profiles.
- M3 subtype showed lower DNA methylation in NAPSA/surfactant genes despite global promoter hypermethylation.
- LUAD cell lines recapitulated tumor epitype characteristics, supporting epigenetic states as drivers of tumor phenotypes.

## Abstract

Molecular subtypes of lung adenocarcinoma (LUAD) with varying prognosis and characteristics have been proposed based on one or two-dimensional studies but are not yet implemented into clinical routine. Epigenetic modifications in cancer cells are independent of sequence variants, directly linked to gene and genome regulation, and thus provide important information to guide subclassification efforts.

We performed in-depth epigenomic profiling of 95 primary LUAD samples from a Swedish discovery cohort with comprehensive clinicopathological, epigenomic, genomic, transcriptomic, proteomic, and metabolomic data. Additionally, we estimated pure tumor cell methylomes using a computational approach. We subdivided the discovery cohort into four epigenetic subtypes, the epitypes, reflecting distinct tumor cell methylation states. Resulting epitypes were contrasted based on clinicopathological and molecular features, and our main findings were validated in two additional primary tumor cohorts totaling over 700 samples.

Of the four DNA methylation epitypes, M1-M4, M1 and M4 were associated with the previously proposed mRNA subtypes Terminal Respiratory Unit and Proximal Proliferative, respectively. Epitypes M2 and M3 showed similar mRNA/protein subtype composition but differed with respect to e.g., higher expression of the LUAD histology-associated NAPSA/surfactant metabolism expression metagene in M3. Genes included in this metagene showed lower DNA methylation in M3, counter to a global tendency towards promoter hypermethylation in this epitype. To further delineate tumor intrinsic links between the epigenomic and expression phenotypes, 62 LUAD cell lines classified into the four epitypes were investigated and recapitulated several characteristics from the tumor epitypes, such as methylation and expression pattens of NAPSA/surfactant genes, highlighting epigenetic states as likely drivers or maintainers of broad tumor phenotypes and differentiation states.

Dissecting LUAD based on combined biological characteristics using multiomics data has deepened our understanding of the heterogeneity in this complex disease and the mechanisms underlying phenotype formation and maintenance. There remains a critical need for large, publicly accessible, well-annotated multiomic LUAD cohorts to support rigorous subtype discovery and validation, particularly those linked to targeted therapy trial outcomes.

The online version contains supplementary material available at 10.1186/s13073-026-01609-x.

## Linked entities

- **Genes:** NAPSA (napsin A aspartic peptidase) [NCBI Gene 9476]
- **Diseases:** lung adenocarcinoma (MONDO:0005061)

## Full-text entities

- **Genes:** TP53 (tumor protein p53) [NCBI Gene 7157] {aka BCC7, BMFS5, LFS1, P53, TRP53}, CALCA (calcitonin related polypeptide alpha) [NCBI Gene 796] {aka CALC1, CGRP, CGRP-I, CGRP-alpha, CGRP1, CT}, HHLA2 (HHLA2 member of B7 family) [NCBI Gene 11148] {aka B7-H5, B7-H7, B7H7, B7y}, EZH2 (enhancer of zeste 2 polycomb repressive complex 2 subunit) [NCBI Gene 2146] {aka ENX-1, ENX1, EZH2b, KMT6, KMT6A, WVS}, LGALS4 (galectin 4) [NCBI Gene 3960] {aka GAL4, L36LBP}, KRT19 (keratin 19) [NCBI Gene 3880] {aka CK19, K19, K1CS}, FGL1 (fibrinogen like 1) [NCBI Gene 2267] {aka HFREP1, HP-041, HPS, LFIRE-1, LFIRE1}, SFTA3 (surfactant associated 3) [NCBI Gene 253970] {aka NANCI, PAHRF, SFTPH, SP-H, SPH}, ASCL1 (achaete-scute family bHLH transcription factor 1) [NCBI Gene 429] {aka ASH1, HASH1, MASH1, bHLHa46}, STK11 (serine/threonine kinase 11) [NCBI Gene 6794] {aka LKB1, PJS, hLKB1}, POLD1 (DNA polymerase delta 1, catalytic subunit) [NCBI Gene 5424] {aka CDC2, CRCS10, IMD120, MDPL, POLD}, CEACAM5 (CEA cell adhesion molecule 5) [NCBI Gene 1048] {aka CD66e, CEA}, TFF3 (trefoil factor 3) [NCBI Gene 7033] {aka ITF, P1B, TFI}, KDM4D (lysine demethylase 4D) [NCBI Gene 55693] {aka JMJD2D}, SMARCA4 (SWI/SNF related BAF chromatin remodeling complex subunit ATPase 4) [NCBI Gene 6597] {aka BAF190, BAF190A, BRG1, CSS4, MRD16, OTSC12}, SFTA2 (surfactant associated 2) [NCBI Gene 389376] {aka GSGL541, SFTPG, SP-G, UNQ541}, EGFR (epidermal growth factor receptor) [NCBI Gene 1956] {aka ERBB, ERBB1, ERRP, HER1, NISBD2, NNCIS}, SUZ12 (SUZ12 polycomb repressive complex 2 subunit) [NCBI Gene 23512] {aka CHET9, IMMAS, JJAZ1}, SLC44A4 (solute carrier family 44 member 4) [NCBI Gene 80736] {aka C6orf29, CTL4, DFNA72, NG22, TPPT, hTPPT1}, NAPSA (napsin A aspartic peptidase) [NCBI Gene 9476] {aka KAP, Kdap, NAP1, NAPA, NR1H2-AS1, SNAPA}, KRAS (KRAS proto-oncogene, GTPase) [NCBI Gene 3845] {aka 'C-K-RAS, C-K-RAS, CFC2, K-RAS2A, K-RAS2B, K-RAS4A}, DYM (dymeclin) [NCBI Gene 54808] {aka DMC, SMC}, CREBBP (CREB binding lysine acetyltransferase) [NCBI Gene 1387] {aka CBP, KAT3A, MKHK1, RSTS, RSTS1}, SMAD2 (SMAD family member 2) [NCBI Gene 4087] {aka CHTD8, JV18, JV18-1, LDS6, MADH2, MADR2}
- **Diseases:** GSOA (MESH:D020920), SBS (MESH:D012640), DFI (MESH:D015673), M1 (MESH:D015470), lung squamous cell carcinoma (MESH:D002294), LUAD (MESH:D000077192), TRU (MESH:D012131), IV disease (MESH:D020432), PP (MESH:D009220), Tumor (MESH:D009369), lung cancer (MESH:D008175), homologous (MESH:D006086), Inflammatory (MESH:D007249), breast cancer (MESH:D001943), M3 (MESH:D015473), M4 tumor (MESH:D015479), DMCs (MESH:D012734), CIMP (MESH:D007516), ES (MESH:D012512), NMF (MESH:C538347)
- **Chemicals:** amino acid (MESH:D000596), fatty acid (MESH:D005227), DFN (-)
- **Species:** Nicotiana tabacum (American tobacco, species) [taxon 4097], Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** NCI-H1975 — Homo sapiens (Human), Lung adenocarcinoma, Cancer cell line (CVCL_1511), VMRC-LCD — Homo sapiens (Human), Lung adenocarcinoma, Cancer cell line (CVCL_1787), GSE149521 — Konosirus punctatus (Dotted gizzard shad), Spontaneously immortalized cell line (CVCL_6F81), S2 — Drosophila melanogaster (Fruit fly), Spontaneously immortalized cell line (CVCL_Z232), NCI-H2228 — Homo sapiens (Human), Lung adenocarcinoma, Cancer cell line (CVCL_1543), HCC827 — Homo sapiens (Human), Lung adenocarcinoma, Cancer cell line (CVCL_2063)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12927254/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12927254/full.md

## References

9 references — full list in the complete paper: https://tomesphere.com/paper/PMC12927254/full.md

---
Source: https://tomesphere.com/paper/PMC12927254