Curated and harmonised transcriptomics datasets of interstitial lung diseases
Simo Inkala, Antonio Federico, Angela Serra, Dario Greco

TL;DR
This study creates a standardized collection of lung disease gene expression data to improve research and treatment development.
Contribution
The novel contribution is a curated, harmonized transcriptomics dataset for interstitial lung diseases with standardized metadata and gene expression comparisons.
Findings
A compendium of 30 transcriptomics datasets (1371 samples) was curated and harmonized for interstitial lung diseases.
Differentially expressed genes between ILD and healthy samples were identified and provided.
Co-expression networks for IPF and healthy samples were inferred and included in the dataset.
Abstract
This study provides manually curated and homogenised transcriptomics data of interstitial lung disease (ILD) patients retrieved from the NCBI Gene Expression Omnibus and European Nucleotide Archive repositories. The compendium includes 30 transcriptomics datasets generated with DNA microarrays and RNA sequencing (RNA-seq) technologies for a total of 1371 samples. All the datasets underwent metadata curation and harmonisation, data quality check, and preprocessing with standardised procedures. Furthermore, a robust data model was developed to standardise phenotypic data, thereby enhancing comparability across heterogeneous datasets. Gene expression data and lists of differentially expressed genes computed between ILD and healthy samples are provided. Among the ILDs included in this study, idiopathic pulmonary fibrosis (IPF) is the most represented worldwide. Co-expression networks of IPF…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInterstitial Lung Diseases and Idiopathic Pulmonary Fibrosis · Single-cell and spatial transcriptomics · Systemic Sclerosis and Related Diseases
