Dialogue: Validation of eight endotypes of lupus based on whole-blood RNA profiles
Lina-Marcela Diaz-Gallo

Abstract
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
- —http://dx.doi.org/10.13039/501100007857Stiftelsen Konung Gustaf V:s 80-årsfond
- —http://dx.doi.org/10.13039/501100007949Reumatikerförbundet
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSystemic Lupus Erythematosus Research · Atherosclerosis and Cardiovascular Diseases · Lipid metabolism and disorders
Effectively addressing the heterogeneity of SLE holds great promise for enhancing diagnosis and therapeutic strategies. This approach involves identifying distinct molecular signatures of SLE that can be useful in developing personalised diagnostic tools that account for individual variation, tailoring treatment plans, designing models for monitoring disease activity and even considering disease prevention. An increasing number of studies have contributed to this effort. Typically, each study uses a specific type of data—such as transcriptomics13 or autoantibody profile—to identify distinct SLE patterns, which are then tested for enrichment in clinical manifestations, disease activity or long-term outcomes. Among these strategies, the recently published validation study by Hubbard et al1 and its methodological predecessor2 focused on leveraging transcriptomic data to define SLE endotypes.
In this follow-up study,1 researchers used their earlier framework to categorise individuals with SLE into eight molecular subtypes.2 Using RNA sequencing data from 101 individuals enrolled in three separate clinical trials,1 the authors successfully applied their previously developed random forest model.2 This is a type of supervised machine learning—a method that teaches computers to recognise patterns from labelled examples and make classifications on unseen data. In this case, the model used the expression of gene sets to assign each patient to one of the eight predefined endotypes. Their results indicate a categorisation of the studied individuals into a transcriptome gradient that spans from endotypes closer to health to those reflecting more disease-related disturbances. Although differences in clinical features among the endotypes were modest, certain characteristics appeared more frequently in specific subgroups, particularly when comparisons were made between endotype pairs. An interesting relationship described by Hubbard et al1 is the enrichment of specific genetic ancestries within various transcriptome endotypes, which is related to a transcriptome - genetic ancestry - autoantibody positivity relationship previously described in the context of SLE by the same research group4 and used to define the gene sets for their SLE classification models. These findings echo other observations, where individuals of African ancestry tend to cluster in autoantibody-defined groups characterised by higher disease activity.5 It will be valuable to determine how much of that relationship is attributable to genetic risk factors or other components, such as environmental conditions rooted in social disparities.
A strength of Hubbard et al’s study1 is their use of the same method to analyse participants’ transcriptome (RNA sequencing), which helps reduce technical differences and potential confounders, often a limitation of this type of approach. Their findings support the value of transcriptome data for biological understanding and clinical applications, aligning with results from similar studies by other groups.3 Notably, these studies13 demonstrate how bioinformatics can reveal essential insights from various clinical trials and offer a method for combining diverse datasets. To maximise impact, it is critical to clearly communicate the studies—especially complex bioinformatics aspects—so rheumatologists and other researchers can grasp and apply the main findings.
There is still room to improve how studies are designed and carried out to understand the complexity of SLE. For instance, it is challenging to establish the influence of demographic, environmental and disease variables in the data used to subgroup individuals with SLE, such as whole blood transcriptomics, as in the referred studies,13 or autoantibody profiles in other approaches,5 6 which can affect the observations and conclusions. It is similar to the egg and chicken dilemma: are the disturbances in the expression of immune-related genes or specific autoantibody profiles a consequence of demographic and clinical variables, or vice versa? These challenges could be overcome by addressing appropriate sampling time points, implementing longitudinal designs, ensuring a broad and deep data collection and applying appropriate analyses. For example, studying individuals at risk of developing SLE and collecting data on immunological markers, omics, environmental factors and clinical variables over several years of follow-up will enable us to dissect the causal processes, which in turn will facilitate the development of disease prevention and prediction strategies.
Several approaches exist for identifying clinically meaningful SLE subgroups, all of which rely on the quality and breadth of input data. Improving data collection and overall study design requires collaboration among multidisciplinary experts, as well as the public and private sectors. Finally, as a research community, we should ensure that studies are comparable and independently validated. There is a tendency to overlook replication studies, which can undermine the value of reproducibility, a fundamental scientific principle.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Hubbard E Bachali P Grammer AC et al Validation of eight endotypes of lupus based on whole-blood RNA profiles Lupus Sci Med 202512 e 00152610.1136/lupus-2025-00152640368437 PMC 12083367 · doi ↗ · pubmed ↗
- 2Hubbard EL Bachali P Kingsmore KM et al Analysis of transcriptomic features reveals molecular endotypes of SLE with clinical implications Genome Med 2023158410.1186/s 13073-023-01237-937845772 PMC 10578040 · doi ↗ · pubmed ↗
- 3Toro-Domínguez D Martorell-Marugán J Martinez-Bueno M et al Scoring personalized molecular portraits identify Systemic Lupus Erythematosus subtypes and predict individualized drug responses, symptomatology and disease progression Brief Bioinform 202223 bbac 33210.1093/bib/bbac 33235947992 PMC 9487588 · doi ↗ · pubmed ↗
- 4Catalina MD Bachali P Yeo AE et al Patient ancestry significantly contributes to molecular heterogeneity of systemic lupus erythematosus JCI Insight 20205 e 14038010.1172/jci.insight.14038032759501 PMC 7455079 · doi ↗ · pubmed ↗
- 5Choi MY Chen I Clarke AE et al Machine learning identifies clusters of longitudinal autoantibody profiles predictive of systemic lupus erythematosus disease outcomes Ann Rheum Dis 2023829273610.1136/ard-2022-22380837085289 PMC 11293954 · doi ↗ · pubmed ↗
- 6Diaz-Gallo L-M Oke V Lundström E et al Four Systemic Lupus Erythematosus Subgroups, Defined by Autoantibodies Status, Differ Regarding HLA-DRB 1 Genotype Associations and Immunological and Clinical Manifestations ACR Open Rheumatol 20224273910.1002/acr 2.1134334658170 PMC 8754019 · doi ↗ · pubmed ↗
