# Immunogenetic Architecture of Chronic Lymphocytic Leukemia at Early Stage: Insights from the O-CLL1 Cohort

**Authors:** Davide Bagnara, Andrea Nicola Mazzarello, Monica Colombo, Ennio Nano, Niccolò Cardente, Fabiana Ferrero, Nadia Bertola, Vanessa Cossu, Fabio Ghiotto, Adalberto Ibatici, Emanuele Angelucci, Antonino Neri, Massimo Gentile, Fortunato Morabito, Manlio Ferrarini, Giovanna Cutrona, Franco Fais

PMC · DOI: 10.3390/antib15020025 · 2026-03-18

## TL;DR

This study examines the immunoglobulin gene patterns in early-stage chronic lymphocytic leukemia, revealing insights into how the disease evolves.

## Contribution

The study identifies a biased immunoglobulin repertoire in early CLL, highlighting antigen-driven selection as a key factor.

## Key findings

- The IGHV repertoire in early CLL closely mirrors that of a real-world cohort, suggesting it is defined at diagnosis.
- Subset #4 is over-represented in mutated IGHV cases, indicating selective expansion rather than recombinational bias.
- Subset #4 cases retain canonical HCDR3 motifs and show similar treatment timelines to other mutated CLL cases.

## Abstract

Background/Objectives: The immunoglobulin heavy-chain variable (IGHV) gene repertoire represents a characteristic feature of chronic lymphocytic leukemia (CLL), although its configuration is not well defined at the early disease stages. The IGHV repertoire of a cohort of early CLL patients was analyzed and compared to that of a “real-world” reference cohort. Methods: Patients from the O-CLL1 observational protocol, which enrolled only Binet stage A cases within twelve months from diagnosis, were studied. IGHV/IGHJ rearrangements were sequenced and annotated following ERIC recommendations, and stereotyped subsets were assigned using ARResT/AssignSubsets. The repertoire features were compared with the dataset of a real-world cohort of patients with heterogeneous staging (CTR cohort) and with published early-diagnosis series. Results: IGHV and IGHJ gene distributions and HCDR3-length profiles in O-CLL1 closely mirrored those of CTR, indicating that the BcR IG repertoire at diagnosis is already defined rather than being selected during disease progression. Mutated IGHV (M-CLL) predominated, with a frequency of stereotyped BcR IG comparable to that of other early-diagnosis cohorts. However, within this conserved framework, subset #4 was over-represented among M-CLL from O-CLL without an increased overall IGHV4-34 gene usage, suggestive of a selective expansion rather than a recombinational bias. Subset #4 cases retained canonical HCDR3 motifs and showed time-to-first-treatment like other M-CLL, likely reflecting the younger age structure of O-CLL1. Conclusions: Early-diagnosis CLL displays a biased IGHV repertoire with stereotyped configurations characteristic of CLL, including subsets that are rare in the normal B-cell repertoire. These findings support a central role for antigen-driven selection in shaping CLL evolution.

## Linked entities

- **Genes:** IGH (immunoglobulin heavy locus) [NCBI Gene 3492], IGH (immunoglobulin heavy locus) [NCBI Gene 3492], IGHV4-34 (immunoglobulin heavy variable 4-34) [NCBI Gene 28395]
- **Diseases:** chronic lymphocytic leukemia (MONDO:0004948), CLL (MONDO:0004948)

## Full-text entities

- **Genes:** NOTCH1 (notch receptor 1) [NCBI Gene 4851] {aka AOS5, AOVD1, TAN1, hN1}, BCR (BCR activator of RhoGEF and GTPase) [NCBI Gene 613] {aka ALL, BCR1, CML, D22S11, D22S662, PHL}, IGKV@ (immunoglobulin kappa variable cluster) [NCBI Gene 3519] {aka IGKV, IGKV1, IGKV1@, IGKV2, IGKV2@, IGKV3}, CD38 (CD38 molecule) [NCBI Gene 952] {aka ADPRC 1, ADPRC1, cADPR1}, TP53 (tumor protein p53) [NCBI Gene 7157] {aka BCC7, BMFS5, LFS1, P53, TRP53}, IGH (immunoglobulin heavy locus) [NCBI Gene 3492] {aka IGD1, IGH.1@, IGH@, IGHD@, IGHDY1, IGHJ}, IGLV@ (immunoglobulin lambda variable cluster) [NCBI Gene 3546] {aka IGLV}, HLA-G (major histocompatibility complex, class I, G) [NCBI Gene 3135] {aka MHC-G}, FCER2 (Fc epsilon receptor II) [NCBI Gene 2208] {aka BLAST-2, CD23, CD23A, CLEC4J, FCE2, FCErII}, IGHV4-34 (immunoglobulin heavy variable 4-34) [NCBI Gene 28395] {aka IGHV434, VH}, LOC102723407 (immunoglobulin heavy variable 4-38-2-like) [NCBI Gene 102723407] {aka IGHV4, IGHV4-30, IGHV4-38-2, IGHV4-39, IGHV4-b, IGVH4-39}, SF3B1 (splicing factor 3b subunit 1) [NCBI Gene 23451] {aka Hsh155, MDS, PRP10, PRPF10, SAP155, SF3b155}, CLEC12A (C-type lectin domain family 12 member A) [NCBI Gene 160364] {aka CD371, CLL-1, CLL1, DCAL-2, MICL, hKLRL1}, CD5 (CD5 molecule) [NCBI Gene 921] {aka LEU1, T1}, ZAP70 (zeta chain of T cell receptor associated protein kinase 70) [NCBI Gene 7535] {aka ADMIO2, IMD48, SRK, STCD, STD, TZK}, IGHD (immunoglobulin heavy constant delta) [NCBI Gene 3495], MBL3P (mannose-binding lectin family member 3, pseudogene) [NCBI Gene 50639] {aka COLEC2, MBL}, CD22 (CD22 molecule) [NCBI Gene 933] {aka SIGLEC-2, SIGLEC2}
- **Diseases:** lymphoid malignancy (MESH:D008223), chromosomal abnormalities (MESH:D002869), injury to (MESH:D014947), -CLL (MESH:D015451), Cancer (MESH:D009369), leukemic (MESH:D007938), B-cell lymphocytosis (MESH:D015448)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13010646/full.md

---
Source: https://tomesphere.com/paper/PMC13010646