The Usefulness of 2-[18F]FDG PET or PET/CT in Extranodal Natural Killer/T-Cell Lymphoma: A Systematic Review and Meta-Analysis
Domenico Albano, Carlo Rodella, Alessandra Tucci, Giorgio Treglia, Francesco Bertagna, Arturo Chiti, Federico Fallanca

TL;DR
This study reviews how 2-[18F]FDG PET/CT can help in diagnosing and managing a rare and aggressive lymphoma called ENKTCL.
Contribution
The paper provides a systematic review and meta-analysis on the clinical utility of 2-[18F]FDG PET/CT in ENKTCL.
Findings
2-[18F]FDG PET/CT improves staging accuracy compared to conventional imaging.
It has high negative predictive value for bone marrow involvement in early-stage disease.
PET parameters may help predict patient prognosis.
Abstract
Background/Objectives: Extranodal NK-/T-cell lymphoma (ENKTCL) is a rare and highly aggressive lymphoma with a bad prognosis. The aim of our analysis is to evaluate existing research on the potential usefulness of fluorine-18-fluorodeoxyglucose positron emission tomography or positron/computed tomography (2-[18F]FDG PET or PET/CT) in the management of patients with ENKTCL. Methods: A complete search of the literature was conducted across Scopus, PubMed/MEDLINE, and Embase databases, focusing on articles published up to March 2025. Results: A total of 21 studies that investigated the role of 2-[18F]FDG PET or PET/CT in ENKTCL were included in our analysis. The main findings from the literature analysis were (1) 2-[18F]FDG PET/CT seems to be helpful in staging settings, showing a better diagnostic performance than conventional imaging and a positive impact on clinical stage; (2)…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLymphoma Diagnosis and Treatment · Sarcoma Diagnosis and Treatment · CNS Lymphoma Diagnosis and Treatment
1. Introduction
Extranodal NK-/T-cell lymphoma (ENKTCL) is a rare and highly aggressive lymphoma that shows worthy racial and geographic differences worldwide. These lymphomas are mainly extranodal and are characterized by significant vascular damage and tissue destruction, with “nasal-type” as the most frequent localization [1]. In the pathogenesis of this cancer, Epstein–Barr virus (EBV) is thought to play a crucial role. Immunohistochemistry tests, including EBV-encoded small RNA in situ hybridization, are fundamental in the diagnostic process [2,3]. Progress has been made in understanding the disease; in the mid-1990s, it was demonstrated that ENKTCL production of P-glycoprotein was due to the expression of the MDR1/ABCB1 gene. Additionally, the apoptotic effect of asparaginase on NKTCL cell lines was shown in vitro. Despite the introduction of non-anthracycline-based chemotherapy regimens and the use of L-asparaginase, which have improved survival rates, the prognosis remains poor in cases of refractory or relapsed disease. For accurate staging of this disease, fluorine-18 fluorodeoxyglucose positron emission tomography (2-[18F]FDG PET) is a non-invasive essential tool due to the increased glucose metabolism and consequent increased radiopharmaceutical uptake of this aggressive lymphoma variant. For localized disease, a combination of chemotherapy and radiotherapy using non-anthracycline-based regimens is suggested, while for advanced stages, regimens containing L-asparaginase have shown improved survival outcomes, although relapsed and refractory cases have very poor survival [4]. Despite advancements in understanding the disease’s pathways, upfront treatment primarily involves chemotherapy and radiotherapy, and treatment-related mortality remains significant [5]. For this reason, an accurate and early diagnosis and staging might significantly impact patient outcomes. Moreover, a better approach to identifying high-risk patients seems to be essential. Due to the rarity of the disease, international collaborations and clinical trials are shareable to shape the future of ENKTCL diagnosis and treatment.
2. Materials and Methods
2.1. Protocol
The current systematic review was carried out following a preset protocol, and the “Preferred Reporting Items for a Systematic Review and Meta-Analysis” (PRISMA 2020 statement) served as a guideline for its development and reporting [6]. The complete PRISMA checklist is in the Supplementary Materials (Table S1). Pre-registering was not carried out. As a first step, a direct review query using the Population, Intervention, Comparator, and Outcomes (PICO) framework was conducted: “What is the role (“outcome”) of 2-[18F]FDG PET or PET/CT (“intervention”) in patients with ENKTCL (“population”) compared or not to other imaging methods (“comparator”)?”. Two investigators (D.A. and F.F.) independently performed the literature search, the study selection, the data extraction, and the quality evaluation. In case of disagreements, a third opinion (C.R.) was asked.
2.2. Search Strategy
A thorough search of the PubMed/MEDLINE, Scopus, and Embase databases was performed to identify relevant published articles regarding the use of 2-[18F]FDG PET or PET/CT in patients with ENKTCL.
Moreover, specific research on the ClinicalTrials.gov database for ongoing investigations (access date: 1 March 2025) was performed. We used a search algorithm based on a combination of the following terms: (1) “PET” OR “positron emission tomography” AND (2) “FDG” OR “fluorodeoxyglucose” AND (3) “ENKTCL” OR “natural killer/T cell”. Our research had no beginning date limit and was updated until 1 March 2025. Only articles in the English language were selected. To enlarge our research, references of the retrieved articles were also screened for searching additional papers. For the management of these articles, we used EndNote Basic 2025 (ThompsonReuters, Philadelpia, PA, USA).
2.3. Study Selection
Studies or subsets within studies that examined the utility of 2-[18F]FDG PET or PET/CT in patients with ENKTCL were considered eligible for inclusion. The exclusion criteria included (1) articles outside the relevant field; (2) review articles, meta-analyses, letters, conference proceedings, and editorials; and (3) case reports or small case series involving fewer than 10 patients with ENKTCL. Two researchers (F.F. and D.A.) independently screened the titles and abstracts based on these criteria and subsequently reviewed the full texts to assess their relevance. All studies included in the systematic review were incorporated into the meta-analysis if they provided sufficient data to evaluate the diagnostic performance of the test and if there was no risk of overlap with other studies from the same research group.
2.4. Data Extraction and Collection
To minimize potential biases, two researchers independently reviewed each study and extracted data from the full manuscripts, including figures and tables. For every included study, we gathered general information such as the first author, year of publication, country, study design, funding sources, number of participants, age, and gender. We also collected technical details like the type of scanner used, the radiopharmaceutical dose administered, uptake time, and image analysis methods. Additionally, data on the sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and overall accuracy of 2-[18F]FDG PET or PET/CT were extracted. The key findings from the included articles are summarized in the tables and detailed in Section 3.
2.5. Quality Assessment (Risk of Bias Assessment)
A quality assessment of the included articles was conducted to evaluate the risk of bias in each study related to the review question. Four areas—patient selection; the index test; the reference standard; and flow and timing—were examined for potential bias. Additionally, three aspects—patient selection; the index test; and the reference standard—were assessed for applicability concerns using the QUADAS-2 tool (Bristol, UK) [7].
2.6. Statistical Analysis
Diagnostic accuracy metrics were derived from each included study whenever possible, based on a per-patient analysis and bone marrow evaluation, using data such as true positives, false positives, true negatives, and false negatives. The main outcomes for the quantitative analysis were pooled sensitivity and specificity, calculated with a bivariate random-effects model that accounts for potential correlation between these two measures [8]. Additionally, the authors determined pooled positive and negative likelihood ratios (LR+ and LR−) and the diagnostic odds ratio (DOR). All pooled estimates were reported with 95% confidence intervals (95% CI). To provide a comprehensive overview of the diagnostic performance, a summary receiver operating characteristic (SROC) curve was also generated, illustrating the relationship between sensitivity and specificity [8]. If significant statistical heterogeneity was detected, subgroup analyses were planned, considering factors such as features of the index test, patient characteristics, technical aspects, and clinical scenarios. The presence and extent of heterogeneity were assessed using the I-squared (I^2^) index, with values over 50% indicating substantial heterogeneity [8]. For the statistical analysis, we used the open-source software OpenMeta Analyst^®^ (Brown University, Providence, RI, USA, version 10.12).
Concerning forest plots, they visually summarize the results of individual studies and the overall effect in a meta-analysis. They allow for quick assessment of the consistency in the direction and magnitude of the effect across studies. Each line represents a study, with a square indicating the effect size and a horizontal line showing the confidence interval. The size of the square reflects the study’s weight in the meta-analysis. A vertical line typically marks the line with no effect (e.g., odds ratio or risk ratio of 1). If most confidence intervals cross this line, it suggests inconsistency or non-significant results. In contrast, if the effect sizes are generally on the same side of the line and the confidence intervals do not overlap widely, this indicates consistency and potential significance. The presence of heterogeneity can also be assessed visually—wide variation in effect sizes and overlapping confidence intervals suggest heterogeneity; which is formally tested using statistical measures such as I^2^.
3. Results
3.1. Literature Search
During the literature search across the chosen databases, we identified a total of 131 records. After reviewing the titles and abstracts, 110 articles were excluded because they did not fall within the scope of our research—specifically; 76 were unrelated to the topic; 27 were small case series or case reports; 5 were reviews or editorials; and 2 were preclinical studies. Ultimately, 21 studies [9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29] were selected for full-text review and included in this systematic review (Figure 1).
No other articles were added after the revision of the references of the selected records.
3.2. Studies and Patients Data
The main features of the articles (all retrospective) included in the qualitative analysis were detailed in Table 1 and Table 2. The selected articles were published between 2007 and 2025, especially in China (n = 12), followed by Japan (n = 3), Korea (n = 3), and the USA (n = 3). Funding sources were reported only in five studies. The median/mean age ranged from 35.2 to 61 years, and males were more prevalent than females in all studies. The main purposes investigated in these studies were the diagnostic role of 2-[18F]FDG PET/CT in the staging (n = 10), bone marrow (BM) evaluation (n = 8), and the prognostic role of PET/CT (n = 8). In all studies except for 3, the scanner utilized was a hybrid PET/CT.
3.3. Risk of Bias and Applicability
The overall assessment of the risk of bias and concerns about the applicability of the studies included in the systematic review, based on QUADAS-2, is summarized in Figure 2. The average radiotracer activity administered varied widely across studies. When expressed as relative values, the activity ranged from 3.5 to 4.5 MBq/kg, whereas in absolute terms, it varied from 185 to 740 MBq. The time interval between injection and scanning was approximately 60 min in all studies. PET images were evaluated qualitatively in all cases, and in 18 studies, a semiquantitative approach was also used. Among the semiquantitative parameters, the most commonly measured PET feature was the maximum standardized uptake value (SUVmax), followed by metabolic tumor volume (MTV), total lesion glycolysis (TLG), and texture features.
3.4. Staging
Eight studies, including 137 patients, were included in our bivariate patient-based meta-analysis [9,10,11,13,14,15,16,17]. The pooled sensitivity, specificity, PPV, NPV, and accuracy of 2-[18F]FDG PET or PET/CT were 97% (95% CI: 92–99%), 57% (95% CI: 18–90%), 95% (95% CI: 90–98%), 50% (24–76%), and 95% (95% CI: 90–98%), respectively (Table 3). The SROC curve is represented in Figure 3 and demonstrated a good diagnostic performance of 2-[18F]FDG PET/CT. The pooled LR−, LR+, and DOR were 0.145 (95% CI: 0.096–0.219), 1.457 (95% CI: 0.965–2.202), and 16.246 (95% CI: 3.541–74.529), respectively (Figure 4 and Figure 5). There was no statistically significant heterogeneity among the studies for any of the metrics assessed, as indicated by an I^2^ value of less than 50%.
3.5. Bone Marrow Evaluation
For the bivariate patient-based meta-analysis, 8 researches including 1175 patients were included [9,13,16,18,19,21,22,23]. The pooled sensitivity, specificity, PPV, NPV, and accuracy of 2-[18F]FDG PET or PET/CT were 64% (95% CI: 55–72%), 93% (95% CI: 91–95%), 56% (95% CI: 49–62%), 94% (95% CI: 94–96%), and 90% (95% CI: 88–91%), respectively (Table 4). The SROC curve is represented in Figure 6 and demonstrated a good diagnostic performance of 2-[18F]FDG PET/CT. The pooled LR−, LR+, and DOR were 0.303 (95% CI: 0.178–0.513), 8.371 (95% CI: 4.236–16.546), and 24.612 (95% CI: 9.800–61.811), respectively (Figure 7 and Figure 8). A significant heterogeneity among the studies was found for all the metrics evaluated (I^2^ > 50%).
3.6. Prognosis
Regarding the prognostic role of 2-[18F]FDG PET/CT in ENKTCL, eight studies are available in the literature, reporting a significant role of PET/CT features in predicting prognosis [12,20,24,25,26,27,28,29] (Table 5). First, Suh C. et al. [12], in a small population (n = 21), showed that SUVmax using a cutoff of 5.5 could predict poorer disease-specific survival. However, this cut-off was not confirmed by subsequent research that derived a higher value as the cut-off [28]. Also, the difference of SUVmax between baseline and interim PET/CT had a strong prognostic power, but also in this case, different thresholds were suggested [26,29]. Beyond semiquantitative variables, visual analysis also demonstrated a prognostic impact in this disease. Treatment response expressed by Deauville scores significantly predicted PFS and OS, especially Deauville score 5 [20,28]. Texture analysis was studied only in two papers [24,27] with promising results: deep learning models including PET features may help to stratify prognosis (PFS and OS).
4. Discussion
2-[18F]FDG PET/CT is a functional imaging test that has been widely used in the staging, prognosis, and treatment response valuation of Hodgkin lymphoma (HL) and various types of B-cell non-Hodgkin lymphomas (NHL) [30,31]. However, specific studies about the role of PET/CT in ENKTCL are relatively rare and have heterogeneous results. The value of 2-[18F]FDG PET/CT in evaluating ENKTCL is still under debate because it is not yet understood due to the low number of studies and patients evaluated in literature, in comparison with other more diffuse lymphomas, like HL, DLBCL, and FL. The first issue to answer is if this lymphoma is 2-[18F]FDG-avid or not. NKTCL is typically aggressive and may present with extranodal involvement, often affecting the nasal cavity, upper respiratory tract, or gastrointestinal system. Since this lymphoma is highly metabolically active, it tends to show up as 2-[18F]FDG-avid disease. In extranodal sites of the disease, lesions may present as small in size and with minimal density variation compared to the surrounding tissues; therefore, they are more easily detectable with PET than conventional morphologic imaging. In the literature, very few cases of false negative scans were reported, all described in one study [11]. In fact, the reported overall PPV of 2-[18F]FDG PET/CT is excellent (95%), while NPV is very low (50%). Also, the specificity is not so high (57%). Thus, the possibility of having false or negative findings is not low. Potential causes of false positive reports are inflammatory/infectious processes, while for negative findings, the size of the lesions or the incorrect patient preparation (high blood glucose level) are potential causes.
Only demonstrating a high detection rate, it is possible to use 2-[18F]FDG PET/CT for evaluating treatment response and prognosis. In case of a positive PET, the uptake is usually quite high, with an average value of SUVmax included between 5.2 and 16, and it seems that it is relatively higher in nasal cavities than extranasal [11]. Detection of BM disease in patients affected by lymphoma is crucial, but it may be challenging owing to sampling errors; it has been previously demonstrated that a third of patients will show unilateral positive BM that could have been missed [32]. Besides, the BM biopsy is considered the reference standard for assessment of BM involvement, but it has several limitations, like the invasive nature of the procedure, the unilateral sampling, the potential complications related to the procedure, and the very small part of BM studied sometimes not being representative of the real disease. For these reasons, we have controversial results in the literature about the role of 2-[18F]FDG PET/CT in BM evaluation in NHL and HL. Concerning DLBCL, a focal 2-[18F]FDG uptake in BM has to be considered a true positive report, avoiding BM biopsy, but a negative PET/CT cannot exclude with certainty BM involvement. For ENKTCL, the ability of 2-[18F]FDG PET/CT in studying BM involvement could be fundamental because BM disease significantly affects the staging and the treatment choice in this disease [33]. Globally, the accuracy of 2-[18F]FDG PET/CT in BM evaluation is good with very high NPV (94%), especially in early-stage patients, but with suboptimal sensitivity (64%) due to the presence of a non-negligible number of false negative reports. One of the limitations of the studies focused upon BM evaluation is the heterogeneity of study designs (I^2^ > 50%), including variations in disease sub-types that may have intrinsically different risks of BM involvement, stages, and interpretation criteria, causing a wide range of reported sensitivity (20–100%) and specificity (75–100%). Also, the definition of positive BM at PET/CT is under debate and needs deep investigation. For the evaluation of BM involvement in lymphoma on 2-[18F]FDG PET/CT, unifocal or multifocal increased uptake is consistently considered positive. However, the diffuse increased uptake pattern is controversial due to the difficulties in discriminating between abnormal and reactive diffuse tracer uptake. Thus it seems not so excessive to suggest avoiding BM biopsy in the case of negative 2-[18F]FDG PET/CT in early-stage patients, while PET may not replace BM biopsy in advanced-stage patients due to the potential risk of false negative findings and the complex histological features of bone marrow involvement in mature T- and natural killer-cell lymphomas [34].
Also, the cases of false positive findings on 2-[18F]FDG PET/CT reported are not so low, causing a moderate specificity. This is a strong risk to consider because the possibility of overstaging is not negligible and merits high consideration.
Considering ENKTCL as a 2-[18F]FDG-avid lymphoma, the ideal treatment response criteria based on PET/CT were Deauville criteria. Deauville scores were introduced for the evaluation of treatment response in 2-[18F]FDG-avid lymphoma, like HL and DLBCL [35], but their predictive and prognostic role was demonstrated also in other lymphoma variants, like Mantle Cell Lymphoma [36] and Burkitt Lymphoma [37]. In the two studies where the prognostic role of Deauville scores was evaluated [20,28], this criterion showed to be an independent prognostic factor. Especially patients with a Deauville score of 5 demonstrated having worse PFS and OS. The potential prognostic role of baseline 2-[18F]FDG PET/CT parameters remains an unresolved issue, with findings that are quite heterogeneous. Actually, almost all publications demonstrated a significant role of PET features in predicting the outcome, but the heterogeneity of the studies included is wide due to the different populations analyzed (for example, for age and stage), variations in sample size, and the kind of treatment. Additionally, the relatively small sample sizes and the use of different semiquantitative metabolic parameters (such as SUVs, ratios, MTV, and TLG) further complicate this analysis. For these reasons, the thresholds derived to stratify ENKTCL patients are different, reducing the potential application of these parameters in the clinical scenario [38]. In this context, the use of artificial intelligence systems could be a strong help.
An interesting point now investigated in the literature is the cost-effectiveness and prioritization in clinical practice of PET/CT in this scenario. Considering the difficulties to stage and investigate ENKTCL, PET/CT could reduce costs related to unnecessary procedures and/or useless treatments. However, well-designed studies with adequate follow-up need to be designed to answer this question. The widespread use in rural or low-income settings must be strategically prioritized. With thoughtful integration—including referral models; focused clinical use; and investment in infrastructure—PET/CT can deliver meaningful benefits without overwhelming limited healthcare budgets. Ultimately, its use should be guided by evidence-based protocols and cost-effectiveness thresholds tailored to local needs and capabilities.
Generally, the main limitation of the literature available about 2-[18F]FDG PET/CT and ENKTCL is the retrospective nature of all the studies included, which increases the risk of selection bias and missing data.
5. Conclusions
Despite the heterogeneity of the included studies, this systematic review and meta-analysis suggest that 2-[18F]FDG PET/CT may play a significant role in the evaluation of ENKTCL patients, particularly in disease staging.
It shows a notably high NPV in identifying BM involvement, especially in those with early-stage disease. However, its overall sensitivity remains low, which means the possibility of missing bone marrow lesions—i.e.; false negatives—should not be overlooked.
However, while preliminary findings regarding its prognostic value are promising, they remain inconclusive. Larger, prospective, multicenter studies are warranted to better define its potential role in the diagnostic workflow.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Wang H. Fu B.B. Gale R.P. Liang Y. NK-/T-cell lymphomas Leukemia 2021352460246810.1038/s 41375-021-01313-234117356 PMC 8410593 · doi ↗ · pubmed ↗
- 2He X. Gao Y. Li Z. Huang H. Review on natural killer/T-cell lymphoma Hematol. Oncol.20234122122910.1002/hon.294434731509 · doi ↗ · pubmed ↗
- 3Yamaguchi M. Oguchi M. Suzuki R. Extranodal NK/T-cell lymphoma: Updates in biology and management strategies Best Pract. Res. Clin. Haematol.20183131532110.1016/j.beha.2018.07.00230213402 · doi ↗ · pubmed ↗
- 4Yan Z. Yao S. Wang Z. Zhou W. Yao Z. Liu Y. Treatment of extranodal NK/T-cell lymphoma: From past to future Front. Immunol.202314108868510.3389/fimmu.2023.108868536825002 PMC 9941192 · doi ↗ · pubmed ↗
- 5Costa R.O. Pereira J. Lage L.A.P.C. Baiocchi O.C.G. Extranodal NK-/T-cell lymphoma, nasal type: What advances have been made in the last decade?Front. Oncol.202313117554510.3389/fonc.2023.117554537529691 PMC 10388588 · doi ↗ · pubmed ↗
- 6Page M.J. Mc Kenzie J.E. Bossuyt P.M. Boutron I. Hoffmann T.C. Mulrow C.D. Shamsser L. Tetzlaff J.M. Akl E. Brenan S.E. The PRISMA 2020 statement: An updated guideline for reporting systematic reviews BMJ 2021372 n 7110.1136/bmj.n 7133782057 PMC 8005924 · doi ↗ · pubmed ↗
- 7Whiting P.F. Rutjes A.W. Westwood M.E. Mallett S. Deeks J.J. Reitsma J.B. Leeflang M.M.G. Sterne J.A.S. Bossuyt P.M.M. QUADAS-2 Group. QUADAS-2: A revised tool for the quality assessment of diagnostic accuracy studies Ann. Intern. Med.201115552953610.7326/0003-4819-155-8-201110180-0000922007046 · doi ↗ · pubmed ↗
- 8Sadeghi R. Treglia G. Systematic reviews and meta-analyses of diagnostic studies: A practical guideline Clin. Transl. Imaging 20175838710.1007/s 40336-016-0219-2 · doi ↗
