# A CT Dataset with RECIST Measurements and Comprehensive Segmentation Masks for Tumors and Lymph Nodes

**Authors:** Roberto Rojas-Pizarro, Constanza Vásquez-Venegas, Gonzalo Pereira, María F. Eyssautier, Felipe Bravo-Bahamóndez, Nicolás Sanhueza, Paulina Gallardo-Badilla, Francisca Caro-Flores, Camila Ormeño-Candia, Felipe Santander, Nicolás Pérez, María M. Molina, Gonzalo Rojas, Steffen Härtel, Guillermo Cabrera-Vives

PMC · DOI: 10.1038/s41597-026-06597-6 · Scientific Data · 2026-01-20

## TL;DR

This paper introduces a new CT dataset with detailed tumor and lymph node annotations to support AI development in cancer treatment assessment.

## Contribution

The novel contribution is a publicly available dataset with 1,246 manually segmented lesions and RECIST-compliant measurements from Latin American patients.

## Key findings

- The dataset includes 1,246 manually segmented lesions from 58 CT scans of 22 cancer patients.
- It features 82 target lesions with RECIST 1.1-compliant diameter measurements for AI validation and benchmarking.
- The dataset supports global representation in medical AI development by including data from a Latin American institution.

## Abstract

The Response Evaluation Criteria in Solid Tumors (RECIST 1.1) protocol is the gold standard for assessing treatment response in oncological clinical trials and routine practice. It requires radiologists to review and select appropriate target lesions and perform precise diameter measurements, making the process labor-intensive and variable. Artificial Intelligence (AI) holds great promise for automating this workflow, but progress is hindered by the lack of public datasets with comprehensive lesion annotations and RECIST-compliant measurements. We address this gap by presenting a dataset of 1,246 manually segmented lesions from 58 CT scans of 22 cancer patients treated at the Clinical Hospital of the University of Chile (HCUCH). All cases were evaluated under RECIST 1.1, with diameter measurements reported for 82 target lesions. This resource supports diverse applications, including validating automated RECIST tools, applying radiomics to study metastatic heterogeneity, benchmarking segmentation algorithms, and advancing foundation models in medical imaging. By including data from a Latin American institution, this dataset also promotes global representation in the development of generalizable medical AI tools.

## Linked entities

- **Diseases:** cancer (MONDO:0004992)

## Full-text entities

- **Diseases:** lesion (MESH:D009059), Solid Tumors (MESH:D009369)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12917031/full.md

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12917031/full.md

## References

2 references — full list in the complete paper: https://tomesphere.com/paper/PMC12917031/full.md

---
Source: https://tomesphere.com/paper/PMC12917031