Methodological approach to optimize a step-by-step deterministic linkage of SNDS data with a clinical database (FREGAT) of gastric/gastroesophageal junction adenocarcinoma in France: Pitfalls and learnings
Magali Laborey, Audrey Lajoinie, Jonatan Freilich, Emmanuelle Samalin, Olivier Bouché, Guillaume Piessen, Matthias Stoelzel, Andrew Chilelli

TL;DR
This paper describes a deterministic method for linking two French health databases, the FREGAT clinical database and the national SNDS database, to build a real-world research dataset on gastric/gastroesophageal junction adenocarcinoma.
Contribution
A step-by-step, indirect, deterministic linkage algorithm was developed to connect the FREGAT and SNDS databases for gastric/gastroesophageal junction adenocarcinoma epidemiology.
Findings
1,385 of 1,617 FREGAT patients (85.7%) were successfully linked to the SNDS database.
83.7% of successfully linked patients were matched in the first part of the linkage process.
Abstract
Survival rates in the European population with gastric and gastroesophageal junction (G/GEJ) adenocarcinoma remain low. Epidemiologic research is warranted to understand the population size, unmet need, and current treatment patterns of G/GEJ adenocarcinoma. The objective of this research was to develop an algorithm to link patients across the FRench EsoGAstric Tumours (FREGAT) and Système National des Données de Santé (SNDS) databases to develop a real-world dataset for G/GEJ adenocarcinoma. A step-by-step, indirect, deterministic record linkage algorithm was developed to match patient records from the FREGAT and SNDS databases. Corresponding variables in each data source were matched at an individual level. Each step in the linkage process used a given scoring criterion; the linkage process proceeded until a unique pair of patient records had equal observations across the databases,…
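The step-by-step deterministic linkage described in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the matching variables (`sex`, `birth_year`, `birth_month`, `hospital_id`, `surgery_date`), the order in which they are relaxed, and the toy records are all assumptions made for illustration, since the variables and scoring criteria actually used are not given here. Each step compares the records that are still unlinked on a fixed set of variables and accepts a pair only when the combination of values is unique in both sources.

```python
from collections import defaultdict

# Hypothetical steps, strictest first. The real FREGAT/SNDS variables
# and scoring criteria are not specified in this text.
STEPS = [
    ("sex", "birth_year", "birth_month", "hospital_id", "surgery_date"),
    ("sex", "birth_year", "birth_month", "hospital_id"),
    ("sex", "birth_year", "hospital_id"),
]


def link_stepwise(clinical, claims, steps=STEPS):
    """Return {clinical_id: (claims_id, step_number)} for unique exact matches."""
    links = {}
    used_claims = set()
    for step_no, variables in enumerate(steps, start=1):
        # Index the still-unlinked records of each source by their
        # values on this step's variables.
        claims_index = defaultdict(list)
        for rec in claims:
            if rec["id"] not in used_claims:
                claims_index[tuple(rec[v] for v in variables)].append(rec["id"])
        clinical_index = defaultdict(list)
        for rec in clinical:
            if rec["id"] not in links:
                clinical_index[tuple(rec[v] for v in variables)].append(rec["id"])
        # Accept a pair only if the key identifies exactly one record
        # on each side; ambiguous keys wait for no later rescue here.
        for key, clin_ids in clinical_index.items():
            claim_ids = claims_index.get(key, [])
            if len(clin_ids) == 1 and len(claim_ids) == 1:
                links[clin_ids[0]] = (claim_ids[0], step_no)
                used_claims.add(claim_ids[0])
    return links


def rec(id_, sex, year, month, hospital, surgery):
    """Build a toy record (illustrative data only)."""
    return {"id": id_, "sex": sex, "birth_year": year, "birth_month": month,
            "hospital_id": hospital, "surgery_date": surgery}


clinical = [
    rec("F1", "M", 1950, 3, "H1", "2015-06-01"),
    rec("F2", "F", 1962, 11, "H2", "2016-02-10"),
    rec("F3", "M", 1948, 7, "H3", "2014-09-09"),
]
claims = [
    rec("S1", "M", 1950, 3, "H1", "2015-06-01"),   # identical to F1
    rec("S2", "F", 1962, 11, "H2", "2016-02-12"),  # surgery date differs from F2
    rec("S3", "M", 1948, 7, "H4", "2014-09-09"),   # hospital differs from F3
]

links = link_stepwise(clinical, claims)
linkage_rate = len(links) / len(clinical)
```

In this toy run, F1 is linked at the first step, F2 only once the surgery date is dropped at the second step, and F3 stays unlinked because its hospital never agrees. Recording the step number alongside each link makes it possible to report what share of linked patients was matched at each stage.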
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Taxonomy
Topics: Esophageal Cancer Research and Treatment · Gastric Cancer Management and Outcomes · Pancreatic and Hepatic Oncology Research
