# How do clinician and parent-reported data differ? An analysis of similarity and difference in the datasets from a cross-syndrome genetics cohort study (GenROC)

**Authors:** Karen Jaqueline Low, Huw Day, Mevmi Lasanya Kodippuli Thanthilla, Suzanne Alsters, Charlotte Davis, Helen V Firth, Caroline Wright

PMC · DOI: 10.1136/jmg-2025-111193 · 2026-01-19

## TL;DR

This study compares parent-reported and clinician-reported data in a genetics cohort, finding differences in the types and details of phenotypic information collected.

## Contribution

The paper provides a detailed comparison of parent and clinician-reported data across multiple systems and genes in a cross-syndrome cohort.

## Key findings

- Parent-reported data included more terms for dental, gastroenterology, immunology, and respiratory systems.
- Clinician-reported data provided more detail for most gene subgroups and neurology.
- Similarity scores were highest for cardiac and lowest for ENT systems.

## Abstract

Parent/patient-reported (PRD) datasets provide ready access to phenotypic data for monogenic neurodevelopmental disorders, yet their concordance with clinical data is unclear.

In the GenROC study, 547 children (mean age 7.6 years, balanced sex ratio) had parallel parent-reported web questionnaires and clinician-reported (CRD) Human Phenotype Ontology proformas. We compared the two sources per participant by system, gene and gene group and overall for quantity, detail and similarity.

547 probands were analysed ranging in age from infancy to 16 years (mean 7.6) with similar gender distribution. PRD provided more terms for dental, gastroenterology, immunology and respiratory systems and for vision (p<0.001 for all) and to a lesser degree for cardiac (p=0.0012). CRD provides more detail than PRD for most gene subgroups, combined systems and for neurology (p<0.001). Similarity scores were low overall per participant (mean 0.38 for combined). Similarity scores were highest for cardiac (mean 0.74) and lowest for Ear/Nose/Throat(ENT) (mean 0.34). There was minimal difference in similarity scores across gene groups or between the top 10 genes—scaffold adaptor gene groups had the highest (mean 0.43) as did STXBP1 (mean 0.5) and CACNA1A (0.49). CRD is more similar to published syndrome phenotypes for syndromic genes.

Parents reported more common childhood phenotypes, such as asthma and dental issues, while clinicians provided clinical phenotype descriptors, such as brain morphology and seizure semiology. It is important to understand the differences when designing studies and using datasets to appreciate their strengths and limitations.

## Linked entities

- **Genes:** STXBP1 (syntaxin binding protein 1) [NCBI Gene 6812], CACNA1A (calcium voltage-gated channel subunit alpha1 A) [NCBI Gene 773]

## Full-text entities

- **Genes:** CACNA1A (calcium voltage-gated channel subunit alpha1 A) [NCBI Gene 773] {aka APCA, BI, CACNL1A4, CAV2.1, DEE42, EA2}, STXBP1 (syntaxin binding protein 1) [NCBI Gene 6812] {aka DEE4, MUNC18-1, N-Sec1, NSEC1, P67, RBSEC1}
- **Diseases:** neurodevelopmental disorders (MESH:D002658), asthma (MESH:D001249), seizure (MESH:D012640)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13018764/full.md

---
Source: https://tomesphere.com/paper/PMC13018764