CoSMeTIC: Zero-Knowledge Computational Sparse Merkle Trees with Inclusion-Exclusion Proofs for Clinical Research
Mohammad Shahid, Paritosh Ramanan, Mohammad Fili, Guiping Hu, Hillel Haim

TL;DR
CoSMeTIC introduces a zero-knowledge framework using Sparse Merkle Trees to enable privacy-preserving, verifiable inclusion proofs in clinical data, balancing privacy with regulatory accountability.
Contribution
It proposes a novel zero-knowledge computational approach with Sparse Merkle Trees for verifiable data inclusion in clinical research, ensuring privacy and compliance.
Findings
Achieves strong privacy guarantees with verifiable proofs.
Maintains statistical fidelity in genomic analyses.
Demonstrates scalability and practicality in real-world datasets.
Abstract
Analysis of clinical data is a cornerstone of biomedical research with applications in areas such as genomic testing and response characterization of therapeutic drugs. Maintaining strict privacy controls is essential because such data typically contains personally identifiable health information of patients. At the same time, regulatory compliance often requires study managers to demonstrate the integrity and authenticity of participant data used in analyses. Balancing these competing requirements, privacy preservation and verifiable accountability, remains a critical challenge. In this paper, we present CoSMeTIC, a zero-knowledge computational framework that proposes computational Sparse Merkle Trees (SMTs) as a means to generate verifiable inclusion and exclusion proofs for individual participants' data in clinical studies. We formally analyze the zero-knowledge properties of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Ethics in Clinical Research · Machine Learning in Healthcare
