A Data Analytics Framework for Aggregate Data Analysis
Sanket Tavarageri, Nag Mani, Anand Ramasubramanian, Jaskiran Kalsi

TL;DR
This paper introduces a comprehensive framework for inferring individual-level data from aggregate statistics, enabling detailed analysis and machine learning applications in fields with privacy constraints, validated on medical data.
Contribution
It presents a novel algorithm for reconstructing detailed data from summary statistics and an end-to-end pipeline that leverages multiple candidate datasets for improved analysis.
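To make the idea of multiple candidate datasets concrete, here is a minimal sketch. It is not the paper's algorithm. It assumes a toy setting in which only the total count and the marginal counts of two binary attributes are reported. The function name `candidate_datasets` and the example counts are hypothetical. Every feasible value of the unreported joint count yields one individual-level dataset that reproduces the published aggregates.

```python
def candidate_datasets(n, n_a, n_b):
    """Enumerate individual-level datasets consistent with two marginal counts.

    n   : total number of individuals
    n_a : number of individuals with binary attribute A
    n_b : number of individuals with binary attribute B
    Returns a list of candidates; each candidate is a list of (a, b) rows.
    """
    lo = max(0, n_a + n_b - n)  # fewest individuals that must have both A and B
    hi = min(n_a, n_b)          # most individuals that can have both A and B
    candidates = []
    for n_ab in range(lo, hi + 1):
        rows = (
            [(1, 1)] * n_ab                        # both attributes
            + [(1, 0)] * (n_a - n_ab)              # A only
            + [(0, 1)] * (n_b - n_ab)              # B only
            + [(0, 0)] * (n - n_a - n_b + n_ab)    # neither
        )
        candidates.append(rows)
    return candidates


# Hypothetical aggregates: 10 patients, 6 with attribute A, 7 with attribute B.
cands = candidate_datasets(10, 6, 7)
for rows in cands:
    both = sum(1 for a, b in rows if a and b)
    print(f"rows={len(rows)} both={both}")
```

Each candidate matches the reported totals exactly but differs in how the attributes co-occur, which is the uncertainty a downstream model would need to account for, for example by training on every candidate and combining the results.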
Findings
Effective reconstruction of fine-grained data from aggregate statistics.
Parallel architecture reduces uncertainty in inferred data.
Approach validated on a medical dataset for traumatic coagulopathy.
Abstract
In many contexts, we have access to aggregate data, but individual-level data is unavailable. For example, medical studies sometimes report only aggregate statistics about disease prevalence because of privacy concerns. Even so, it is often desirable, and sometimes necessary, to infer individual-level characteristics from aggregate data. For instance, other researchers who want to perform a more detailed analysis of disease characteristics would require individual-level data. Similar challenges arise in other fields, including politics and marketing. In this paper, we present an end-to-end pipeline for processing aggregate data to derive individual-level statistics and then using the inferred data to train machine learning models to answer questions of interest. We describe a novel algorithm for reconstructing fine-grained data from summary statistics. This step will…
Taxonomy
Topics: Trauma and Emergency Care Studies · Trauma, Hemostasis, Coagulopathy, Resuscitation · Autopsy Techniques and Outcomes
