Presenting DiaData for Research on Type 1 Diabetes
Beyza Cinar, Maria Maleshkova

TL;DR
This paper introduces DiaData, a large integrated dataset of 2510 T1D patients with extensive glucose and heart rate data, to facilitate research and machine learning models for hypoglycemia prediction.
Contribution
It systematically combines 15 datasets into a comprehensive database, assesses data quality, and explores correlations between glucose and heart rate data for hypoglycemia prediction.
Findings
The dataset contains 149 million measurements, with 4% in hypoglycemic range.
Data imbalance and missing values are significant challenges.
A correlation between heart rate and glucose levels is observed 15-55 minutes before hypoglycemia.
Abstract
Type 1 diabetes (T1D) is an autoimmune disorder that leads to the destruction of insulin-producing cells, resulting in insulin deficiency, as to why the affected individuals depend on external insulin injections. However, insulin can decrease blood glucose levels and can cause hypoglycemia. Hypoglycemia is a severe event of low blood glucose levels (70 mg/dL) with dangerous side effects of dizziness, coma, or death. Data analysis can significantly enhance diabetes care by identifying personal patterns and trends leading to adverse events. Especially, machine learning (ML) models can predict glucose levels and provide early alarms. However, diabetes and hypoglycemia research is limited by the unavailability of large datasets. Thus, this work systematically integrates 15 datasets to provide a large database of 2510 subjects with glucose measurements recorded every 5 minutes. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
