Omni-iEEG: A Large-Scale, Comprehensive iEEG Dataset and Benchmark for Epilepsy Research
Chenda Duan, Yipeng Zhang, Sotaro Kanai, Yuanyi Ding, Atsuro Daida, Pengyue Yu, Tiancheng Zheng, Naoto Kuroda, Shaun A. Hussain, Eishi Asano, Hiroki Nariai, Vwani Roychowdhury

TL;DR
Omni-iEEG is a large, standardized, and annotated intracranial EEG dataset designed to facilitate reproducible and clinically relevant epilepsy research, enabling systematic benchmarking and model development.
Contribution
The paper introduces Omni-iEEG, a comprehensive, harmonized iEEG dataset with extensive annotations and clinical metadata, addressing previous limitations in data inconsistency and lack of benchmarks.
Findings
Dataset includes 302 patients and 178 hours of recordings.
Over 36,000 expert-validated pathological event annotations.
Defines clinically meaningful tasks with evaluation metrics.
Abstract
Epilepsy affects over 50 million people worldwide, and one-third of patients suffer drug-resistant seizures where surgery offers the best chance of seizure freedom. Accurate localization of the epileptogenic zone (EZ) relies on intracranial EEG (iEEG). Clinical workflows, however, remain constrained by labor-intensive manual review. At the same time, existing data-driven approaches are typically developed on single-center datasets that are inconsistent in format and metadata, lack standardized benchmarks, and rarely release pathological event annotations, creating barriers to reproducibility, cross-center validation, and clinical relevance. With extensive efforts to reconcile heterogeneous iEEG formats, metadata, and recordings across publicly available sources, we present , a large-scale, pre-surgical iEEG resource comprising and $\textbf{178…
Peer Reviews
Decision·ICLR 2026 Poster
1. Consolidates fragmented iEEG datasets into a unified benchmark with consistent structure and task definitions, which could materially improve comparability in the area. 2. Tasks and evaluation targets are tied to familiar clinical surrogates, increasing practical relevance. 3. Adds a sizable layer of expert-validated event annotations (spkHFOs) with a described protocol and agreement checks. 4. Includes both biomarker-driven and long-context end-to-end baselines, highlighting trade-offs an
1. It’s hard to separate what is newly curated/validated post-merge (re-labeling, unified clinical ontology, normalized resection masks, QC decisions) from what is simply inherited. Please enumerate concrete new artifacts. 2. Pooled or random subject splits are insufficient for a multi-center resource. The paper needs leave-one-center-out/per-center reporting for the primary tasks, not only a subset, to demonstrate robustness to site effects. 3. “Harmonized” is described procedurally (e.g., re
1.The Omni-iEEG dataset contains data from multiple epilepsy centers, offering a large sample size and diversity, which better represents different populations and pathological types. 2.The dataset's annotations are performed by experts, ensuring high quality and accuracy, making it suitable for machine learning model training and clinical applications. 3.The dataset provides standardized benchmark tests, enabling other researchers to evaluate the dataset easily, promoting model and method compa
1.The paper focuses mainly on dataset construction, while ICLR typically emphasizes innovation in algorithms, models, or methods. The dataset's contribution does not highlight any novelty in algorithms or methods, which may not meet ICLR's review standards. 2.No concrete application scenarios presented: While the paper mentions the potential applications of the dataset, it does not demonstrate its actual effectiveness or value through specific benchmark tests or application scenarios, lacking re
- **Originality**: First large-scale, multi-center iEEG dataset with harmonized annotations and clinical metadata. - **Quality**: Expert-validated data and diverse tasks ensure robustness and clinical relevance. - **Clarity**: Clear task definitions and dataset structure, supported by visuals. - **Significance**: Bridges ML and epilepsy research, enhancing reproducibility and translatability.
- **Methodological Flaws**: Inter-rater reliability for 36K annotations is not quantified, risking bias. HFO detection algorithm selection lacks justification. - **Experimental Gaps**: No baseline model performance or cross-validation results are provided for benchmark tasks. Transfer learning potential is theoretical without empirical support. - **Oversight**: Data privacy protocols beyond de-identification are unclear. Scalability of annotation processes for future expansions is unaddressed. -
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEpilepsy research and treatment · EEG and Brain-Computer Interfaces · Functional Brain Connectivity Studies
