Bit-Identical Medical Deep Learning via Structured Orthogonal Initialization
Yakov Pyotr Shkolnikov

TL;DR
This paper introduces a verified bit-identical training framework for deep learning that eliminates sources of randomness, ensuring reproducibility and consistent predictions across runs, especially on rare clinical classes.
Contribution
The authors propose a structured orthogonal initialization and deterministic training pipeline that guarantees identical models across independent runs, improving reproducibility in medical deep learning.
Findings
Significantly reduces variance in ECG rhythm classification.
Achieves MD5-verified identical weights across runs.
Maintains performance on standard benchmarks and improves consistency on rare classes.
Abstract
Deep learning training is non-deterministic: identical code with different random seeds produces models that agree on aggregate metrics but disagree on individual predictions, with per-class AUC swings exceeding 20 percentage points on rare clinical classes. We present a framework for verified bit-identical training that eliminates three sources of randomness: weight initialization (via structured orthogonal basis functions), batch ordering (via golden ratio scheduling), and non-deterministic GPU operations (via architecture selection and custom autograd). The pipeline produces MD5-verified identical trained weights across independent runs. On PTB-XL ECG rhythm classification, structured initialization significantly exceeds Kaiming across two architectures (n=20; Conformer p = 0.016, Baseline p < 0.001), reducing aggregate variance by 2-3x and reducing per-class variability on rare…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
