BIOT: Cross-data Biosignal Learning in the Wild
Chaoqi Yang, M. Brandon Westover, Jimeng Sun

TL;DR
This paper introduces BIOT, a versatile biosignal transformer model that enables cross-data learning across various biosignal formats, improving performance in tasks like seizure detection by leveraging multi-source training.
Contribution
The paper proposes a novel biosignal transformer that tokenizes diverse biosignals into unified sentences, allowing cross-data learning and pre-training across different biosignal datasets.
Findings
BIOT outperforms baselines in EEG, ECG, and activity signals.
Pre-trained BIOT models improve seizure detection accuracy.
Versatile application across multiple biosignal formats.
Abstract
Biological signals, such as electroencephalograms (EEG), play a crucial role in numerous clinical applications, exhibiting diverse data formats and quality profiles. Current deep learning models for biosignals are typically specialized for specific datasets and clinical settings, limiting their broader applicability. Motivated by the success of large language models in text processing, we explore the development of foundational models that are trained from multiple data sources and can be fine-tuned on different downstream biosignal tasks. To overcome the unique challenges associated with biosignals of various formats, such as mismatched channels, variable sample lengths, and prevalent missing values, we propose a Biosignal Transformer (\method). The proposed \method model can enable cross-data learning with mismatched channels, variable lengths, and missing values by tokenizing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEEG and Brain-Computer Interfaces · ECG Monitoring and Analysis · Phonocardiography and Auscultation Techniques
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Dense Connections · Adam · Residual Connection · Absolute Position Encodings · Softmax · Layer Normalization
