World of ScoreCraft: Novel Multi‐Scorer Experiment on the Impact of a Decision Support System in Sleep Staging
Benedikt Holm, Arnar Óskarsson, Björn Elvar Þorleifsson, Hörður Þór Hafsteinsson, Sigríður Sigurðardóttir, Heiður Grétarsdóttir, Kenan Hoelke, Gabriel Marc Marie Jouan, Thomas Penzel, Erna Sif Arnardottir, María Óskarsdóttir

TL;DR
This study explores how decision support systems affect sleep scoring accuracy and finds that accurate recommendations improve results regardless of their source.
Contribution
A novel online platform was used to investigate the impact of recommendation sources on sleep staging accuracy and decision-making.
Findings
Correct recommendations significantly improved scoring accuracy for both traditional and self-applied PSGs.
No significant bias was observed toward AI-generated versus human-generated recommendations.
Traditional PSGs were scored slightly more accurately than self-applied PSGs, though the difference was not statistically significant.
Abstract
Manual scoring of polysomnography (PSG) is a time‐intensive task, prone to inter‐scorer variability that can impact diagnostic reliability. This study investigates the integration of decision support systems (DSS) into PSG scoring workflows, focusing on their effects on accuracy, scoring time and potential biases toward recommendations from artificial intelligence (AI) compared to human‐generated recommendations. Using a novel online scoring platform, we conducted a repeated‐measures study with sleep technologists, who scored traditional and self‐applied PSGs. Participants were occasionally presented with recommendations labelled as either human‐ or AI‐generated. As the goal of this study was to isolate the effect of perceived recommendation sources on scorer behaviour, all recommendations were human‐generated. We found that traditional PSGs tended to be scored slightly more accurately…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11
Figure 12Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPatient Safety and Medication Errors · Healthcare Technology and Patient Monitoring · EEG and Brain-Computer Interfaces
