An Approach for Determining the Reliability of Manual and Digital Scoring of Sleep Stages. Sleep. Gerardy, B., Kuna, S. T., Pack, A., Kushida, C. A., Walsh, J. K., Staley, B., Pien, G. W., Younes, M. 2023.

Abstract

STUDY OBJECTIVES: Inter-scorer variability in sleep staging is largely due to equivocal epochs that contain features of more than one stage. We propose an approach that recognizes the existence of equivocal epochs and evaluates scorers accordingly.

METHODS: Epoch-by-epoch staging was performed on 70 polysomnograms by six qualified technologists and by a digital system (MSS). The probability that epochs assigned the same stage by only two of the six technologists (minority score) resulted from the random occurrence of two errors was calculated and found to be <5%, indicating that the stage assigned is an acceptable variant for the epoch. Acceptable stages were identified in each epoch as stages assigned by at least two technologists. Percent agreement between each technologist and the other five technologists, acting as judges, was determined. Agreement was considered to exist if the stage assigned by the tested scorer was one of the acceptable stages for the epoch. The stage assigned by MSS was likewise considered in agreement if it was among the acceptable stages determined by the technologists.

RESULTS: Agreement of technologists tested against five qualified judges increased from 80.8% (range 70.5-86.4% among technologists) when using the majority rule to 96.1% (89.8-98.5%) with the proposed approach. Agreement between unedited MSS and the same judges was 90.0% and increased to 92.1% after brief editing.

CONCLUSIONS: Accounting for equivocal epochs provides a more accurate estimate of a scorer's (human or digital) competence in scoring sleep stages and reduces inter-scorer disagreements. The proposed approach can be implemented in sleep scoring training and accreditation programs.
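The agreement rule described in the methods can be illustrated with a minimal Python sketch. It is not the authors' implementation. The function names, the stage labels, and the toy data are hypothetical, and the sketch assumes that the acceptable stages for an epoch are derived only from the judges' scores (the scorers other than the one being tested), which the abstract does not state explicitly.

```python
from collections import Counter


def acceptable_stages(judge_scores, min_votes=2):
    """Return the set of stages assigned by at least `min_votes` judges.

    Assumption: only the judges' scores define what is acceptable.
    The set can be empty if every judge assigned a different stage.
    """
    counts = Counter(judge_scores)
    return {stage for stage, n in counts.items() if n >= min_votes}


def percent_agreement(tested, judges, min_votes=2):
    """Percent of epochs where the tested scorer's stage is acceptable.

    tested: one stage label per epoch from the scorer under test.
    judges: one list of judge stage labels per epoch.
    """
    if len(tested) != len(judges):
        raise ValueError("tested and judges must cover the same epochs")
    if not tested:
        raise ValueError("at least one epoch is required")
    hits = sum(
        1
        for stage, judge_scores in zip(tested, judges)
        if stage in acceptable_stages(judge_scores, min_votes)
    )
    return 100.0 * hits / len(tested)


# Toy data (invented for illustration): four epochs, five judges each.
judges = [
    ["N2", "N2", "N2", "N1", "N1"],  # equivocal: N2 and N1 both acceptable
    ["W", "W", "W", "W", "W"],       # unanimous: only W acceptable
    ["N3", "N3", "N2", "N2", "N2"],  # equivocal: N3 and N2 both acceptable
    ["R", "R", "R", "R", "N1"],      # single N1 vote is not acceptable
]
tested = ["N1", "W", "N3", "N1"]

print(percent_agreement(tested, judges))  # 75.0
```

In this toy example the tested scorer agrees on three of four epochs. Under a strict majority rule the first and third epochs would count as disagreements, which is the gap the proposed approach is meant to close.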

DOI: 10.1093/sleep/zsad248

PubMed ID: 37712522