TraumaICDBERT, A Natural Language Processing Algorithm to Extract Injury ICD-10 Diagnosis Code from Free Text. Annals of Surgery. Choi, J., Chen, Y., Sivura, A., Vendrow, E. B., Wang, J., Spain, D. A. 2023

Abstract

OBJECTIVE: To develop and validate TraumaICDBERT, a natural language processing algorithm to predict injury ICD-10 diagnosis codes from trauma tertiary survey notes.

SUMMARY BACKGROUND DATA: The adoption of ICD-10 diagnosis codes in clinical settings for injury prediction is hindered by the lack of real-time availability. Existing natural language processing algorithms have limitations in accurately predicting injury ICD-10 diagnosis codes.

METHODS: Trauma tertiary survey notes from hospital encounters of adults between January 2016 and June 2021 were used to develop and validate TraumaICDBERT, an algorithm based on BioLinkBERT. The performance of TraumaICDBERT was compared to Amazon Web Services Comprehend Medical, an existing natural language processing tool.

RESULTS: A dataset of 3,478 tertiary survey notes with 15,762 4-character injury ICD-10 diagnosis codes was analyzed. TraumaICDBERT outperformed Amazon Web Services Comprehend Medical across all evaluated metrics. On average, each tertiary survey note was associated with 3.8 (standard deviation: 2.9) trauma registrar-extracted 4-character injury ICD-10 diagnosis codes.

CONCLUSIONS: TraumaICDBERT demonstrates promising initial performance in predicting injury ICD-10 diagnosis codes from trauma tertiary survey notes, potentially facilitating the adoption of downstream prediction tools in clinical settings.

DOI: 10.1097/SLA.0000000000006107

PubMed ID: 37753654