BACKGROUND: Efficient capture of routine clinical care and patient outcomes are needed at a population-level, as is evidence on important treatment-related side effects and their effect on well-being and clinical outcomes. The increasing availability of electronic health records (EHRs) offers new opportunities to generate population-level patient-centered evidence on oncological care that can better guide treatment decisions and patient-valued care.METHODS: This study includes patients seeking care at an academic medical center, 2008-2018. Digital data sources are combined to address missingness, inaccuracy, and noise common to EHR data. Clinical concepts were identified and extracted from EHR unstructured data using natural language processing (NLP) and machine/deep learning techniques. All models are trained, tested, and validated on independent data samples using standard metrics.RESULTS: We provide use cases for using EHR data to assess guideline adherence and quality measurements among cancer patients. Pretreatment assessment was evaluated by guideline adherence and quality metrics for cancer staging metrics. Patient outcomes included treatment-related side-effects and patient-reported outcomes.CONCLUSIONS: Advanced technologies applied to EHRs present opportunities to advance population-level quality assessment, to learn from routinely collected clinical data for personalized treatment, and to augment epidemiological and population health studies. The effective use of digital data can inform patient-valued care, quality initiatives and policy guidelines.IMPACT: A comprehensive set of health data analyzed with advanced technologies results in a unique resource that facilitates wide-ranging, innovative, and impactful research on prostate cancer. This work demonstrates novel use of EHRs and technology to advance epidemiological studies and benefit oncological care.
View details for DOI 10.1158/1055-9965.EPI-19-0873
View details for PubMedID 32066619