Integrating electronic health record, cancer registry, and geospatial data to study lung cancer in Asian American, Native Hawaiian and Pacific Islander ethnic groups. Cancer epidemiology, biomarkers & prevention : a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology DeRouen, M. C., Thompson, C. A., Canchola, A. J., Jin, A., Nie, S., Wong, C., Jain, J., Lichtensztajn, D. Y., Li, Y., Allen, L., Patel, M. I., Daida, Y. G., Luft, H. S., Shariff-Marco, S., Reynolds, P., Wakelee, H. A., Liang, S., Waitzfelder, B. E., Cheng, I., Gomez, S. L. 2021

Abstract

BACKGROUND: A relatively high proportion of Asian American, Native Hawaiian, and Pacific Islander (AANHPI) females with lung cancer have never smoked. We used an integrative data approach to assemble a large-scale cohort to study lung cancer risk among AANHPI by smoking status with attention to representation of specific AANHPI ethnic groups.METHODS: We leveraged electronic health records (EHRs) from two healthcare systems-Sutter Health in northern California and Kaiser Permanente Hawai'i- that have high representation of AANHPI populations. We linked EHR data on lung cancer risk factors (i.e., smoking, lung diseases, infections, reproductive factors, and body size) to data on incident lung cancer diagnoses from statewide population-based cancer registries of California and Hawai'i for the period 2000-2013. Geocoded address data were linked to data on neighborhood contextual factors and regional air pollutants.RESULTS: The dataset comprises over 2.2 million adult females and males of any race/ethnicity. Over 250,000 are AANHPI females (19.6% of the female study population). Smoking status is available for over 95% of individuals. The dataset includes 7,274 lung cancer cases, including 613 cases among AANHPI females. Prevalence of never-smoking status varied greatly among AANHPI females with incident lung cancer, from 85.7% among Asian Indian to 14.4% among Native Hawaiian females.CONCLUSION: We have developed a large, multilevel dataset particularly well-suited to conduct prospective studies of lung cancer risk among AANHPI females who never smoked.IMPACT: The integrative data approach is an effective way to conduct cancer research assessing multilevel factors on cancer outcomes among small populations.

View details for DOI 10.1158/1055-9965.EPI-21-0019

View details for PubMedID 34001502