RNA-GPS predicts high-resolution RNA subcellular localization and highlights the role of splicing. RNA (New York, N.Y.) Wu, K. E., Parker, K. R., Fazal, F. M., Chang, H., Zou, J. 2020

Abstract

Subcellular localization is essential to RNA biogenesis, processing, and function across the gene expression life cycle. However, the specific nucleotide sequence motifs that direct RNA localization are incompletely understood. Fortunately, new sequencing technologies have provided transcriptome-wide atlases of RNA localization, creating an opportunity to leverage computational modeling. Here we present RNA-GPS, a new machine learning model that uses nucleotide-level features to predict RNA localization across 8 different subcellular locations - the first to provide such a wide range of predictions. RNA-GPS's design enables high throughput sequence ablation and feature importance analyses to probe the sequence motifs that drive localization prediction. We find localization informative motifs to be concentrated on 3' UTRs and scattered along the coding sequence, and motifs related to splicing to be important drivers of predicted localization, even for cytotopic distinctions for membraneless bodies within the nucleus or for organelles within the cytoplasm. Overall, our results suggest transcript splicing is one of many elements influencing RNA subcellular localization.

View details for DOI 10.1261/rna.074161.119

View details for PubMedID 32220894