A cis-regulatory lexicon of DNA motif combinations mediating cell-type-specific gene regulation. Cell genomics Donohue, L. K., Guo, M. G., Zhao, Y., Jung, N., Bussat, R. T., Kim, D. S., Neela, P. H., Kellman, L. N., Garcia, O. S., Meyers, R. M., Altman, R. B., Khavari, P. A. 2022; 2 (11)

Abstract

Gene expression is controlled by transcription factors (TFs) that bind cognate DNA motif sequences in cis-regulatory elements (CREs). The combinations of DNA motifs acting within homeostasis and disease, however, are unclear. Gene expression, chromatin accessibility, TF footprinting, and H3K27ac-dependent DNA looping data were generated and a random-forest-based model was applied to identify 7,531 cell-type-specific cis-regulatory modules (CRMs) across 15 diploid human cell types. A co-enrichment framework within CRMs nominated 838 cell-type-specific, recurrent heterotypic DNA motif combinations (DMCs), which were functionally validated using massively parallel reporter assays. Cancer cells engaged DMCs linked to neoplasia-enabling processes operative in normal cells while also activating new DMCs only seen in the neoplastic state. This integrative approach identifies cell-type-specific cis-regulatory combinatorial DNA motifs in diverse normal and diseased human cells and represents a general framework for deciphering cis-regulatory sequence logic in gene regulation.

View details for DOI 10.1016/j.xgen.2022.100191

View details for PubMedID 36742369