In:
Nature Communications, Springer Science and Business Media LLC, Vol. 13, No. 1 ( 2022-08-30)
Abstract:
Design of de novo synthetic regulatory DNA is a promising avenue to control gene expression in biotechnology and medicine. Using mutagenesis typically requires screening sizable random DNA libraries, which limits the designs to span merely a short section of the promoter and restricts their control of gene expression. Here, we prototype a deep learning strategy based on generative adversarial networks (GAN) by learning directly from genomic and transcriptomic data. Our ExpressionGAN can traverse the entire regulatory sequence-expression landscape in a gene-specific manner, generating regulatory DNA with prespecified target mRNA levels spanning the whole gene regulatory structure including coding and adjacent non-coding regions. Despite high sequence divergence from natural DNA, in vivo measurements show that 57% of the highly-expressed synthetic sequences surpass the expression levels of highly-expressed natural controls. This demonstrates the applicability and relevance of deep generative design to expand our knowledge and control of gene expression regulation in any desired organism, condition or tissue.
Type of Medium:
Online Resource
ISSN:
2041-1723
DOI:
10.1038/s41467-022-32818-8
Language:
English
Publisher:
Springer Science and Business Media LLC
Publication Date:
2022
detail.hit.zdb_id:
2553671-0