Grad-seq guides the discovery of ProQ as a major small RNA-binding protein

Proc Natl Acad Sci U S A. 2016 Oct 11;113(41):11591-11596. doi: 10.1073/pnas.1609981113. Epub 2016 Sep 26.

Abstract

The functional annotation of transcriptomes and identification of noncoding RNA (ncRNA) classes has been greatly facilitated by the advent of next-generation RNA sequencing which, by reading the nucleotide order of transcripts, theoretically allows the rapid profiling of all transcripts in a cell. However, primary sequence per se is a poor predictor of function, as ncRNAs dramatically vary in length and structure and often lack identifiable motifs. Therefore, to visualize an informative RNA landscape of organisms with potentially new RNA biology that are emerging from microbiome and environmental studies requires the use of more functionally relevant criteria. One such criterion is the association of RNAs with functionally important cognate RNA-binding proteins. Here we analyze the full ensemble of cellular RNAs using gradient profiling by sequencing (Grad-seq) in the bacterial pathogen Salmonella enterica, partitioning its coding and noncoding transcripts based on their network of RNA-protein interactions. In addition to capturing established RNA classes based on their biochemical profiles, the Grad-seq approach enabled the discovery of an overlooked large collective of structured small RNAs that form stable complexes with the conserved protein ProQ. We show that ProQ is an abundant RNA-binding protein with a wide range of ligands and a global influence on Salmonella gene expression. Given its generic ability to chart a functional RNA landscape irrespective of transcript length and sequence diversity, Grad-seq promises to define functional RNA classes and major RNA-binding proteins in both model species and genetically intractable organisms.

Keywords: Hfq; ProQ; RNA–protein interaction; noncoding RNA; small RNA.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bacterial Proteins / metabolism*
  • Conserved Sequence / genetics
  • High-Throughput Nucleotide Sequencing / methods*
  • Ligands
  • Principal Component Analysis
  • RNA, Bacterial / genetics
  • RNA, Bacterial / metabolism*
  • RNA, Untranslated / genetics
  • RNA-Binding Proteins / metabolism*
  • Salmonella enterica / metabolism*

Substances

  • Bacterial Proteins
  • Ligands
  • RNA, Bacterial
  • RNA, Untranslated
  • RNA-Binding Proteins