An experimentally anchored map of transcriptional start sites in the model cyanobacterium Synechocystis sp. PCC6803

Proc Natl Acad Sci U S A. 2011 Feb 1;108(5):2124-9. doi: 10.1073/pnas.1015154108. Epub 2011 Jan 18.

Abstract

Interest in cyanobacteria has been increasing because these photosynthetic organisms convert solar energy into biomass and because of their potential for the production of biofuels. However, the exploitation of cyanobacteria for bioengineering requires knowledge of their transcriptional organization. Using differential RNA sequencing, we have established a genome-wide map of 3,527 transcriptional start sites (TSS) of the model organism Synechocystis sp. PCC6803. One-third of all TSS were located upstream of an annotated gene; another third were on the reverse complementary strand of 866 genes, suggesting massive antisense transcription. Orphan TSS located in intergenic regions led us to predict 314 noncoding RNAs (ncRNAs). Complementary microarray-based RNA profiling verified a high number of noncoding transcripts and identified strong regulation of ncRNAs. Thus, ∼64% of all TSS give rise to antisense or ncRNAs in a genome that is 87% protein coding. Our data enhance the information on promoters by a factor of 40, suggest the existence of additional small peptide-encoding mRNAs, and provide corrected 5' annotations for many genes of this cyanobacterium. The global TSS map will facilitate the use of Synechocystis sp. PCC6803 as a model organism for further research on photosynthesis and energy.
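
The TSS categories named in the abstract (upstream of an annotated gene, antisense to a gene, and orphan in an intergenic region) can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the `Gene` record, the `classify_tss` function, the extra "internal" label, the 100-nt `upstream_window` default, and the priority order among labels are all assumptions made for illustration, and the coordinates are invented.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Gene:
    """Hypothetical gene record; coordinates are 1-based and inclusive."""
    name: str
    start: int   # leftmost coordinate, start <= end
    end: int     # rightmost coordinate
    strand: str  # "+" or "-"


def classify_tss(pos, strand, genes, upstream_window=100):
    """Label a TSS as 'upstream', 'antisense', 'internal', or 'orphan'.

    upstream_window is an assumed cutoff, not a value taken from the abstract.
    """
    labels = set()
    for g in genes:
        inside = g.start <= pos <= g.end
        if g.strand == strand:
            if inside:
                labels.add("internal")
            elif strand == "+" and g.start - upstream_window <= pos < g.start:
                # Forward-strand gene: upstream means lower coordinates.
                labels.add("upstream")
            elif strand == "-" and g.end < pos <= g.end + upstream_window:
                # Reverse-strand gene: upstream means higher coordinates.
                labels.add("upstream")
        elif inside:
            # TSS lies within a gene annotated on the opposite strand.
            labels.add("antisense")
    # Assumed priority when a TSS matches more than one label.
    for label in ("upstream", "antisense", "internal"):
        if label in labels:
            return label
    return "orphan"


# Invented example annotation, for illustration only.
genes = [Gene("geneA", 1000, 2000, "+"), Gene("geneB", 3000, 4000, "-")]
print(classify_tss(950, "+", genes))   # upstream of geneA
print(classify_tss(1500, "-", genes))  # antisense to geneA
print(classify_tss(2500, "+", genes))  # orphan (intergenic)
```

A real analysis would read gene coordinates from the genome annotation and would need to handle the circular chromosome and plasmids, which this sketch ignores.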

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Base Sequence
  • Genes, Bacterial
  • Molecular Sequence Data
  • Oligonucleotide Array Sequence Analysis
  • Open Reading Frames
  • Photosynthesis
  • RNA, Untranslated / genetics
  • Sequence Homology, Nucleic Acid
  • Synechocystis / genetics*
  • Synechocystis / physiology
  • Transcription, Genetic*

Substances

  • RNA, Untranslated

Associated data

  • GEO/GSE14410
  • GEO/GSE16162