Incognito rRNA and rDNA in databases and libraries.

Published January 1, 1997. Vol 7 Issue 1, pp. 65-70. https://doi.org/10.1101/gr.7.1.65

Abstract

Both ribosomal DNA (rDNA) and ribosomal RNA (rRNA) are over-represented in the starting material for genomic and cDNA libraries; thus, their sequences have the potential to enter the various databases repeatedly. When rDNA (both transcribed and intergenic spacer regions) is used as a query sequence, a great number of matches that are not identified as rDNA are found in the databases, particularly in the EST database and, to a lesser extent, among genomic sequences and STSs. We discuss the following explanations for the widespread occurrence of rDNA in cDNA and genomic DNA libraries: pseudogenes of rRNA in other genomic locations, mRNA-derived pseudogenes that reside in rDNA, cDNAs derived from rRNA [either by self-priming or by internal oligo(dT) priming], cDNAs derived from actual transcripts of the rDNA intergenic spacer, and genomic DNA contamination of RNA preparations. Because so many database entries contain unidentified rDNA, we recommend that all sequence submissions be checked (by the submitters) for the presence of structural RNAs in addition to repetitive sequences.
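
The recommended pre-submission check could be approximated, very roughly, as follows. This is a minimal sketch and not the authors' method: it swaps a proper alignment search for a simple shared k-mer comparison on both strands. The function names, the k-mer size, the 0.5 threshold, and the toy reference sequence are all hypothetical choices made for illustration; a real screen would use an alignment tool against a curated set of rRNA and rDNA sequences.

```python
def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (unknown bases become N)."""
    comp = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}
    return "".join(comp.get(base, "N") for base in reversed(seq.upper()))


def kmers(seq, k):
    """Return the set of all overlapping k-mers in a sequence."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def rdna_kmer_fraction(query, reference, k=8):
    """Fraction of the query's distinct k-mers found in the reference on either strand."""
    query_kmers = kmers(query, k)
    if not query_kmers:
        # Query shorter than k: nothing to compare.
        return 0.0
    ref_kmers = kmers(reference, k) | kmers(reverse_complement(reference), k)
    return len(query_kmers & ref_kmers) / len(query_kmers)


def looks_like_rdna(query, reference, k=8, threshold=0.5):
    """Flag a query whose shared k-mer fraction meets a (hypothetical) threshold."""
    return rdna_kmer_fraction(query, reference, k) >= threshold


if __name__ == "__main__":
    # Invented toy sequence standing in for an rDNA reference; not real rDNA.
    toy_reference = "ACGTTGCAAGGCTTAACCGGATCGATTACG"
    candidate = toy_reference[5:20]
    print(looks_like_rdna(candidate, toy_reference))
    print(looks_like_rdna(reverse_complement(candidate), toy_reference))
```

Comparing against both strands matters here because a cDNA or genomic clone may have been sequenced in either orientation relative to the reference.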
