Annotating Non‐Coding RNAs with Rfam
1Wellcome Trust Sanger Institute Wellcome Trust Genome Campus, Hinxton, Cambridgeshire, United Kingdom
Abstract
Non-coding RNA (ncRNA) genes produce a functional RNA product, rather than a translated protein. The range and importance of such genes is only recently apparent, with known ncRNAs participating in a wide range of structural, regulatory, and catalytic roles within the cell. Like protein-coding genes, multiple sequence alignments of families of ncRNAs tell us much about their structure and function, and enable the formulation of statistical models for the detection of related sequences. Rfam is a database of families of ncRNAs, represented by structure-annotated multiple sequence alignments and covariance models.
Keywords: non-coding RNA; multiple sequence alignment; genome annotation
Table of Contents
- Unit Introduction
- Basic Protocol: Finding Members of a Family via the Rfam Web Interface
- Alternate Protocol: Using Rfam and Infernal to Identify ncRNAs in Genomic DNA
- Guidelines for Understanding Results
- Commentary
- Appendix: Rfam Alignment Formats
- Literature Cited
- Figures
- Tables
Figures
-
Figure 12.5.1A typical Rfam sequence search results page. The table in the upper part of the figure summarizes the hits, locations, and scores; the alignments of each subsequence to the model are shown in the lower part of the figure. The first sequence line shows the simplified consensus sequence represented by the model as generated by the search software, with the bottom line showing the subsequence match of the query. The central line shows bases conserved between the model consensus and the query, with the base-paired secondary structure depicted as nested parentheses above (see Appendix for more information).
-
Figure 12.5.2The Rfam Genomes page for Clostridium perfringens from the Rfam U.K. Web site. The table describes the families identified in the genome, together with types and counts. The graphic on the left shows the locations of all hits in the genome sequence.
-
Figure 12.5.3The Rfam family page for the nuclear RNase P family.
-
Figure 12.5.4The scheme for construction of an Rfam family. The programs cmbuild, cmsearch, and cmalign are components of the INFERNAL software. See text for more detail.
-
Figure 12.5.5The Rfam Seed alignment for the U12 minor spliceosomal RNA family.
-
Figure 12.5.6The SS_cons line (per-column) annotation describes the consensus base-paired secondary structure of a given alignment, encoded as nested sets of brackets.
Literature Cited
| Literature Cited | |
| Bartel, D.P. 2004. MicroRNAs: Genomics, biogenesis, mechanism, and function. Cell 116:281-297. | |
| Doudna, J.A. and Cech, T.R. 2002. The chemical repertoire of natural ribozymes. Nature 418:222-228. | |
| Durbin, R., Eddy, S., Krogh, A., and Mitchison, G. 1998. Biological sequence analysis: Probabilistic models of proteins and nucleic acids. Cambridge University Press. Cambridge, U.K. | |
| Griffiths-Jones, S., Bateman, A., Marshall, M., Khanna, A., and Eddy, S.R. 2003. Rfam: An RNA family database. Nucleic Acids Res. 31:439-441. | |
| Griffiths-Jones, S., Moxon, S., Marshall, M., Khanna, A., Eddy, S.R., and Bateman, A. 2005. Rfam: Annotating non-coding RNAs in complete genomes. Nucleic Acids Res. 33:D121-D124. | |
| Kiss, T. 2002. Small nucleolar RNAs: An abundant group of noncoding RNAs with diverse cellular functions. Cell 109:145-148. | |
| Internet Resources | |
| http://www.sanger.ac.uk/Software/Rfam/ | |
| http://rfam.wustl.edu/ | |
| Rfam home pages. | |
| ftp://ftp.sanger.ac.uk/pub/databases/Rfam/ | |
| ftp://ftp.genetics.wustl.edu/pub/eddy/Rfam/ | |
| Rfam FTP sites. | |
| http://infernal.wustl.edu/ | |
| INFERNAL home page. | |
Did you know we publish 20-30 new protocols monthly? Stay informed! Sign up for NEW PROTOCOL ALERTS.
PUBLISH YOUR PROTOCOL on CurrentProtocols.com.
Read our editors' blog for news, commentaries, and the latest developments in methods in and out of the lab.
Tools & Calculators
Your Recently Viewed Protocols
- Detecting Gene‐Gene Interaction in Linkage Analysis
- T Cell Depletion by Cytotoxic Elimination
- High‐Resolution Multi‐Contrast MRI of the Carotid Artery Wall for Evaluation of Atherosclerotic Plaques
- 3‐D Reconstruction of Neurons from Multichannel Confocal Laser Scanning Image Series
- Plug Flow Cytometry




Join the Conversation
Post new comment