User Ratings

Your rating: None
Your rating: None
Your rating: None
Add your comments

Using the Structure‐Function Linkage Database to Characterize Functional Domains in Enzymes

Shoshana Brown1,  Patricia Babbitt1

1University of California, San Francisco, San Francisco, California

Unit Number: 
Unit 2.10
DOI: 
10.1002/0471250953.bi0210s13
Online Posting Date: 
March, 2006
GO TO THE FULL TEXT:
PDF or HTML at Wiley Online Library
Are you the author of this protocol? Login or register and return to this page.
Learn more about the author(s): 
Shoshana Brown

Abstract

The Structure-Function Linkage Database (SFLD; http://sfld.rbvi.ucsf.edu/) is a Web-accessible database designed to link enzyme sequence, structure, and functional information. This unit describes the protocols by which a user may query the database to predict the function of newly sequenced enzymes and to correct misannotated functional assignments for enzymes currently in public databases. It is especially useful in helping a user discriminate functional capabilities of a sequence that is only distantly related to characterized sequences in publicly available databases.

Keywords: superfamily analysis; sequence analysis; structure-function relationships; function prediction; annotation transfer

     
 
GO TO THE FULL PROTOCOL:
PDF or HTML at Wiley Online Library

Table of Contents

  • Unit Introduction
  • Basic Protocol 1: Using the SFLD to Predict the Function of an Uncharacterized Enzyme
  • Basic Protocol 2: Using the SFLD to Correct Misannotated Functional Assignments
  • Guidelines for Understanding Results
  • Commentary
  • Literature Cited
  • Figures
     
 
GO TO THE FULL PROTOCOL:
PDF or HTML at Wiley Online Library

Figures

  • Figure 2.10.1
    The SFLD homepage.

  • Figure 2.10.2
    The SFLD Search by Enzyme page with a protein sequence pasted into the Protein Sequence box.

  • Figure 2.10.3
    SFLD sequence search results. Results are displayed in tabular format, sorted by HMMER E-value.

  • Figure 2.10.4
    Alignment of a query sequence with the SFLD galactonate dehydratase family. The query sequence is the top sequence in the alignment.

  • Figure 2.10.5
    Alignment of a query sequence with the SFLD galactonate dehydratase family. Family-specific catalytic residues are automatically highlighted. Note that the catalytic residue position numbering given in the table below the alignment is keyed to an alignment without a query sequence, and therefore may not correspond to the numbering on alignments that include query sequences.

  • Figure 2.10.6
    A dendrogram of the SFLD galactonate dehydratase family, plus a query sequence.

  • Figure 2.10.7
    The SFLD family page for the galactonate dehydratase family.

  • Figure 2.10.8
    The SFLD family page for the muconate cycloisomerase family. Note the second display panel, which provides a pictorial view of the active site for a representative family member with a solved crystal structure.

  • Figure 2.10.9
    The NCBI homepage with Protein selected in the Search listbox and gi number 23100420 specified as the protein sequence for which to search.

  • Figure 2.10.10
    NCBI summary protein search results for gi number 23100420.

  • Figure 2.10.11
    The SFLD Browse by Reaction page.

  • Figure 2.10.12
    Alignment of a query sequence with the SFLD muconate cycloisomerase family. Family-specific catalytic residues are automatically highlighted.

Literature Cited

Literature Cited
    Babbitt, P.C. and Gerlt, J.A. 1997. Understanding enzyme superfamilies: Chemistry as the fundamental determinant in the evolution of new catalytic activities. J. Biol. Chem. 272:30591-30594.
    Babbitt, P.C., Mrachko, G.T., Hasson, M.S., Huisman, G.W., Kolter, R., Ringe, D., Petsko, G.A., Kenyon, G.L., and Gerlt, J.A. 1995. A functionally diverse enzyme superfamily that abstracts the alpha protons of carboxylic acids. Science 267:1159-1161.
    Brenner, S.E. 1999. Errors in genome annotation. Trends Genet. 15:132-133.
    Chenna, R., Sugawara, H., Koike, T., Lopez, R., Gibson, T.J., Higgins, D.G., and Thompson, J.D. 2003. Multiple sequence alignment with the Clustal series of programs. Nucl. Acids Res. 31:3497-3500.
    Deacon, J. and Cooper, R.A. 1977. d-Galactonate utilisation by enteric bacteria: The catabolic pathway in Escherichia coli. FEBS Lett. 77:201-205.
    Devos, D. and Valencia, A. 2001. Intrinsic errors in genome annotation. Trends Genet. 17:429-431.
    Eddy, S.R. 1998. Profile hidden Markov models. Bioinformatics 14:755-763.
    Eulberg, D., Lakner, S., Golovleva, L.A., and Schlomann, M. 1998. Characterization of a protocatechuate catabolic gene cluster from Rhodococcus opacus 1CP: Evidence for a merged enzyme with 4-carboxymuconolactone-decarboxylating and 3-oxoadipate enol-lactone-hydrolyzing activity. J. Bacteriol. 180:1072-1081.
    Gerlt, J.A. and Babbitt, P.C. 2001. Divergent evolution of enzymatic function: Mechanistically diverse superfamilies and functionally distinct suprafamilies. Annu. Rev. Biochem. 70:209-246.
    Gilks, W.R., Audit, B., De Angelis, D., Tsoka, S., and Ouzounis, C.A. 2002. Modeling the percolation of annotation errors in a database of protein sequences. Bioinformatics 18:1641-1649.
    Horowitz, N.H. 1945. On the evolution of biochemical syntheses. Proc. Natl. Acad. Sci. U.S.A. 31:153-157.
    Horowitz, N.H. 1965. The evolution of biochemical syntheses: Retrospect and prospect. In Evolving Genes and Proteins (V. Bryson and H.J. Vogel, eds.) pp. 15-23. Academic Press, New York.
    Jensen, R.A. 1976. Enzyme recruitment in evolution of new function. Annu. Rev. Microbiol. 30:409-425.
    Pegg, S.C., Brown, S., Ojha, S., Huang, C.C., Ferrin, T.E., and Babbitt, P.C. 2005. Representing structure-function relationships in mechanistically diverse enzyme superfamilies. Pac. Symp. Biocomput. 2005:358-369.
    Petsko, G.A., Kenyon, G.L., Gerlt, J.A., Ringe, D., and Kozarich, J.W. 1993. On the origin of enzymatic species. Trends Biochem. Sci. 18:372-376.
    Pettersen, E.F., Goddard, T.D., Huang, C.C., Couch, G.S., Greenblatt, D.M., Meng, E.C., and Ferrin, T.E. 2004. UCSF Chimera: A visualization system for exploratory research and analysis. J. Comput. Chem. 25:1605-1612.
    Rison, S.C., Teichmann, S.A., and Thornton, J.M. 2002. Homology, pathway distance and chromosomal localization of the small molecule metabolism enzymes in Escherichia coli. J. Mol. Biol. 318:911-932.
    Teichmann, S.A., Rison, S.C., Thornton, J.M., Riley, M., Gough, J., and Chothia, C. 2001. The evolution and structural anatomy of the small molecule metabolic pathways in Escherichia coli. J. Mol. Biol. 311:693-708.
    Todd, A.E., Orengo, C.A., and Thornton, J.M. 2001. Evolution of function in protein superfamilies, from a structural perspective. J. Mol. Biol. 307:1113-1143.
    Weininger, D. 1988. SMILES, a chemical language and information system. 1: Introduction to methodology and encoding rules. J. Chem. Inf. Comput. Sci. 28:31-36.
 Key References
    Pegg et al., 2005. See above.

Describes the SFLD.

    Gerlt and Babbitt, 2001. See above.

Describes various mechanisms of enzyme evolution, including chemistry-driven evolution of mechanistically diverse superfamilies. Several mechanistically diverse superfamilies are discussed in detail.

    Babbitt et al., 1995. See above.

Describes the use of superfamily analysis to elucidate the function of an uncharacterized ORF in Escherichia coli.

 Internet Resources
    http://sfld.rbvi.ucsf.edu/

The Structure-Function Linkage Database.

    http://theseed.uchicago.edu/FIG/index.cgi

Get operon context information for a specific gene.

    http://modbase.compbio.ucsf.edu/modbase-cgi-new/search_form.cgi

View modeled 3-D structures for a specific protein.

     
 
GO TO THE FULL PROTOCOL:
PDF or HTML at Wiley Online Library
Looking for Answers?
Do you have tips, tricks, or improvements to share?

Join the Conversation

Post new comment

The content of this field is kept private and will not be shown publicly.
CAPTCHA
This question is for testing whether you are a human visitor and to prevent automated spam submissions.