Identification of Motifs in Protein Sequences

Erik L.L. Sonnhammer1, Tyra G. Wolfsberg2

1 Center for Genomics Research, Karolinska Institutet, Stockholm, 2 National Center for Biotechnology Information/NIH, Bethesda, Maryland
Publication Name:  Current Protocols in Cell Biology
Unit Number:  Appendix 1C
DOI:  10.1002/0471143030.cba01cs00
Online Posting Date:  May, 2001
GO TO THE FULL TEXT: PDF or HTML at Wiley Online Library


This brief appendix serves as a guide for the analysis of functional motifs in proteins. Several database search engines that can be accessed via the World Wide Web are described. Such computerized searches have become the preferred method to scan large sequence and motif databases, as the searches are efficient and the databases are updated frequently. A short list of sorting signals is also included, since these motifs often cannot be predicted reliably by a computer search.

PDF or HTML at Wiley Online Library

Table of Contents

  • Databases and Servers on the WWW
  • Analysis Example
  • Sorting Signals
  • Figures
  • Tables
PDF or HTML at Wiley Online Library


PDF or HTML at Wiley Online Library



Literature Cited

Literature Cited
   Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W., and Lipman, D.J. 1997. Gapped BLAST and PSI‐BLAST: A new generation of protein database search programs. Nucl. Acids Res. 25:3389‐3402.
   Attwood, T.K., Beck, M.E., Flower, D.R., Scordis, P., and Selley, J.N. 1998. The PRINTS protein fingerprint database in its fifth year. Nucl. Acids Res. 26:304‐308.
   Bairoch, A., Bucher, P., and Hofmann, K. 1997. The PROSITE database, its status in 1997. Nucl. Acids Res. 25:217‐221.
   Blattner, J., Dorsam, H., and Clatyon, C.E. 1995. Function of N‐terminal import signals in trypanosome microbodies. FEBS Lett. 360:310‐314.
   Chen, W.J., Goldstein, J.L., and Brown, M.S. 1990. NPXY, a sequence often found in cytoplasmic tails, is required for coated pit‐mediated internalization of the low‐density lipoprotein receptor. J. Biol. Chem. 265:3116‐3123.
   Claros, M.G. 1995. MitoProt, a Macintosh application for studying mitochondrial proteins. Comput. Appl. Biosci. 11:441‐447.
   Claros, M.G., Brunak, S., and von Heijne, G. 1997. Prediction of N‐terminal protein sorting signals. Curr. Opin. Struct. Biol. 7:394‐398.
   Cline, K. and Henry, R. 1996. Import and routing of nucleus‐encoded chloroplast proteins. Annu. Rev. Cell. Dev. Biol. 12:1‐26.
   Colley, K.J. 1997. Golgi localization of glycosyltransferases: More questions than answers. Glycobiology 7:1‐13.
   Corbett, A.H. and Silver, P.A. 1997. Nucleocytoplasmic transport of macromolecules. Microbiol. Mol. Biol. Rev. 61:193‐211.
   Dalbey, R.E., Lively, M.O., Bron, S., and van Dijl, J.M. 1997. The chemistry and enzymology of the type I signal peptidases. Protein Sci. 6:1129‐1138.
   Gavel, Y. and von Heijne, G. 1990a. Cleavage‐site motifs in mitochondrial targeting peptides. Protein Eng. 4:33‐37.
   Gavel, Y. and von Heijne, G. 1990b. A conserved cleavage‐site motif in chloroplast transit peptides. FEBS Lett. 261:455‐458.
   Gomord, V., Denmat, L.A., Fitchette‐Laine, A.C., Satiat‐Jeunemaitre, B., Hawes, C., and Faye, L. 1997. The C‐terminal HDEL sequence is sufficient for retention of secretory proteins in the endoplasmic reticulum (ER) but promotes vacuolar targeting of proteins that escape the ER. Plant J. 11:313‐325.
   Henikoff, S., Pietrokovski, S., and Henikoff, J.G. 1998. Superior performance in protein homology detection with the Blocks Database servers. Nucl. Acids Res. 26:309‐312.
   Jackson, M.R., Nilsson, T., and Peterson, P.A. 1990. Identification of a consensus motif for retention of transmembrane proteins in the endoplasmic reticulum. EMBO J. 9:3153‐3162.
   Keller, G.A., Krisans, S., Gould, S.J., Sommer, J.M., Wang, C.C., Schliebs, W., Kunau, W., Brody, S., and Subramani, S. 1991. Evolutionary conservation of a microbody targeting signal that targets proteins to peroxisomes, lyoxysomes, and glycosomes. J. Cell. Biol. 114:893‐904.
   Marks, M.S., Ohno, H., Kirchhausen, T., and Bonifacino, J.S. 1997. Protein sorting by tyrosine‐based signals: Adapting to the Ys and wherefores. Trends Cell Biol. 7:124‐128.
   Neupert, W. 1997. Protein import into mitochondria. Annu. Rev. Biochem. 66:863‐917.
   Nielsen, H., Engelbrecht, J., Brunak, S., and von Heijne, G. 1997. Identification of prokaryotic and eukaryotic signal peptides and prediction of their cleavage sites. Protein Eng. 10:1‐6.
   Pearson, W.R. 1996. Effective protein sequence comparison. Methods Enzymol. 266:227‐258.
   Pearson, W.R., Wood, T., Zhang, Z., and Miller, W. 1997. Comparison of DNA sequences with protein sequences. Genomics 46:24‐36.
   Recipon, H.E., Schuler, G.D., and Boguski, M.S. 1995. Sequence Similarity Searching Using the BLAST Family of Programs. In Current Protocols in Molecular Biology, (F.M. Ausubel, R. Brent, R.E. Kingston, D.D. Moore, J.G. Seidman, J.A. Smith, and K. Struhl, eds.) pp. 19.3.1‐19.3.38. John Wiley & Sons, New York.
   Sandoval, I. and Bakke, O. 1994. Targeting of membrane proteins to endosomes and lysosomes. Trends Cell Biol. 4:292‐297.
   Schwarz, E. and Neupert, W. 1994. Mitochondrial protein import: Mechanisms, components and energetics. Biochim. Biophys. Acta 1187:270‐274.
   Sommer, J.M. and Wang, C.C. 1994. Targeting proteins to the glycosomes of African trypanosomes. Annu. Rev. Microbiol. 48:105‐138.
   Sonnhammer, E.L., Eddy, S.R., Birney, E., Bateman, A., and Durbin, R. 1998. Pfam: Multiple sequence alignments and HMM‐profiles of protein domains. Nucl. Acids Res. 26:320‐322.
   Tikkanen, R., Peltola, M., Oinonen, C., Rouvinen, J., and Peltonen, L. 1997. Several cooperating binding sites mediate the interaction of a lysosomal enzyme with phosphotransferase. EMBO J. 16:6684.6693.
   Trowbridge, I.S., Collawn, J.F., and Hopkins, C.R. 1993. Signal‐dependent membrane protein trafficking in the endocytic pathway. Annu. Rev. Cell Biol. 9:129‐161.
   Udenfriend, S. and Kodukula, K. 1995. How glycosylphosphatidylinositol‐anchored membrane proteins are made. Annu. Rev. Biochem. 64:563‐591.
   von Heijne, G. 1996. Computer‐assisted identification of protein sorting signals and prediction of membrane protein topology and structure. Adv. Computat. Biol. 2:1‐14.
   Waterham, H.R. and Cregg, J.M. 1997. Peroxisome biogenesis. BioEssays 19:57‐66.
PDF or HTML at Wiley Online Library