Using The Arabidopsis Information Resource (TAIR) to Find Information About Arabidopsis Genes

Philippe Lamesch1, Kate Dreher1, David Swarbreck1, Rajkumar Sasidharan1, Leonore Reiser1, Eva Huala1

1 Carnegie Institution for Science, Stanford, California
Publication Name:  Current Protocols in Bioinformatics
Unit Number:  Unit 1.11
DOI:  10.1002/0471250953.bi0111s30
Online Posting Date:  June, 2010
GO TO THE FULL TEXT: PDF or HTML at Wiley Online Library


The Arabidopsis Information Resource (TAIR; is a comprehensive Web resource of Arabidopsis biology for plant scientists. TAIR curates and integrates information about genes, proteins, gene function, gene expression, mutant phenotypes, biological materials such as clones and seed stocks, genetic markers, genetic and physical maps, biochemical pathways, genome organization, images of mutant plants, protein sub‐cellular localizations, publications, and the research community. The various data types are extensively interconnected and can be accessed through a variety of Web‐based search and display tools. This unit primarily focuses on some basic methods for searching, browsing, visualizing, and analyzing information about Arabidopsis genes and describes several new tools such as a new TAIR genome browser (GBrowse), and the TAIR synteny viewer (GBrowse_syn). We also describe how to use AraCyc for mining plant metabolic pathways. Curr. Protoc. Bioinform. 30:1.11.1‐1.11.51. © 2010 by John Wiley & Sons, Inc.

Keywords: Arabidopsis; databases; bioinformatics; data mining; genomics

PDF or HTML at Wiley Online Library

Table of Contents

  • Introduction
  • Basic Protocol 1: TAIR Homepage, Sitemap, and Navigation
  • Basic Protocol 2: Finding Comprehensive Information about Arabidopsis Genes
  • Basic Protocol 3: Using the Arabidopsis Genome Browsers (SeqViewer and GBrowse)
  • Basic Protocol 4: Using the Gene Ontology Annotations: Finding Genes with Similar Functions
  • Basic Protocol 5: Finding and Ordering Mutant Seeds and cDNA Clones from the Stock Center
  • Basic Protocol 6: Using Public Microarray Data in TAIR
  • Support Protocol 1: Mapping Array Elements to Annotated Loci
  • Basic Protocol 7: Using the Motif Analysis Tool for Identifying Potential cis‐Regulatory Motifs in Upstream Sequences
  • Basic Protocol 8: Obtaining Information about Arabidopsis Metabolic Pathways
  • Basic Protocol 9: Displaying Metabolic Data Using the “OMICS” Viewer
  • Guidelines for Understanding Results
  • Commentary
  • Literature Cited
  • Figures
PDF or HTML at Wiley Online Library


PDF or HTML at Wiley Online Library



Literature Cited

Literature Cited
   Altschul, S., Gish, W., Miller, W., Myers, E., and Lipman, D. 1990. Basic local alignment search tool. J. Mol. Biol. 215:403‐410.
   Berardini, T.Z., Mundodi, S., Reiser, L., Huala, E., Garcia‐Hernandez, M., Zhang, P., Mueller, L.A., Yoon, J., Doyle, A., Lander, G., Moseyko, N., Yoo, D., Xu, I., Zoeckler, B., Montoya, M., Miller, N., Weems, D., and Rhee, S.Y. 2004. Functional annotation of the Arabidopsis genome using controlled vocabularies. Plant Physiol. 135:745‐755.
   Birney, E., Andrews, D., Bevan, P., Caccamo, M., Cameron, G., Chen, Y., Clarke, L., Coates, G., Cox, T., Cuff, J., Curwen, V., Cutts, T., Down, T., Durbin, R., Eyras, E., Fernandez‐Suarez, X.M., Gane, P., Gibbins, B., Gilbert, J., Hammond, M., Hotz, H., Iyer, V., Kahari, A., Jekosch, K., Kasprzyk, A., Keefe, D., Keenan, S., Lehvaslaiho, H., McVicker, G., Melsopp, C., Meidl, P., Mongin, E., Pettett, R., Potter, S., Proctor, G., Rae, M., Searle, S., Slater, G., Smedley, D., Smith, J., Spooner, W., Stabenau, A., Stalker, J., Storey, R., Ureta‐Vidal, A., Woodwark, C., Clamp, M., and Hubbard, T. 2004. Ensembl 2004. Nucleic Acids Res. 32:D468‐D470.
   Borevitz, J.O. and Nordborg, M. 2003. The impact of genomics on the study of natural variation in Arabidopsis. Plant Physiol. 132:718‐725.
   Clark, R.M., Schweikert, G., Toomajian, C., Ossowski, S., Zeller, G., Shinn, P., Warthmann, N., Hu, T.T., Fu, G., Hinds, D.A., Chen, H., Frazer, K.A., Huson, D.H., Schölkopf, B., Nordborg, M., Rätsch, G., Ecker, J.R., and Weigel, D. 2007. Common sequence polymorphisms shaping genetic diversity in Arabidopsis thaliana. Science 317:338‐342.
   Craigon, D.J., James, N., Okyere, J., Higgins, J., Jotham, J., and May, S. 2004. NASCArrays: A repository for microarray data generated by NASC's transcriptomics service. Nucleic Acids Res. 32:D575‐D577.
   Cutler, S., Ghassemian, M., Bonetta, D., Cooney, S., and McCourt, P. 1996. A protein farnesyl transferase involved in abscisic acid signal transduction in Arabidopsis. Science 273:1239‐1241.
   Dahlquist, K.D., Salomonis, N., Vranizan, K., Lawlor, S.C., and Conklin, B.R. 2002. GenMAPP, a new tool for viewing and analyzing microarray data on biological pathways. Nat. Genet. 31:19‐20.
   Dudoit, S., Yang, Y.H., Luu, P., Lin, D.M., Peng, V., Ngai, J., and Speed, T.P. 2002. Normalization for cDNA microarray data: A robust composite method addressing single and multiple slide systematic variation. Nucleic Acids Res. 30:E15.
   Eulgem, T., Rushton, P.J., Robatzek, S., and Somssich, I.E.. 2000. The WRKY superfamily of plant transcription factors. Trends Plant Sci. 5:199‐206.
   Field, B. and Osbourn, A.E. 2008. Metabolic diversification: Independent assembly of operon‐like gene clusters in different plants. Science 320:543‐547.
   Flanders, D.J., Weng, S., Petel, F.X., and Cherry, J.M. 1998. AtDB, the Arabidopsis thaliana database, and graphical‐web‐display of progress by the Arabidopsis Genome Initiative. Nucleic Acids Res. 26:80‐84.
   Garcia‐Hernandez, M., Berardini, T.Z., Chen, G., Crist, D., Doyle, A., Huala, E., Knee, E., Lambrecht, M., Miller, N., Mueller, L.A., Mundodi, S., Reiser, L., Rhee, S.Y., Scholl, R., Tacklind, J., Weems, D.C., Wu, Y., Xu, I., Yoo, D., Yoon, J., and Zhang, P. 2002. TAIR: A resource for integrated Arabidopsis data. Funct. Integr. Genomics 2:239‐253.
   The Gene Ontology Consortium. 2010. The gene ontology in 2010: Extensions and refinements. Nucleic Acids Res. In press.
   Haas, B.J., Delcher, A.L., Mount, S.M., Wortman, J.R., Smith, R.K. Jr., Hannick, L.I., Maiti, R., Ronning, C.M., Rusch, D.B., Town, C.D., Salzberg, S.L., and White, O. 2003. Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies. Nucleic Acids Res. 31:5654‐5666.
   Hagen, G. and Guilfoyle, T. 2002. Auxin‐responsive gene expression: Genes, promoters and regulatory factors. Plant Mol. Biol. 49:373‐385.
   Huala, E., Dickerman, A.W., Garcia‐Hernandez, M., Weems, D., Reiser, L., LaFond, F., Hanley, D., Kiphart, D., Zhuang, M., Huang, W., Mueller, L.A., Bhattacharyya, D., Bhaya, D., Sobral, B.W., Beavis, W., Meinke, D.W., Town, C.D., Somerville, C., and Rhee, S.Y. 2001. The Arabidopsis Information Resource (TAIR): A comprehensive database and web‐based information retrieval, analysis, and visualization system for a model plant. Nucleic Acids Res. 29:102‐105.
   Karp, P.D., Paley, S., and Romero, P. 2002. The pathway tools software. Bioinformatics 18:S225‐S232.
   Mueller, L.A., Zhang, P., and Rhee, S.Y. 2003. AraCyc: A biochemical pathway database for Arabidopsis. Plant Physiol. 132:453‐460.
   Pearson, W.R. 1995. Comparison of methods for searching protein sequence databases. Protein Sci. 4:1145‐1160.
   Rhee, S.Y. 2004. Carpe diem: Retooling the publish or perish model into the share and survive model. Plant Physiol. 134:543‐547.
   Rhee, S.Y., Weng, S., Bongard‐Pierce, D.K., Garcia‐Hernandez, M., Malekian, A., Flanders, D.J., and Cherry, J.M. 1999. Unified display of Arabidopsis thaliana physical maps from AtDB, the A.thaliana database. Nucleic Acids Res. 27:79‐84.
   Rhee, S.Y., Beavis, W., Berardini, T.Z., Chen, G., Dixon, D., Doyle, A., Garcia‐Hernandez, M., Huala, E., Lander, G., Montoya, M., Miller, N., Mueller, L.A., Mundodi, S., Reiser, L., Tacklind, J., Weems, D.C., Wu, Y., Xu, I., Yoo, D., Yoon, J., and Zhang, P. 2003. The Arabidopsis Information Resource (TAIR): A model organism database providing a centralized, curated gateway to Arabidopsis biology, research materials and community. Nucleic Acids Res. 31:224‐228.
   Running, M.P., Lavy, M., Sternberg, H., Galichet, A., Gruissem, W., Hake, S., Ori, N., and Yalovsky, S. 2004. Enlarged meristems and delayed growth in plp mutants result from lack of CaaX prenyltransferases. Proc. Natl. Acad. Sci. U.S.A. 101:7815‐7820.
   Salomonis, K., Hanspers, A.C., Zambon, K., Vranizan, S.C., Lawlor, K.D., Dahlquist, S.W., Doniger, J., Stuart, B.R., and Pico, A.R. 2007. GenMAPP 2: New features and resources for pathway analysis. BMC Bioinformatics 8:217.
   Scholl, R.L., May, S.T., and Ware, D.H. 2000. Seed and molecular resources for Arabidopsis. Plant Physiol. 124:1477‐1480.
   Scholl, R., Sachs, M.M., and Ware, D. 2003. Maintaining collections of mutants for plant functional genomics. Methods Mol. Biol. 236:311‐326.
   Stein, L.D., Mungall, C., Shu, S., Caudy, M., Mangone, M., Day, A., Nickerson, E., Stajich, J.E., Harris, T.W., Arva, A., and Lewis, S. 2002. The generic genome browser: A building block for a model organism system database. Genome Res. 12:1599‐1610.
   Swarbreck, D., Wilks, C., Lamesch, P., Berardini, T.Z., Garcia‐Hernandez, M., Foerster, H., Li, D., Meyer, T., Muller, R., Ploetz, L., Radenbaugh, A., Singh, S., Swing, V., Tissier, C., Zhang, P., and Huala, E. 2008. The Arabidopsis Information Resource (TAIR): Gene structure and function annotation. Nucleic Acids Res. 36:D1009‐D1014.
   Thimm, O., Blasing, O., Gibon, Y., Nagel, A., Meyer, S., Kruger, P., Selbig, J., Muller, L.A., Rhee, S.Y., and Stitt, M. 2004. MAPMAN: A user‐driven tool to display genomics data sets onto diagrams of metabolic pathways and other biological processes. Plant J. 37:914‐939.
   Weems, D., Miller, N., Garcia‐Hernandez, M., Huala, E., and Rhee, S.Y. 2004. Design, implementation, and maintenance of a model organism database for Arabidopsis thaliana. Comp. Funct. Genomics 5:362‐369.
   Wortman, J.R., Haas, B.J., Hannick, L.I., Smith, R.K. Jr., Maiti, R., Ronning, C.M., Chan, A.P., Yu, C., Ayele, M., Whitelaw, C.A., White, O.R., and Town, C.D. 2003. Annotation of the Arabidopsis genome. Plant Physiol. 132:461‐468.
   Yalovsky, S., Kulukian, A., Rodriguez‐Concepcion, M., Young, C.A., and Gruissem, W. 2000. Functional requirement of plant farnesyltransferase during development in Arabidopsis. Plant Cell 12:1267‐1278.
   Yamada, K., Lim, J., Dale, J.M., Chen, H., Shinn, P., Palm, C.J., Southwick, A.M., Wu, H.C., Kim, C., Nguyen, M., Pham, P., Cheuk, R., Karlin‐Newmann, G., Liu, S.X., Lam, B., Sakano, H., Wu, T., Yu, G., Miranda, M., Quach, H.L., Tripp, M., Chang, C.H., Lee, J.M., Toriumi, M., Chan, M.M., Tang, C.C., Onodera, C.S., Deng, J.M., Akiyama, K., Ansari, Y., Arakawa, T., Banh, J., Banno, F., Bowser, L., Brooks, S., Carninci, P., Chao, Q., Choy, N., Enju, A., Goldsmith, A.D., Gurjal, M., Hansen, N.F., Hayashizaki, Y., Johnson‐Hopson, C., Hsuan, V.W., Iida, K., Karnes, M., Khan, S., Koesema, E., Ishida, J., Jiang, P.X., Jones, T., Kawai, J., Kamiya, A., Meyers, C., Nakajima, M., Narusaka, M., Seki, M., Sakurai, T., Satou, M., Tamse, R., Vaysberg, M., Wallender, E.K., Wong, C., Yamamura, Y., Yuan, S., Shinozaki, K., Davis, R.W., Theologis, A., and Ecker, J.R. 2003. Empirical analysis of transcriptional activity in the Arabidopsis genome. Science 302:842‐846.
   Zhang, P., Foerster, H., Tissier, C., Mueller, L., Paley, S., Karp, P., and Rhee, S.Y. 2005. MetaCyc and AraCyc: Metabolic pathway databases for plant research. Plant Physiology 138:27‐37.
   Ziegelhoffer, E.C., Medrano, L.J., and Meyerowitz, E.M. 2000. Cloning of the Arabidopsis WIGGUM gene identifies a role for farnesylation in meristem development. Proc. Natl. Acad. Sci. U.S.A. 97:7633‐7638.
   Zimmermann, P., Hirsch‐Hoffmann, M., Hennig, L., and Gruissem, W. 2004. GENEVESTIGATOR: Arabidopsis microarray database and analysis toolbox. Plant Physiol. 136:2621‐2632.
PDF or HTML at Wiley Online Library