PipMaker: A World Wide Web Server for Genomic Sequence Alignments

Laura Elnitski¹, Cathy Riemer¹, Scott Schwartz¹, Ross Hardison¹, Webb Miller¹

¹ The Pennsylvania State University, University Park, Pennsylvania
Publication Name:  Current Protocols in Bioinformatics
Unit Number:  Unit 10.2
DOI:  10.1002/0471250953.bi1002s00
Online Posting Date:  February, 2003
PipMaker is a World Wide Web site used to compare two long genomic sequences and identify conserved segments between them. This unit describes the use of the PipMaker server and explains the resulting output files. PipMaker provides an efficient method of aligning genomic sequences and returns a compact but easy‐to‐interpret form of output, the percent identity plot (pip). For each aligning segment between the two sequences, the pip shows both its position relative to the first sequence and its degree of similarity. Optional annotations on the pip provide additional information to assist in interpreting the alignment. The default parameters of the underlying blastz alignment program are tuned for human‐mouse alignments.

Table of Contents

  • Strategic Planning
  • Basic Protocol 1: Submitting Sequences to PipMaker
  • Support Protocol 1: Generating a Repeats File for Use with PipMaker
  • Support Protocol 2: Generating an Exons File for Use with PipMaker
  • Support Protocol 3: Generating Color Underlays for Use with PipMaker
  • Support Protocol 4: Generating Annotation Files for Use with PipMaker
  • Support Protocol 5: Installing Stand‐Alone Blastz
  • Guidelines for Understanding Results
  • Commentary
  • Figures
Basic Protocol 1: Submitting Sequences to PipMaker

  Necessary Resources
  • Hardware
    • PipMaker can be accessed and used by any computer with a World Wide Web browser and E‐mail access.
  • Software
    • PipMaker is accessible via a Web interface at http://bio.cse.psu.edu/. All output files are returned to the user via E‐mail, so the E‐mail account and software must be capable of handling large messages. Viewing the pip or dot plot returned by PipMaker requires a PDF viewer, such as Aladdin GhostScript or Adobe Acrobat Reader. These are available for free download at http://www.cs.wisc.edu/~ghost/ and http://www.adobe.com/, respectively. At the present time, Acrobat Reader has better support for hyperlinks in PDF files, which are an option in PipMaker. PipMaker can optionally generate a PostScript version of the output files, which is useful for importing the plot into a graphics program in preparation for publication.
  • Files
    • The following file types are used:
      • Sequences: The PipMaker server accepts two DNA sequences, in FASTA format only (appendix 1B). The sequence files must be plain text, with sequence lines consisting of the characters A, C, G, T, N, and X, typically uppercase. Line length should not exceed ∼70 characters. The first sequence should be in one contiguous piece, while the second sequence can be in unordered, unoriented contigs.
      • Repeats file (see protocol 2)
      • Exon file (optional; see protocol 3)
      • Underlay file (optional; see protocol 4)
      • Annotation file (optional; see protocol 5)
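
The sequence requirements listed above (FASTA format, an alphabet of A, C, G, T, N, and X, and lines of roughly 70 characters) can be checked locally before submission. The following Python sketch is illustrative only and is not part of PipMaker; the function names `read_fasta`, `invalid_characters`, and `format_fasta` are hypothetical, and the exact checks the server itself performs are not described in this unit. Letter case is left untouched, since the text says only that sequences are typically uppercase.

```python
ALLOWED = set("ACGTNX")  # alphabet named in the Files list above


def read_fasta(text):
    """Parse FASTA text into a list of (header, sequence) tuples."""
    records = []
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            if header is not None:
                records.append((header, "".join(chunks)))
            header, chunks = line[1:].strip(), []
        elif header is None:
            raise ValueError("sequence data found before the first '>' header line")
        else:
            chunks.append(line)
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


def invalid_characters(seq):
    """Return the sorted characters of seq outside A, C, G, T, N, X (case-insensitive)."""
    return sorted(set(seq.upper()) - ALLOWED)


def format_fasta(header, seq, width=70):
    """Rewrap a sequence into lines of at most `width` characters."""
    body = "\n".join(seq[i:i + width] for i in range(0, len(seq), width))
    return f">{header}\n{body}\n"
```

A typical use would be to call `read_fasta` on the contents of each file, confirm that the first file yields exactly one record (the second may yield several contigs), reject any record for which `invalid_characters` returns a non-empty list, and write the records back out with `format_fasta`.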

Support Protocol 1: Generating a Repeats File for Use with PipMaker

  Necessary Resources
  • Hardware
    • The authors test and use Blastz on Solaris/Sparc and Linux/x86 platforms, but it should be portable to virtually any ANSI/POSIX system, including Windows and Macintosh.
  • Software
    • The current development snapshot of Blastz is available on the authors' Web site (http://bio.cse.psu.edu/), in a tar.gz file. To unpack it, tar and gzip (or compatible programs) will be needed. An ANSI‐compatible C compiler and the make utility will be needed to compile and install it.
  • Files
    • The stand‐alone version of the Blastz program uses the same sequence and repeats files as the PipMaker Web server (see protocol 1).
NOTE: for an introduction to Unix, see appendix 1C.
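
Where the tar and gzip command-line programs are not at hand, the unpack-then-build sequence described under Software can be approximated with Python's standard library, which counts here as a "compatible program" only by assumption. This is a minimal sketch, not the authors' installation procedure: the helper names `unpack_tar_gz` and `build_with_make` are hypothetical, the archive file name is not given in this unit and must be supplied by the user, and the build step simply runs `make` with no arguments in the unpacked directory.

```python
import subprocess
import tarfile
from pathlib import Path


def unpack_tar_gz(archive, dest):
    """Extract a .tar.gz archive into dest and return the top-level entry names."""
    dest = Path(dest)
    dest.mkdir(parents=True, exist_ok=True)
    with tarfile.open(archive, "r:gz") as tar:
        if hasattr(tarfile, "data_filter"):
            # Newer Python versions provide extraction filters that refuse
            # unsafe members (absolute paths, links pointing outside dest).
            tar.extractall(dest, filter="data")
        else:
            tar.extractall(dest)
    return sorted(p.name for p in dest.iterdir())


def build_with_make(source_dir):
    """Run `make` in source_dir; requires make and a C compiler on the PATH."""
    subprocess.run(["make"], cwd=source_dir, check=True)
```

After unpacking, `build_with_make` would be pointed at whichever directory the archive created; that directory name depends on the snapshot downloaded and is not assumed here.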
