TepiTool: A Pipeline for Computational Prediction of T Cell Epitope Candidates

Sinu Paul1, John Sidney1, Alessandro Sette1, Bjoern Peters1

1 La Jolla Institute for Allergy and Immunology, Division of Vaccine Discovery, La Jolla
Publication Name:  Current Protocols in Immunology
Unit Number:  Unit 18.19
DOI:  10.1002/cpim.12
Online Posting Date:  August, 2016
GO TO THE FULL TEXT: PDF or HTML at Wiley Online Library


Computational prediction of T cell epitope candidates is currently being used in several applications including vaccine discovery studies, development of diagnostics, and removal of unwanted immune responses against protein therapeutics. There have been continuous improvements in the performance of MHC binding prediction tools, but their general adoption by immunologists has been slow due to the lack of user‐friendly interfaces and guidelines. Current tools only provide minimal advice on what alleles to include, what lengths to consider, how to deal with homologous peptides, and what cutoffs should be considered relevant. This protocol provides step‐by‐step instructions with necessary recommendations for prediction of the best T cell epitope candidates with the newly developed online tool called TepiTool. TepiTool, which is part of the Immune Epitope Database (IEDB), provides some of the top MHC binding prediction algorithms for number of species including humans, chimpanzees, bovines, gorillas, macaques, mice, and pigs. The TepiTool is freely accessible at http://tools.iedb.org/tepitool/. © 2016 by John Wiley & Sons, Inc.

Keywords: binding affinity prediction; CTL epitope prediction; MHC class I; MHC class II; T cell epitope

PDF or HTML at Wiley Online Library

Table of Contents

  • Introduction
  • Basic Protocol 1: Computational Prediction of Peptides Binding to MHC Class I and Class II Molecules
  • Commentary
  • Figures
PDF or HTML at Wiley Online Library


Basic Protocol 1: Computational Prediction of Peptides Binding to MHC Class I and Class II Molecules

  • Computer with Internet browser and proper Internet connection
  • Protein sequence(s) for binding prediction in single letter amino acid code.
  • TepiTool (http://tools.iedb.org/tepitool/)
PDF or HTML at Wiley Online Library



Literature Cited

Literature Cited
  Bhasin, M. and Raghava, G.P. 2004. SVM based method for predicting HLA‐DRB1*0401 binding peptides in an antigen sequence. Bioinformatics 20:421‐423. doi: 10.1093/bioinformatics/btg424.
  Buus, S., Lauemøller, S., Worning, P., Kesmir, C., Frimurer, T., Corbet, S., Fomsgaard, A., Hilden, J., Holm, A., and Brunak, S. 2003. Sensitive quantitative predictions of peptide‐MHC binding by a ‘Query by Committee’ artificial neural network approach. Tissue Antigens 62:378‐384. doi: 10.1034/j.1399‐0039.2003.00112.x.
  Castellino, F., Zhong, G., and Germain, R.N. 1997. Antigen presentation by MHC class II molecules: Invariant chain function, protein trafficking, and the molecular basis of diverse determinant capture. Hum. Immunol. 54:159‐169. doi: 10.1016/S0198‐8859(97)00078‐5.
  Chicz, R.M., Urban, R.G., Lane, W.S., Gorga, J.C., Stern, L.J., Vignali, D.A., and Strominger, J.L. 1992. Predominant naturally processed peptides bound to HLA‐DR1 are derived from MHC‐related molecules and are heterogeneous in size. Nature 358:764‐768. doi: 10.1038/358764a0.
  De Groot, A.S. and Berzofsky, J.A. 2004. From genome to vaccine—new immunoinformatics tools for vaccine design. Methods 34:425‐428. doi: 10.1016/j.ymeth.2004.06.004.
  Ellis, S.A., Bontrop, R.E., Antczak, D.F., Ballingall, K., Davies, C.J., Kaufman, J., Kennedy, L.J., Robinson, J., Smith, D.M., and Stear, M.J. 2006. ISAG/IUIS‐VIC comparative MHC nomenclature committee report, 2005. Immunogenetics 57:953‐958. doi: 10.1007/s00251‐005‐0071‐4.
  Gonzalez‐Galarza, F.F., Takeshita, L.Y., Santos, E.J., Kempson, F., Maia, M.H., da Silva, A.L., Teles e Silva, A.L., Ghattaoraya, G.S., Alfirevic, A., Jones, A.R., and Middleton, D. 2015. Allele frequency net 2015 update: New features for HLA epitopes, KIR and disease and HLA adverse drug reaction associations. Nucleic Acids Res. 43:D784‐8. doi: 10.1093/nar/gku1166 [doi.
  Greenbaum, J., Sidney, J., Chung, J., Brander, C., Peters, B., and Sette, A. 2011. Functional classification of class II human leukocyte antigen (HLA) molecules reveals seven different supertypes and a surprising degree of repertoire sharing across supertypes. Immunogenetics 63:325‐335. doi: 10.1007/s00251‐011‐0513‐0.
  He, Y., Zhou, Y., Wu, H., Luo, B., Chen, J., Li, W., and Jiang, S. 2004. Identification of immunodominant sites on the spike protein of severe acute respiratory syndrome (SARS) coronavirus: Implication for developing SARS diagnostics and vaccines. J. Immunol. 173:4050‐4057. doi: 173/6/4050 [pii]; doi: 10.4049/jimmunol.173.6.4050.
  Hoof, I., Peters, B., Sidney, J., Pedersen, L.E., Sette, A., Lund, O., Buus, S., and Nielsen, M. 2009. NetMHCpan, a method for MHC class I binding prediction beyond humans. Immunogenetics 61:1‐13. doi: 10.1007/s00251‐008‐0341‐z.
  Karosiene, E., Rasmussen, M., Blicher, T., Lund, O., Buus, S., and Nielsen, M. 2013. NetMHCIIpan‐3.0, a common pan‐specific MHC class II prediction method including all three human MHC class II isotypes, HLA‐DR, HLA‐DP and HLA‐DQ. Immunogenetics 65:711‐724. doi: 10.1007/s00251‐013‐0720‐y.
  Kim, Y., Sidney, J., Pinilla, C., Sette, A., and Peters, B. 2009. Derivation of an amino acid similarity matrix for peptide: MHC binding and its application as a bayesian prior. BMC Bioinformatics 10:394. doi: 10.1186/1471‐2105‐10‐394.
  Kim, Y., Ponomarenko, J., Zhu, Z., Tamang, D., Wang, P., Greenbaum, J., Lundegaard, C., Sette, A., Lund, O., and Bourne, P.E. 2012. Immune epitope database analysis resource. Nucleic Acids Res. 40:W525‐W530. doi: 10.1093/nar/gks438.
  Larsen, M.V., Lelic, A., Parsons, R., Nielsen, M., Hoof, I., Lamberth, K., Loeb, M.B., Buus, S., Bramson, J., and Lund, O. 2010. Identification of CD8 T cell epitopes in the west nile virus polyprotein by reverse‐immunology using NetCTL. PloS One 5:e12697. doi: 10.1371/journal.pone.0012697.
  Lefranc, M.P., Giudicelli, V., Ginestoux, C., Bodmer, J., Muller, W., Bontrop, R., Lemaitre, M., Malik, A., Barbie, V., and Chaume, D. 1999. IMGT, the international ImMunoGeneTics database. Nucleic Acids Res. 27:209‐212. doi: gkc026 [pii] doi: 10.1093/nar/27.1.209.
  Lin, H., Zhang, G., Tongchusak, S., Reinherz, E.L., and Brusic, V. 2008. Evaluation of MHC‐II peptide binding prediction servers: Applications for vaccine research. BMC Bioinformatics 9(Suppl 12):S22. doi: 10.1186/1471‐2105‐9‐S12‐S22.
  Lund, O., Nascimento, E.J., Maciel Jr, M., Nielsen, M., Larsen, M.V., Lundegaard, C., Harndahl, M., Lamberth, K., Buus, S., and Salmon, J. 2011. Human leukocyte antigen (HLA) class I restricted epitope discovery in yellow fewer and dengue viruses: Importance of HLA binding strength. PloS One 6:e26494. doi: 10.1371/journal.pone.0026494.
  Lundegaard, C., Nielsen, M., and Lund, O. 2006. The validity of predicted T‐cell epitopes. Trends Biotechnol. 24:537‐538. doi: 10.1016/j.tibtech.2006.10.001.
  Lundegaard, C., Lund, O., and Nielsen, M. 2008. Accurate approximation method for prediction of class I MHC affinities for peptides of length 8, 10 and 11 using prediction tools trained on 9mers. Bioinformatics 24:1397‐1398. doi: 10.1093/bioinformatics/btn128.
  Lundegaard, C., Hoof, I., Lund, O., and Nielsen, M. 2010. State of the art and challenges in sequence based T‐cell epitope prediction. Immunome Res. 6:S3. doi: 10.1186/1745‐7580‐6‐S2‐S3.
  Lundegaard, C., Lamberth, K., Harndahl, M., Buus, S., Lund, O., and Nielsen, M. 2008. NetMHC‐3.0: Accurate web accessible predictions of human, mouse and monkey MHC class I affinities for peptides of length 8‐11. Nucleic Acids Res. 36:W509‐W512. doi: 10.1093/nar/gkn202.
  Moise, L., McMurry, J.A., Buus, S., Frey, S., Martin, W.D., and De Groot, A.S. 2009. In silico–accelerated identification of conserved and immunogenic variola/vaccinia T‐cell epitopes. Vaccine 27:6471‐6479. doi: 10.1016/j.vaccine.2009.06.018.
  Moutaftsi, M., Peters, B., Pasquetto, V., Tscharke, D.C., Sidney, J., Bui, H., Grey, H., and Sette, A. 2006. A consensus epitope prediction approach identifies the breadth of murine TCD8 ‐cell responses to vaccinia virus. Nat. Biotechnol. 24:817‐819. doi: 10.1038/nbt1215.
  Murphy, K. (Ed.). 2011. Janeway's Immunobiology. Garland Science, New York.
  Nielsen, M. and Lund, O. 2009. NN‐align. an artificial neural network‐based alignment algorithm for MHC class II peptide binding prediction. BMC Bioinformatics 10:296. doi: 10.1186/1471‐2105‐10‐296.
  Nielsen, M., Lundegaard, C., and Lund, O. 2007. Prediction of MHC class II binding affinity using SMM‐align, a novel stabilization matrix alignment method. BMC Bioinformatics 8:238. doi: 10.1186/1471‐2105‐8‐238.
  Nielsen, M., Lundegaard, C., Worning, P., Lauemøller, S.L., Lamberth, K., Buus, S., Brunak, S., and Lund, O. 2003. Reliable prediction of T‐cell epitopes using neural networks with novel sequence representations. Protein Sci. 12:1007‐1017. doi: 10.1110/ps.0239403.
  Nielsen, M., Lundegaard, C., Blicher, T., Peters, B., Sette, A., Justesen, S., Buus, S., and Lund, O. 2008. Quantitative predictions of peptide binding to any HLA‐DR molecule of known sequence: NetMHCIIpan. PLoS Comput. Biol. 4:e1000107. doi: 10.1371/journal.pcbi.1000107.
  Nielsen, M., Lundegaard, C., Blicher, T., Lamberth, K., Harndahl, M., Justesen, S., Roder, G., Peters, B., Sette, A., and Lund, O. 2007. NetMHCpan, a method for quantitative predictions of peptide binding to any HLA‐A and‐B locus protein of known sequence. PloS One 2:e796. doi: 10.1371/journal.pone.0000796.
  Oseroff, C., Sidney, J., Kotturi, M.F., Kolla, R., Alam, R., Broide, D.H., Wasserman, S.I., Weiskopf, D., McKinney, D.M., and Chung, J.L. 2010. Molecular determinants of T cell epitope recognition to the common timothy grass allergen. J. Immunol. 185:943‐955. doi: 10.4049/jimmunol.1000405.
  Parker, K.C., Bednarek, M.A., and Coligan, J.E. 1994. Scheme for ranking potential HLA‐A2 binding peptides based on independent binding of individual peptide side‐chains. J. Immunol. 152:163‐175.
  Paul, S., Weiskopf, D., Angelo, M.A., Sidney, J., Peters, B., and Sette, A. 2013. HLA class I alleles are associated with peptide‐binding repertoires of different size, affinity, and immunogenicity. J. Immunol. 191:5831‐5839. doi: 10.4049/jimmunol.1302101 [doi] doi: 10.4049/jimmunol.1302101.
  Paul, S., Arlehamn, C.S.L., Scriba, T.J., Dillon, M.B., Oseroff, C., Hinz, D., McKinney, D.M., Pro, S.C., Sidney, J., and Peters, B. 2015. Development and validation of a broad scheme for prediction of HLA class II restricted T cell epitopes. J. Immunol. Methods 422:28‐34. doi: 10.1016/j.jim.2015.03.022.
  Paul, S., Kolla, R.V., Sidney, J., Weiskopf, D., Fleri, W., Kim, Y., Peters, B., and Sette, A. 2013b. Evaluating the immunogenicity of protein drugs by applying in vitro MHC binding data and the immune epitope database and analysis resource. Clin. Dev. Immunol. 2013:467852. doi: 10.1155/2013/467852.
  Peters, B., Sidney, J., Bourne, P., Bui, H., Buus, S., Doh, G., Fleri, W., Kronenberg, M., Kubo, R., and Lund, O. 2005. The immune epitope database and analysis resource: From vision to blueprint. PLoS Biol. 3:e91. doi: 10.1371/journal.pbio.0030091.
  Rammensee, H.G., Bachmann, J., Emmerich, N.P.N., Bachor, O.A., and Stevanovic, S. 1999. SYFPEITHI: Database for MHC ligands and peptide motifs. Immunogenetics 50:213‐219. doi: 10.1007/s002510050595.
  Reche, P.A. and Reinherz, E.L. 2007. Prediction of peptide‐MHC binding using profiles. In Immunoinformatics, pp. 185‐200. Springer, New York.
  Robinson, J., Halliwell, J.A., Hayhurst, J.D., Flicek, P., Parham, P., and Marsh, S.G. 2015. The IPD and IMGT/HLA database: Allele variant databases. Nucleic Acids Res. 43:D423‐D431. doi: 10.1093/nar/gku1161.
  Rudensky, A.Y., Preston‐Hurlburt, P., Hong, S., Barlow, A., and Janeway, C.A. 1991. Sequence analysis of peptides bound to MHC class II molecules. Nature 353:622‐627. doi: 10.1038/353622a0.
  Salimi, N., Fleri, W., Peters, B., and Sette, A. 2010. Design and utilization of epitope‐based databases and predictive tools. Immunogenetics 62:185‐196. doi: 10.1007/s00251‐010‐0435‐2.
  Sette, A. and Rappuoli, R. 2010. Reverse vaccinology: Developing vaccines in the era of genomics. Immunity 33:530‐541. doi: 10.1016/j.immuni.2010.09.017.
  Sette, A., Vitiello, A., Reherman, B., Fowler, P., Nayersina, R., Kast, W.M., Melief, C., Oseroff, C., Yuan, L., and Ruppert, J. 1994. The relationship between class I binding affinity and immunogenicity of potential cytotoxic T cell epitopes. J. Immunol. 153:5586‐5592.
  Sidney, J., Peters, B., Frahm, N., Brander, C., and Sette, A. 2008. HLA class I supertypes: A revised and updated classification. BMC Immunol. 9:1471‐2172. doi: 10.1186/1471‐2172‐9‐1.
  Sidney, J., Assarsson, E., Moore, C., Ngo, S., Pinilla, C., Sette, A., and Peters, B. 2008. Quantitative peptide binding motifs for 19 human and mouse MHC class I molecules derived using positional scanning combinatorial peptide libraries. Immunome Res. 4:2. doi: 10.1186/1745‐7580‐4‐2.
  Singh, H. and Raghava, G. 2001. ProPred: Prediction of HLA‐DR binding sites. Bioinformatics 17:1236. doi: 10.1093/bioinformatics/17.12.1236.
  Singh, H. and Raghava, G. 2003. ProPred1: Prediction of promiscuous MHC class‐I binding sites. Bioinformatics 19:1009‐1014. doi: 10.1093/bioinformatics/btg108.
  Stern, L.J., Brown, J.H., Jardetzky, T.S., Gorga, J.C., Urban, R.G., Strominger, J.L., and Wiley, D.C. 1994. Crystal structure of the human class II MHC protein HLA‐DR1 complexed with an influenza virus peptide. Nature 368:215‐221.
  Sturniolo, T., Bono, E., Ding, J., Raddrizzani, L., Tuereci, O., Sahin, U., Braxenthaler, M., Gallazzi, F., Protti, M.P., and Sinigaglia, F. 1999. Generation of tissue‐specific and promiscuous HLA ligand databases using DNA microarrays and virtual HLA class II matrices. Nat. Biotechnol. 17:555‐561. doi: 10.1038/9858.
  Sylvester‐Hvid, C., Nielsen, M., Lamberth, K., Røder, G., Justesen, S., Lundegaard, C., Worning, P., Thomadsen, H., Lund, O., and Brunak, S. 2004. SARS CTL vaccine candidates—HLA supertype, genome‐wide scanning and biochemical validation. Scand. J. Immunol. 59:632‐632. doi: 10.1111/j.0300‐9475.2004.01423bd.x.
  Tangri, S., Mothé, B.R., Eisenbraun, J., Sidney, J., Southwood, S., Briggs, K., Zinckgraf, J., Bilsel, P., Newman, M., and Chesnut, R. 2005. Rationally engineered therapeutic proteins with reduced immunogenicity. J. Immunol. 174:3187‐3196. doi: 10.4049/jimmunol.174.6.3187.
  Trolle, T., Metushi, I.G., Greenbaum, J.A., Kim, Y., Sidney, J., Lund, O., Sette, A., Peters, B., and Nielsen, M. 2015. Automated benchmarking of peptide‐MHC class I binding predictions. Bioinformatics 31:2174‐2181. doi: 10.1093/bioinformatics/btv123 [doi].
  Vincenti, D., Carrara, S., De Mori, P., Pucillo, L.P., Petrosillo, N., Palmieri, F., Armignacco, O., Ippolito, G., Girardi, E., Amicosante, M., and Goletti, D. 2003. Identification of early secretory antigen target‐6 epitopes for the immunodiagnosis of active tuberculosis. Mol. Med. 9:105‐111.
  Wang, P., Sidney, J., Dow, C., Mothé, B., Sette, A., and Peters, B. 2008. A systematic assessment of MHC class II peptide binding predictions and evaluation of a consensus approach. PLoS Comput. Biol. 4:e1000048. doi: 10.1371/journal.pcbi.1000048.
  Wang, P., Sidney, J., Kim, Y., Sette, A., Lund, O., Nielsen, M., and Peters, B. 2010. Peptide binding predictions for HLA DR, DP and DQ molecules. BMC Bioinformatics 11:568. doi: 10.1186/1471‐2105‐11‐568.
  Wang, M., Lamberth, K., Harndahl, M., Røder, G., Stryhn, A., Larsen, M.V., Nielsen, M., Lundegaard, C., Tang, S.T., and Dziegiel, M.H. 2007. CTL epitopes for influenza A including the H5N1 bird flu; genome‐, pathogen‐, and HLA‐wide screening. Vaccine 25:2823‐2831. doi: 10.1016/j.vaccine.2006.12.038.
  Weiskopf, D., Angelo, M.A., de Azeredo, E.L., Sidney, J., Greenbaum, J.A., Fernando, A.N., Broadwater, A., Kolla, R.V., De Silva, A.D., de Silva, A.M., Mattia, K.A., Doranz, B.J., Grey, H.M., Shresta, S., Peters, B., and Sette, A. 2013. Comprehensive analysis of dengue virus‐specific responses supports an HLA‐linked protective role for CD8+ T cells. Proc. Natl. Acad. Sci. U.S.A. 110:E2046‐53. doi: 10.1073/pnas.1305227110.
  Zhang, L., Udaka, K., Mamitsuka, H., and Zhu, S. 2012. Toward more accurate pan‐specific MHC‐peptide binding prediction: A review of current methods and tools. Brief. Bioinformatics 13:350‐364. doi: 10.1093/bib/bbr060.
  Zhang, G.L., Khan, A.M., Srinivasan, K.N., August, J.T., and Brusic, V. 2005. MULTIPRED: A computational system for prediction of promiscuous HLA binding peptides. Nucleic Acids Res. 33:W172‐W179. doi: 10.1093/nar/gki452.
  Zhang, Q., Wang, P., Kim, Y., Haste‐Andersen, P., Beaver, J., Bourne, P.E., Bui, H.H., Buus, S., Frankild, S., Greenbaum, J., Lund, O., Lundegaard, C., Nielsen, M., Ponomarenko, J., Sette, A., Zhu, Z., and Peters, B. 2008. Immune epitope database analysis resource (IEDB‐AR). Nucleic Acids Res. 36:W513‐518. doi: 10.1093/nar/gkn254.
Internet Resources
  TepiTool: The newly developed online tool described in this protocol, for prediction of peptides binding to MHC class I and class II molecules.
  IEDB's analysis resource: A collection of tools for the prediction and analysis of immune epitopes.
  International ImMunoGeneTics Information system (IMGT): An integrated knowledge resource for the immunoglobulins or antibodies, T cell receptors, and major histocompatibility of human and other vertebrate species.
  HLA nomenclature Web site: Web site describing HLA allele nomenclature.
  Immuno Polymorphism Database (IPD‐MHC): A centralized repository for sequences of the Major Histocompatibility Complex (MHC) from a number of different species and information on their nomenclature.
  Allelefrequencies.net database: Database with frequencies of HLA alleles.
PDF or HTML at Wiley Online Library