Analysis and Annotation of Whole‐Genome or Whole‐Exome Sequencing–Derived Variants for Clinical Diagnosis

Elizabeth A. Worthey1

1 Department of Computer Science, University of Wisconsin, Milwaukee, Wisconsin
Publication Name:  Current Protocols in Human Genetics
Unit Number:  Unit 9.24
DOI:  10.1002/0471142905.hg0924s79
Online Posting Date:  October, 2013

Over the last several years, next‐generation sequencing (NGS) has transformed genomic research through substantial advances in technology and reduction in the cost of sequencing, and also in the systems required for analysis of these large volumes of data. This technology is now being used as a standard molecular diagnostic test under particular circumstances in some clinical settings. The advances in sequencing have come so rapidly that the major bottleneck in identification of causal variants is no longer the sequencing but rather the analysis and interpretation. Interpretation of genetic findings in a clinical setting is scarcely a new challenge, but the task is increasingly complex in clinical genome‐wide sequencing given the dramatic increase in dataset size and complexity. This increase requires the development of novel or repositioned analysis tools, methodologies, and processes. This unit provides an overview of these items. Specific challenges related to implementation in a clinical setting are discussed. Curr. Protoc. Hum. Genet. 79:9.24.1‐9.24.24. © 2013 by John Wiley & Sons, Inc.

Keywords: sequencing; genome variant identification; genome variant annotation; genome variant interpretation

Table of Contents

  • Introduction
  • Bioinformatics Analysis of Clinical WGS Data
  • Contrasting Research and Clinical Variant Analyses
  • Interpretation
  • Conclusions
  • Literature Cited
  • Figures

Literature Cited

  1000 Genomes Project Consortium; Abecasis, G.R., Altshuler, D., Auton, A., Brooks, L.D., Durbin, R.M., Gibbs, R.A., Hurles, M.E., and McVean, G.A. 2010. A map of human genome variation from population‐scale sequencing. Nature 467:1061‐1073.
  Ajay, S.S., Parker, S.C., Abaan, H.O., Fajardo, K.V., and Margulies, E.H. 2011. Accurate and comprehensive sequencing of personal genomes. Genome Res. 21:1498‐1505.
  Amberger, J., Bocchini, C.A., Scott, A.F., and Hamosh, A. 2009. McKusick's Online Mendelian Inheritance in Man (OMIM). Nucleic Acids Res. 37:D793‐D796.
  Amberger, J., Bocchini, C., and Hamosh, A. 2011. A new face and new challenges for Online Mendelian Inheritance in Man (OMIM). Hum. Mutat. 32:564‐567.
  Ashburner, M. and Lewis, S. 2002. On ontologies for biologists: The Gene Ontology–untangling the web. Novartis Found Symp. 247:66‐80; discussion 80‐83, 84‐90, 244‐252.
  Bailey, J.A., Yavor, A.M., Massa, H.F., Trask, B.J., and Eichler, E.E. 2001. Segmental duplications: Organization and impact within the current human genome project assembly. Genome Res. 11:1005‐1017.
  Bainbridge, M.N., Wiszniewski, W., Murdock, D.R., Friedman, J., Gonzaga‐Jauregui, C., Newsham, I., Reid, J.G., Fink, J.K., Morgan, M.B., Gingras, M.C., Muzny, D.M., Hoang, L.D., Yousaf, S., Lupski, J.R., and Gibbs, R.A. 2011. Whole‐genome sequencing for optimized patient management. Sci. Transl. Med. 3:87re3.
  Bainbridge, M.N., Hu, H., Muzny, D.M., Musante, L., Lupski, J.R., Graham, B.H., Chen, W., Gripp, K.W., Jenny, K., Wienker, T.F., Yang, Y., Sutton, V.R., Gibbs, R.A., and Ropers, H.H. 2013. De novo truncating mutations in ASXL3 are associated with a novel clinical phenotype with similarities to Bohring‐Opitz syndrome. Genome Med. 5:11.
  Baker, S., Joecker, A., Church, G., Snyder, M., West, J., Salzberg, S., Worthey, E., Smith, T., Wang, J., and Reid, J.G. 2012. Genome interpretation and assembly‐recent progress and next steps. Nat. Biotechnol. 30:1081‐1083.
  Bale, S., Devisscher, M., Van Criekinge, W., Rehm, H.L., Decouttere, F., Nussbaum, R., Dunnen, J.T., and Willems, P. 2011. MutaDATABASE: a centralized and standardized DNA variation database. Nat. Biotechnol. 29:117‐118.
  Bauer, D.C. 2011. Variant calling comparison CASAVA1.8 and GATK. Nature Precedings.
  Becker, J., Semler, O., Gilissen, C., Li, Y., Bolz, H.J., Giunta, C., Bergmann, C., Rohrbach, M., Koerber, F., Zimmermann, K., de Vries, P., Wirth, B., Schoenau, E., Wollnik, B., Veltman, J.A., Hoischen, A., and Netzer, C. 2011. Exome sequencing identifies truncating mutations in human SERPINF1 in autosomal‐recessive osteogenesis imperfecta. Am. J. Hum. Genet. 88:362‐371.
  Belinky, F., Bahir, I., Stelzer, G., Zimmerman, S., Rosen, N., Nativ, N., Dalah, I., Iny Stein, T., Rappaport, N., Mituyama, T., Safran, M., and Lancet, D. 2013. Non‐redundant compendium of human ncRNA genes in GeneCards. Bioinformatics 29:255‐261.
  Bentley, D.R., Balasubramanian, S., Swerdlow, H.P., Smith, G.P., Milton, J., Brown, C.G., Hall, K.P., Evers, D.J., Barnes, C.L., Bignell, H.R., Boutell, J.M., Bryant, J., et al. 2008. Accurate whole human genome sequencing using reversible terminator chemistry. Nature 456:53‐59.
  Berg, J.S., Khoury, M.J., and Evans, J.P. 2011. Deploying whole genome sequencing in clinical practice and public health: Meeting the challenge one bin at a time. Genet. Med. 13:499‐504.
  Bertelli, C. and Greub, G. 2013. Rapid bacterial genome sequencing: Methods and applications in clinical microbiology. Clin. Microbiol. Infect. doi: 10.1111/1469‐0691.12217.
  Bick, D. and Dimmock, D. 2011. Whole exome and whole genome sequencing. Curr. Opin. Pediatr. 23:594‐600.
  Biesecker, B.B. and Peay, H.L. 2013. Genomic sequencing for psychiatric disorders: Promise and challenge. Int. J. Neuropsychopharmacol. 16:1667‐1672.
  Bilguvar, K., Ozturk, A.K., Louvi, A., Kwan, K.Y., Choi, M., Tatli, B., Yalnizoğlu, D., Tüysüz, B., Cağlayan, A.O., Gökben, S., Kaymakçalan, H., Barak, T., Bakircioğlu, M., Yasuno, K., Ho, W., Sanders, S., Zhu, Y., Yilmaz, S., Dinçer, A., Johnson, M.H., Bronen, R.A., Koçer, N., Per, H., Mane, S., Pamir, M.N., Yalçinkaya, C., Kumandaş, S., Topçu, M., Ozmen, M., Sestan, N., Lifton, R.P., State, M.W., and Günel, M. 2010. Whole‐exome sequencing identifies recessive WDR62 mutations in severe brain malformations. Nature 467:207‐210.
  Borge, K.S., Borresen‐Dale, A.L., and Lingaas, F. 2011. Identification of genetic variation in 11 candidate genes of canine mammary tumour. Vet. Comp. Oncol. 9:241‐250.
  Bork, P. and Bairoch, A. 1996. Go hunting in sequence databases but watch out for the traps. Trends Genet. 12:425‐427.
  Bras, J., Guerreiro, R., and Hardy, J. 2012. Use of next‐generation sequencing and other whole‐genome strategies to dissect neurological disease. Nat. Rev. Neurosci. 13:453‐464.
  Butler, J., MacCallum, I., Kleber, M., Shlyakhter, I.A., Belmonte, M.K., Lander, E.S., Nusbaum, C., and Jaffe, D.B. 2008. ALLPATHS: de novo assembly of whole‐genome shotgun microreads. Genome Res. 18:810‐820.
  Carneiro, M.O., Russ, C., Ross, M.G., Gabriel, S.B., Nusbaum, C., and DePristo, M.A. 2012. Pacific Biosciences sequencing technology for genotyping and variation discovery in human data. BMC Genomics 13:375.
  Carnevali, P., Baccash, J., Halpern, A.L., Nazarenko, I., Nilsen, G.B., Pant, K.P., Ebert, J.C., Brownley, A., Morenzoni, M., Karpinchyk, V., Martin, B., Ballinger, D.G., and Drmanac, R. 2012b. Computational techniques for human genome resequencing using mated gapped reads. J. Comput. Biol. 19:279‐292.
  Challis, D., Yu, J., Evani, U.S., Jackson, A.R., Paithankar, S., Coarfa, C., Milosavljevic, A., Gibbs, R.A., and Yu, F. 2012. An integrative variant analysis suite for whole exome next‐generation sequencing data. BMC Bioinformatics 13:8.
  Chen, K., McLellan, M.D., Ding, L., Wendl, M.C., Kasai, Y., Wilson, R.K., and Mardis, E.R. 2007. PolyScan: An automatic indel and SNP detection approach to the analysis of human resequencing data. Genome Res. 17:659‐666.
  Chen, Y., Schmidt, B., and Maskell, D.L. 2013. A hybrid short read mapping accelerator. BMC Bioinformatics 14:67.
  Chen, Y.Z., Matsushita, M.M., Robertson, P., Rieder, M., Girirajan, S., Antonacci, F., Lipe, H., Eichler, E.E., Nickerson, D.A., Bird, T.D., and Raskind, W.H. 2012. Autosomal dominant familial dyskinesia and facial myokymia: Single exome sequencing identifies a mutation in adenylyl cyclase 5. Arch. Neurol. 69:630‐635.
  Chepelev, I., Wei, G., Tang, Q., and Zhao, K. 2009. Detection of single nucleotide variations in expressed exons of the human genome using RNA‐Seq. Nucleic Acids Res. 37:e106.
  Choi, M., Scholl, U.I., Ji, W., Liu, T., Tikhonova, I.R., Zumbo, P., Nayir, A., Bakkaloğlu, A., Ozen, S., Sanjad, S., Nelson‐Williams, C., Farhi, A., Mane, S., and Lifton, R.P. 2009. Genetic diagnosis by whole exome capture and massively parallel DNA sequencing. Proc. Natl. Acad. Sci. U.S.A. 106:19096‐19101.
  Chou, J., Ohsumi, T.K., and Geha, R.S. 2012. Use of whole exome and genome sequencing in the identification of genetic causes of primary immunodeficiencies. Curr. Opin. Allergy Clin. Immunol. 12:623‐628.
  Church, D.M., Schneider, V.A., Graves, T., Auger, K., Cunningham, F., Bouk, N., Chen, H.C., Agarwala, R., McLaren, W.M., Ritchie, G.R., Albracht, D., Kremitzki, M., Rock, S., Kotkiewicz, H., Kremitzki, C., Wollam, A., Trani, L., Fulton, L., Fulton, R., Matthews, L., Whitehead, S., Chow, W., Torrance, J., Dunn, M., Harden, G., Threadgold, G., Wood, J., Collins, J., Heath, P., Griffiths, G., Pelan, S., Grafham, D., Eichler, E.E., Weinstock, G., Mardis, E.R., Wilson, R.K., Howe, K., Flicek, P., and Hubbard, T. 2011. Modernizing reference genome assemblies. PLoS Biol. 9:e1001091.
  Clarke, L., Zheng‐Bradley, X., Smith, R., Kulesha, E., Xiao, C., Toneva, I., Vaughan, B., Preuss, D., Leinonen, R., Shumway, M., Sherry, S., Flicek, P.; 1000 Genomes Project Consortium. 2012. The 1000 Genomes Project: Data management and community access. Nat. Methods 9:459‐462.
  Cooper, D.N., Stenson, P.D., and Chuzhanova, N.A. 2006. The Human Gene Mutation Database (HGMD) and its exploitation in the study of mutational mechanisms. Curr. Protoc. Bioinformatics 12:1.13.1‐1.13.20 [archived version available at].
  Costanzo, M.C., Park, J., Balakrishnan, R., Cherry, J.M., and Hong, E.L. 2011. Using computational predictions to improve literature‐based Gene Ontology annotations: A feasibility study. Database (Oxford) 2011:bar004.
  Craig, D.W., Pearson, J.V., Szelinger, S., Sekar, A., Redman, M., Corneveaux, J.J., Pawlowski, T.L., Laub, T., Nunn, G., Stephan, D.A., Homer, N., and Huentelman, M.J. 2008. Identification of genetic variants using bar‐coded multiplexed sequencing. Nat. Methods 5:887‐893.
  Danecek, P., Auton, A., Abecasis, G., Albers, C.A., Banks, E., DePristo, M.A., Handsaker, R.E., Lunter, G., Marth, G.T., Sherry, S.T., McVean, G., Durbin, R., and 1000 Genomes Project Analysis Group. 2011. The variant call format and VCFtools. Bioinformatics 27:2156‐2158.
  De Baets, G., Van Durme, J., Reumers, J., Maurer‐Stroh, S., Vanhee, P., Dopazo, J., Schymkowitz, J., and Rousseau, F. 2012. SNPeffect 4.0: On‐line prediction of molecular and structural effects of protein‐coding variants. Nucleic Acids Res. 40:D935‐D939.
  de Bakker, P.I., Saxena, R., and Graham, R.R. 2004. Variation in the human genome and risk to common disease. Keystone Symposium on ‘Human Genome Sequence Variation and the Inherited Basis of Common Diseases’. January 8‐13, Breckenridge, Colorado, U.S.A. Pharmacogenomics 5:157‐161.
  Denisov, G., Walenz, B., Halpern, A.L., Miller, J., Axelrod, N., Levy, S., and Sutton, G. 2008. Consensus generation and variant detection by Celera Assembler. Bioinformatics 24:1035‐1040.
  DePristo, M.A., Banks, E., Poplin, R., Garimella, K.V., Maguire, J.R., Hartl, C., Philippakis, A.A., del Angel, G., Rivas, M.A., Hanna, M., McKenna, A., Fennell, T.J., Kernytsky, A.M., Sivachenko, A.Y., Cibulskis, K., Gabriel, S.B., Altshuler, D., and Daly, M.J. 2011. A framework for variation discovery and genotyping using next‐generation DNA sequencing data. Nat. Genet. 43:491‐498.
  Didelot, X., Bowden, R., Wilson, D.J., Peto, T.E., and Crook, D.W. 2012. Transforming clinical microbiology with bacterial genome sequencing. Nat. Rev. Genet. 13:601‐612.
  Dingel, J., Hanus, P., Leonardi, N., Hagenauer, J., Zech, J., and Mueller, J.C. 2008. Local conservation scores without a priori assumptions on neutral substitution rates. BMC Bioinformatics 9:190.
  Dorfman, R., Nalpathamkalam, T., Taylor, C., Gonska, T., Keenan, K., Yuan, X.W., Corey, M., Tsui, L.C., Zielenski, J., and Durie, P. 2010. Do common in silico tools predict the clinical consequences of amino‐acid substitutions in the CFTR gene? Clin. Genet. 77:464‐473.
  Dulak, A.M., Stojanov, P., Peng, S., Lawrence, M.S., Fox, C., Stewart, C., Bandla, S., Imamura, Y., Schumacher, S.E., Shefler, E., McKenna, A., Carter, S.L., Cibulskis, K., Sivachenko, A., Saksena, G., Voet, D., Ramos, A.H., Auclair, D., Thompson, K., Sougnez, C., Onofrio, R.C., Guiducci, C., Beroukhim, R., Zhou, Z., Lin, L., Lin, J., Reddy, R., Chang, A., Landrenau, R., Pennathur, A., Ogino, S., Luketich, J.D., Golub, T.R., Gabriel, S.B., Lander, E.S., Beer, D.G., Godfrey, T.E., Getz, G., and Bass, A.J. 2013. Exome and whole‐genome sequencing of esophageal adenocarcinoma identifies recurrent driver events and mutational complexity. Nat. Genet. 45:478‐486.
  ENCODE Project Consortium. 2007. Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature 447:799‐816.
  English, A.C., Richards, S., Han, Y., Wang, M., Vee, V., Qu, J., Qin, X., Muzny, D.M., Reid, J.G., Worley, K.C., and Gibbs, R.A. 2012. Mind the gap: Upgrading genomes with Pacific Biosciences RS long‐read sequencing technology. PLoS One 7:e47768.
  Finnis, M., Dayan, S., Hobson, L., Chenevix‐Trench, G., Friend, K., Ried, K., Venter, D., Woollatt, E., Baker, E., and Richards, R.I. 2005. Common chromosomal fragile site FRA16D mutation in cancer cells. Hum. Mol. Genet. 14:1341‐1349.
  Fokkema, I.F., Taschner, P.E., Schaafsma, G.C., Celli, J., Laros, J.F., and den Dunnen, J.T. 2011. LOVD v.2.0: The next generation in gene variant databases. Hum. Mutat. 32:557‐563.
  Forbes, S.A., Bindal, N., Bamford, S., Cole, C., Kok, C.Y., Beare, D., Jia, M., Shepherd, R., Leung, K., Menzies, A., Teague, J.W., Campbell, P.J., Stratton, M.R., and Futreal, P.A. 2011. COSMIC: Mining complete cancer genomes in the Catalogue of Somatic Mutations in Cancer. Nucleic Acids Res. 39:D945‐D950.
  Frost, A., Elgort, M.G., Brandman, O., Ives, C., Collins, S.R., Miller‐Vedam, L., Weibezahn, J., Hein, M.Y., Poser, I., Mann, M., Hyman, A.A., and Weissman, J.S. 2012. Functional repurposing revealed by comparing S. pombe and S. cerevisiae genetic interactions. Cell 149:1339‐1352.
  Fu, W., O'Connor, T.D., Jun, G., Kang, H.M., Abecasis, G., Leal, S.M., Gabriel, S., Rieder, M.J., Altshuler, D., Shendure, J., Nickerson, D.A., Bamshad, M.J.; NHLBI Exome Sequencing Project, and Akey, J.M. 2013. Analysis of 6,515 exomes reveals the recent origin of most human protein‐coding variants. Nature 493:216‐220.
  Furey, T.S., Diekhans, M., Lu, Y., Graves, T.A., Oddy, L., Randall‐Maher, J., Hillier, L.W., Wilson, R.K., and Haussler, D. 2004. Analysis of human mRNAs with the reference genome sequence reveals potential errors, polymorphisms, and RNA editing. Genome Res. 14:2034‐2040.
  Galperin, M.Y. and Koonin, E.V. 2000. Who's your neighbor? New computational approaches for functional genomics. Nat. Biotechnol. 18:609‐613.
  Galperin, M.Y. and Koonin, E.V. 2001. Comparative genome analysis. Methods Biochem. Anal. 43:359‐392.
  Gardy, J.L. 2013. Investigation of disease outbreaks with genome sequencing. Lancet Infect. Dis. 13:101‐102.
  Gargis, A.S., Kalman, L., Berry, M.W., Bick, D.P., Dimmock, D.P., Hambuch, T., Lu, F., Lyon, E., Voelkerding, K.V., Zehnbauer, B.A., Agarwala, R., Bennett, S.F., Chen, B., Chin, E.L., Compton, J.G., Das, S., Farkas, D.H., Ferber, M.J., Funke, B.H., Furtado, M.R., Ganova‐Raeva, L.M., Geigenmüller, U., Gunselman, S.J., Hegde, M.R., Johnson, P.L., Kasarskis, A., Kulkarni, S., Lenk, T., Liu, C.S., Manion, M., Manolio, T.A., Mardis, E.R., Merker, J.D., Rajeevan, M.S., Reese, M.G., Rehm, H.L., Simen, B.B., Yeakley, J.M., Zook, J.M., and Lubin, I.M. 2012. Assuring the quality of next‐generation sequencing in clinical laboratory practice. Nat. Biotechnol. 30:1033‐1036.
  Gnerre, S., Lander, E.S., Lindblad‐Toh, K., and Jaffe, D.B. 2009. Assisted assembly: How to improve a de novo genome assembly by using related species. Genome Biol. 10:R88.
  Goh, V., Helbling, D., Biank, V., Jarzembowski, J., and Dimmock, D. 2012. Next‐generation sequencing facilitates the diagnosis in a child with twinkle mutations causing cholestatic liver failure. J. Pediatr. Gastroenterol. Nutr. 54:291‐294.
  Gonzaga‐Jauregui, C., Lupski, J.R., and Gibbs, R.A. 2012. Human genome sequencing in health and disease. Annu. Rev. Med. 63:35‐61.
  González‐Pérez, A. and López‐Bigas, N. 2011. Improving the assessment of the outcome of nonsynonymous SNVs with a consensus deleteriousness score, Condel. Am. J. Hum. Genet. 88:440‐449.
  Green, R.C., Berg, J.S., Berry, G.T., Biesecker, L.G., Dimmock, D.P., Evans, J.P., Grody, W.W., Hegde, M.R., Kalia, S., Korf, B.R., Krantz, I., McGuire, A.L., Miller, D.T., Murray, M.F., Nussbaum, R.L., Plon, S.E., Rehm, H.L., and Jacob, H.J. 2012. Exploring concordance and discordance for return of incidental findings from clinical sequencing. Genet. Med. 14:405‐410.
  Green, R.C., Berg, J.S., Grody, W.W., Kalia, S.S., Korf, B.R., Martin, C.L., McGuire, A.L., Nussbaum, R.L., O'Daniel, J.M., Ormond, K.E., Rehm, H.L., Watson, M.S., Williams, M.S., and Biesecker, L.G. 2013. ACMG recommendations for reporting of incidental findings in clinical exome and genome sequencing. Genet. Med. 15:565‐574.
  Hall, I.M. and Quinlan, A.R. 2012. Detection and interpretation of genomic structural variation in mammals. Methods Mol. Biol. 838:225‐248.
  Hastings, R., de Wert, G., Fowler, B., Krawczak, M., Vermeulen, E., Bakker, E., Borry, P., Dondorp, W., Nijsingh, N., Barton, D., Schmidtke, J., van El, C.G., Vermeesch, J., Stol, Y., Carmen Howard, H., and Cornel, M.C. 2012. The changing landscape of genetic testing and its impact on clinical and laboratory services and research in Europe. Eur. J. Hum. Genet. 20:911‐916.
  Haug, K., Salek, R.M., Conesa, P., Hastings, J., de Matos, P., Rijnbeek, M., Mahendraker, T., Williams, M., Neumann, S., Rocca‐Serra, P., Maguire, E., González‐Beltrán, A., Sansone, S.A., Griffin, J.L., and Steinbeck, C. 2013. MetaboLights—An open‐access general‐purpose repository for metabolomics studies and associated meta‐data. Nucleic Acids Res. 41:D781‐D786.
  Hirsch, V., Adger‐Johnson, D., Campbell, B., Goldstein, S., Brown, C., Elkins, W.R., and Montefiori, D.C. 1997. A molecularly cloned, pathogenic, neutralization‐resistant simian immunodeficiency virus, SIVsmE543‐3. J. Virol. 71:1608‐1620.
  Homer, N., Merriman, B., and Nelson, S.F. 2009. BFAST: An alignment tool for large scale genome resequencing. PLoS One 4:e7767.
  Hormozdiari, F., Alkan, C., Eichler, E.E., and Sahinalp, S.C. 2009. Combinatorial algorithms for structural variation detection in high‐throughput sequenced genomes. Genome Res. 19:1270‐1278.
  Howard, H.J., Horaitis, O., Cotton, R.G., Vihinen, M., Dalgleish, R., Robinson, P., Brookes, A.J., Axton, M., Hoffmann, R., and Tuffery‐Giraud, S. 2010. The Human Variome Project (HVP) 2009 forum towards establishing standards. Hum. Mutat. 31:366‐367.
  Jacob, H.J., Abrams, K., Bick, D.P., Brodie, K., Dimmock, D.P., Farrell, M., Geurts, J., Harris, J., Helbling, D., Joers, B.J., Kliegman, R., Kowalski, G., Lazar, J., Margolis, D.A., North, P., Northup, J., Roquemore‐Goins, A., Scharer, G., Shimoyama, M., Strong, K., Taylor, B., Tsaih, S.W., Tschannen, M.R., Veith, R.L., Wendt‐Andrae, J., Wilk, B., and Worthey, E.A. 2013. Genomics in clinical practice: Lessons from the front lines. Sci. Transl. Med. 5:194cm5.
  Jiang, Z., Rokhsar, D.S., and Harland, R.M. 2009. Old can be new again: HAPPY whole genome sequencing, mapping and assembly. Int. J. Biol. Sci. 5:298‐303.
  Karakoc, E., Alkan, C., O'Roak, B.J., Dennis, M.Y., Vives, L., Mark, K., Rieder, M.J., Nickerson, D.A., and Eichler, E.E. 2012. Detection of structural variants and indels within exome data. Nat. Methods 9:176‐178.
  Kenna, K.P., McLaughlin, R.L., Hardiman, O., and Bradley, D.G. 2013. Using reference databases of genetic variation to evaluate the potential pathogenicity of candidate disease variants. Hum. Mutat. 34:836‐841.
  Kidd, J.M., Graves, T., Newman, T.L., Fulton, R., Hayden, H.S., Malig, M., Kallicki, J., Kaul, R., Wilson, R.K., and Eichler, E.E. 2010. A human genome structural variation sequencing resource reveals insights into mutational mechanisms. Cell 143:837‐847.
  Koboldt, D.C., Chen, K., Wylie, T., Larson, D.E., McLellan, M.D., Mardis, E.R., Weinstock, G.M., Wilson, R.K., and Ding, L. 2009. VarScan: Variant detection in massively parallel sequencing of individual and pooled samples. Bioinformatics 25:2283‐2285.
  Kogelnik, A.M., Lott, M.T., Brown, M.D., Navathe, S.B., and Wallace, D.C. 1997. MITOMAP: An update on the status of the human mitochondrial genome database. Nucleic Acids Res. 25:196‐199.
  Kogelnik, A.M., Lott, M.T., Brown, M.D., Navathe, S.B., and Wallace, D.C. 1998. MITOMAP: A human mitochondrial genome database–1998 update. Nucleic Acids Res. 26:112‐115.
  Koolen, D.A., Veltman, J.A., Renier, W.O., Droog, R.P., van Kessel, A.G., and de Vries, B.B. 2004. Chromosome 22q11 deletion and pachygyria characterized by array‐based comparative genomic hybridization. Am. J. Med. Genet. A 131:322‐324.
  Korf, B.R. and Rehm, H.L. 2013. New approaches to molecular diagnosis. JAMA 309:1511‐1521.
  Ku, C.S., Cooper, D.N., Ziogas, D.E., Halkia, E., Tzaphlidou, M., and Roukos, D.H. 2013a. Research and clinical applications of cancer genome sequencing. Curr. Opin. Obstet. Gynecol. 25:3‐10.
  Ku, C.S., Polychronakos, C., Tan, E.K., Naidoo, N., Pawitan, Y., Roukos, D.H., Mort, M., and Cooper, D.N. 2013b. A new paradigm emerges from the study of de novo mutations in the context of neurodevelopmental disease. Mol. Psychiatry 18:141‐153.
  Kumar, P., Henikoff, S., and Ng, P.C. 2009. Predicting the effects of coding non‐synonymous variants on protein function using the SIFT algorithm. Nat. Protoc. 4:1073‐1081.
  Kurzweil, V.C., Getman, M., NISC Comparative Sequencing Program, Green, E.D., and Lane, R.P. 2009. Dynamic evolution of V1R putative pheromone receptors between Mus musculus and Mus spretus. BMC Genomics 10:74.
  Lalonde, E., Albrecht, S., Ha, K.C., Jacob, K., Bolduc, N., Polychronakos, C., Dechelotte, P., Majewski, J., and Jabado, N. 2010. Unexpected allelic heterogeneity and spectrum of mutations in Fowler syndrome revealed by next‐generation exome sequencing. Hum. Mutat. 31:918‐923.
  Lander, E.S. 2011. Initial impact of the sequencing of the human genome. Nature 470:187‐197.
  Lander, E.S., Linton, L.M., Birren, B., Nusbaum, C., Zody, M.C., Baldwin, J., Devon, K., Dewar, K., Doyle, M., FitzHugh, W., Funke, R., Gage, D., et al. 2001. Initial sequencing and analysis of the human genome. Nature 409:860‐921.
  Langmead, B., Trapnell, C., Pop, M., and Salzberg, S.L. 2009. Ultrafast and memory‐efficient alignment of short DNA sequences to the human genome. Genome Biol. 10:R25.
  Lappalainen, I., Lopez, J., Skipper, L., Hefferon, T., Spalding, J.D., Garner, J., Chen, C., Maguire, M., Corbett, M., Zhou, G., Paschall, J., Ananiev, V., Flicek, P., and Church, D.M. 2013. DbVar and DGVa: Public archives for genomic structural variation. Nucleic Acids Res. 41:D936‐D941.
  Lee, H. and Schatz, M.C. 2012. Genomic dark matter: The reliability of short read mapping illustrated by the genome mappability score. Bioinformatics 28:2097‐2105.
  Li, H. 2012. Exploring single‐sample SNP and INDEL calling with whole‐genome de novo assembly. Bioinformatics 28:1838‐1844.
  Li, H. and Durbin, R. 2009. Fast and accurate short read alignment with Burrows‐Wheeler transform. Bioinformatics 25:1754‐1760.
  Li, M., Pang, S.Y., Song, Y., Kung, M.H., Ho, S.L., and Sham, P.C. 2013. Whole exome sequencing identifies a novel mutation in the transglutaminase 6 gene for spinocerebellar ataxia in a Chinese family. Clin. Genet. 83:269‐273.
  Li, R., Fan, W., Tian, G., Zhu, H., He, L., Cai, J., Huang, Q., Cai, Q., Li, B., Bai, Y., Zhang, Z., Zhang, Y., et al. 2010. The sequence and de novo assembly of the giant panda genome. Nature 463:311‐317.
  Li, S., Li, R., Li, H., Lu, J., Bolund, L., Schierup, M.H., and Wang, J. 2013. SOAPindel: Efficient identification of indels from short paired reads. Genome Res. 23:195‐200.
  Li, Y., Vinckenbosch, N., Huerta‐Sanchez, E., Jiang, T., Jiang, H., Albrechtsen, A., Andersen, G., Cao, H., Korneliussen, T., Grarup, N., Guo, Y., Hellman, I., Jin, X., Li, Q., Liu, J., Liu, X., Sparsø, T., Tang, M., Wu, H., Wu, R., Yu, C., Zheng, H., Astrup, A., Bolund, L., Holmkvist, J., Jørgensen, T., Kristiansen, K., Schmitz, O., Schwartz, T.W., Zhang, X., Li, R., Yang, H., Wang, J., Hansen, T., Pedersen, O., Nielsen, R., and Wang, J. 2010. Resequencing of 200 human exomes identifies an excess of low‐frequency non‐synonymous coding variants. Nat. Genet. 42:969‐972.
  Lin, Y., Li, J., Shen, H., Zhang, L., Papasian, C.J., and Deng, H.W. 2011. Comparative studies of de novo assembly tools for next‐generation sequencing technologies. Bioinformatics 27:2031‐2037.
  Lindberg, J., Klevebring, D., Liu, W., Neiman, M., Xu, J., Wiklund, P., Wiklund, F., Mills, I.G., Egevad, L., and Grönberg, H. 2013. Exome sequencing of prostate cancer supports the hypothesis of independent tumour origins. Eur. Urol. 63:347‐353.
  Luo, R., Liu, B., Xie, Y., Li, Z., Huang, W., Yuan, J., He, G., Chen, Y., Pan, Q., Liu, Y., Tang, J., Wu, G., Zhang, H., Shi, Y., Liu, Y., Yu, C., Wang, B., Lu, Y., Han, C., Cheung, D.W., Yiu, S.M., Peng, S., Xiaoqian, Z., Liu, G., Liao, X., Li, Y., Yang, H., Wang, J., Lam, T.W., and Wang, J. 2012. SOAPdenovo2: An empirically improved memory‐efficient short‐read de novo assembler. Gigascience 1:18.
  Lupski, J.R. 1998. Genomic disorders: Structural features of the genome can lead to DNA rearrangements and human disease traits. Trends Genet. 14:417‐422.
  Lupski, J.R., Reid, J.G., Gonzaga‐Jauregui, C., Rio Deiros, D., Chen, D.C., Nazareth, L., Bainbridge, M., Dinh, H., Jing, C., Wheeler, D.A., McGuire, A.L., Zhang, F., Stankiewicz, P., Halperin, J.J., Yang, C., Gehman, C., Guo, D., Irikat, R.K., Tom, W., Fantin, N.J., Muzny, D.M., and Gibbs, R.A. 2010. Whole‐genome sequencing in a patient with Charcot‐Marie‐Tooth neuropathy. N. Engl. J. Med. 362:1181‐1191.
  Mardis, E.R. 2012. Applying next‐generation sequencing to pancreatic cancer treatment. Nat. Rev. Gastroenterol. Hepatol. 9:477‐486.
  Marques‐Bonet, T., Cheng, Z., She, X., Eichler, E.E., and Navarro, A. 2008. The genomic distribution of intraspecific and interspecific sequence divergence of human segmental duplications relative to human/chimpanzee chromosomal rearrangements. BMC Genomics 9:384.
  Martinez‐Alcantara, A., Ballesteros, E., Feng, C., Rojas, M., Koshinsky, H., Fofanov, V.Y., Havlak, P., and Fofanov, Y. 2009. PIQA: Pipeline for Illumina G1 genome analyzer data quality assessment. Bioinformatics 25:2438‐2439.
  Mayer, A.N., Dimmock, D.P., Arca, M.J., Bick, D.P., Verbsky, J.W., Worthey, E.A., Jacob, H.J., and Margolis, D.A. 2011. A timely arrival for genomic medicine. Genet. Med. 13:195‐196.
  McCarroll, S.A., Hadnott, T.N., Perry, G.H., Sabeti, P.C., Zody, M.C., Barrett, J.C., Dallaire, S., Gabriel, S.B., Lee, C., Daly, M.J., Altshuler, D.M.; International HapMap Consortium. 2006. Common deletion polymorphisms in the human genome. Nat. Genet. 38:86‐92.
  McKenna, A., Hanna, M., Banks, E., Sivachenko, A., Cibulskis, K., Kernytsky, A., Garimella, K., Altshuler, D., Gabriel, S., Daly, M., and DePristo, M.A. 2010. The Genome Analysis Toolkit: A MapReduce framework for analyzing next‐generation DNA sequencing data. Genome Res. 20:1297‐1303.
  McKernan, K.J., Peckham, H.E., Costa, G.L., McLaughlin, S.F., Fu, Y., Tsung, E.F., Clouser, C.R., Duncan, C., Ichikawa, J.K., Lee, C.C., Zhang, Z., Ranade, S.S., Dimalanta, E.T., Hyland, F.C., Sokolsky, T.D., Zhang, L., Sheridan, A., Fu, H., Hendrickson, C.L., Li, B., Kotler, L., Stuart, J.R., Malek, J.A., Manning, J.M., Antipova, A.A., Perez, D.S., Moore, M.P., Hayashibara, K.C., Lyons, M.R., Beaudoin, R.E., Coleman, B.E., Laptewicz, M.W., Sannicandro, A.E., Rhodes, M.D., Gottimukkala, R.K., Yang, S., Bafna, V., Bashir, A., MacBride, A., Alkan, C., Kidd, J.M., Eichler, E.E., Reese, M.G., De La Vega, F.M., and Blanchard, A.P. 2009. Sequence and structural variation in a human genome uncovered by short‐read, massively parallel ligation sequencing using two‐base encoding. Genome Res. 19:1527‐1541.
  Merikangas, A.K., Corvin, A.P., and Gallagher, L. 2009. Copy‐number variants in neurodevelopmental disorders: Promises and challenges. Trends Genet. 25:536‐544.
  Mijuskovic, M., Brown, S.M., Tang, Z., Lindsay, C.R., Efstathiadis, E., Deriano, L., and Roth, D.B. 2012. A streamlined method for detecting structural variants in cancer genomes by short read paired‐end sequencing. PLoS One 7:e48314.
  Milenkovic, V.M., Langmann, T., Schreiber, R., Kunzelmann, K., and Weber, B.H. 2008. Molecular evolution and functional divergence of the bestrophin protein family. BMC Evol. Biol. 8:72.
  Miller, J.R., Koren, S., and Sutton, G. 2010. Assembly algorithms for next‐generation sequencing data. Genomics 95:315‐327.
  Mitchell, A.A., Zwick, M.E., Chakravarti, A., and Cutler, D.J. 2004. Discrepancies in dbSNP confirmation rates and allele frequency distributions from varying genotyping error rates and patterns. Bioinformatics 20:1022‐1032.
  Moorthie, S., Hall, A., and Wright, C.F. 2013. Informatics and clinical genome sequencing: Opening the black box. Genet. Med. 15:165‐171.
  Nagarajan, N. and Pop, M. 2009. Parametric complexity of sequence assembly: Theory and applications to next generation sequencing. J. Comput. Biol. 16:897‐908.
  NCBI Resource Coordinators. 2013. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 41:D8‐D20.
  Ng, B.G., Buckingham, K.J., Raymond, K., Kircher, M., Turner, E.H., He, M., Smith, J.D., Eroshkin, A., Szybowska, M., Losfeld, M.E., Chong, J.X., Kozenko, M., Li, C., Patterson, M.C., Gilbert, R.D., Nickerson, D.A., Shendure, J., Bamshad, M.J.; University of Washington Center for Mendelian Genomics, and Freeze, H.H. 2013. Mosaicism of the UDP‐galactose transporter SLC35A2 causes a congenital disorder of glycosylation. Am. J. Hum. Genet. 92:632‐636.
  Ng, S.B., Turner, E.H., Robertson, P.D., Flygare, S.D., Bigham, A.W., Lee, C., Shaffer, T., Wong, M., Bhattacharjee, A., Eichler, E.E., Bamshad, M., Nickerson, D.A., and Shendure, J. 2009. Targeted capture and massively parallel sequencing of 12 human exomes. Nature 461:272‐276.
  Ng, S.B., Buckingham, K.J., Lee, C., Bigham, A.W., Tabor, H.K., Dent, K.M., Huff, C.D., Shannon, P.T., Jabs, E.W., Nickerson, D.A., Shendure, J., and Bamshad, M.J. 2010. Exome sequencing identifies the cause of a mendelian disorder. Nat. Genet. 42:30‐35.
  O'Rawe, J., Guangqing, S., Sun, G., Wu, Y., Wang, W., Hu, J., Bodily, P., Tian, L., Hakonarson, H., Johnson, W.E., Wei, Z., Wang, K., and Lyon, G.J. 2013. Low concordance of multiple variant‐calling pipelines: Practical implications for exome and genome sequencing. Genome Med. 5:28.
  O'Roak, B.J., Deriziotis, P., Lee, C., Vives, L., Schwartz, J.J., Girirajan, S., Karakoc, E., Mackenzie, A.P., Ng, S.B., Baker, C., Rieder, M.J., Nickerson, D.A., Bernier, R., Fisher, S.E., Shendure, J., and Eichler, E.E. 2011. Exome sequencing in sporadic autism spectrum disorders identifies severe de novo mutations. Nat. Genet. 43:585‐589.
  O'Roak, B.J., Vives, L., Fu, W., Egertson, J.D., Stanaway, I.B., Phelps, I.G., Carvill, G., Kumar, A., Lee, C., Ankenman, K., Munson, J., Hiatt, J.B., Turner, E.H., Levy, R., O'Day, D.R., Krumm, N., Coe, B.P., Martin, B.K., Borenstein, E., Nickerson, D.A., Mefford, H.C., Doherty, D., Akey, J.M., Bernier, R., Eichler, E.E., and Shendure, J. 2012. Multiplex targeted sequencing identifies recurrently mutated genes in autism spectrum disorders. Science 338:1619‐1622.
  Osborne, J.D., Flatow, J., Holko, M., Lin, S.M., Kibbe, W.A., Zhu, L.J., Danila, M.I., Feng, G., and Chisholm, R.L. 2009. Annotating the human genome with Disease Ontology. BMC Genomics 10:S6.
  Pagon, R.A. 2006. GeneTests: An online genetic information resource for health care providers. J. Med. Libr. Assoc. 94:343‐348.
  Pagon, R.A., Tarczy‐Hornoch, P., Baskin, P.K., Edwards, J.E., Covington, M.L., Espeseth, M., Beahler, C., Bird, T.D., Popovich, B., Nesbitt, C., Dolan, C., Marymee, K., Hanson, N.B., Neufeld‐Kaiser, W., Grohs, G.M., Kicklighter, T., Abair, C., Malmin, A., Barclay, M., and Palepu, R.D. 2002. GeneTests‐GeneClinics: Genetic testing information for a growing audience. Hum. Mutat. 19:501‐509.
  Park, D.J., Lesueur, F., Nguyen‐Dumont, T., Pertesi, M., Odefrey, F., Hammet, F., Neuhausen, S.L., John, E.M., Andrulis, I.L., Terry, M.B., Daly, M., Buys, S., Le Calvez‐Kelm, F., Lonie, A., Pope, B.J., Tsimiklis, H., Voegele, C., Hilbers, F.M., Hoogerbrugge, N., Barroso, A., Osorio, A.; Breast Cancer Family Registry; Kathleen Cuningham Foundation Consortium for Research into Familial Breast Cancer, Giles, G.G., Devilee, P., Benitez, J., Hopper, J.L., Tavtigian, S.V., Goldgar, D.E., and Southey, M.C. 2012. Rare mutations in XRCC2 increase the risk of breast cancer. Am. J. Hum. Genet. 90:734‐739.
  Paszkiewicz, K. and Studholme, D.J. 2010. De novo assembly of short sequence reads. Brief. Bioinform. 11:457‐472.
  Patrinos, G.P., Smith, T.D., Howard, H., Al‐Mulla, F., Chouchane, L., Hadjisavvas, A., Hamed, S.A., Li, X.T., Marafie, M., Ramesar, R.S., Ramos, F.J., de Ravel, T., El‐Ruby, M.O., Shrestha, T.R., Sobrido, M.J., Tadmouri, G., Witsch‐Baumgartner, M., Zilfalil, B.A., Auerbach, A.D., Carpenter, K., Cutting, G.R., Dung, V.C., Grody, W., Hasler, J., Jorde, L., Kaput, J., Macek, M., Matsubara, Y., Padilla, C., Robinson, H., Rojas‐Martinez, A., Taylor, G.R., Vihinen, M., Weber, T., Burn, J., Qi, M., Cotton, R.G., Rimoin, D.; International Confederation of Countries Advisory Council. 2012. Human Variome Project country nodes: Documenting genetic information within a country. Hum. Mutat. 33:1513‐1519.
  Pollard, K.S., Hubisz, M.J., Rosenbloom, K.R., and Siepel, A. 2010. Detection of nonneutral substitution rates on mammalian phylogenies. Genome Res. 20:110‐121.
  Pop, M. 2009. Genome assembly reborn: Recent computational challenges. Brief. Bioinform. 10:354‐366.
  Porter, J.D. and Baker, R.S. 1997. Absence of oculomotor and trochlear motoneurons leads to altered extraocular muscle development in the Wnt‐1 null mutant mouse. Brain Res. Dev. Brain Res. 100:121‐126.
  Priest, J.R., Girirajan, S., Vu, T.H., Olson, A., Eichler, E.E., and Portman, M.A. 2012. Rare copy number variants in isolated sporadic and syndromic atrioventricular septal defects. Am. J. Med. Genet. A 158:1279‐1284.
  Quail, M.A., Kozarewa, I., Smith, F., Scally, A., Stephens, P.J., Durbin, R., Swerdlow, H., and Turner, D.J. 2008. A large genome center's improvements to the Illumina sequencing system. Nat. Methods 5:1005‐1010.
  Reese, M.G., Moore, B., Batchelor, C., Salas, F., Cunningham, F., Marth, G.T., Stein, L., Flicek, P., Yandell, M., and Eilbeck, K. 2010. A standard variation file format for human genome sequences. Genome Biol. 11:R88.
  Reich, D.E., Gabriel, S.B., and Altshuler, D. 2003. Quality and completeness of SNP databases. Nat. Genet. 33:457‐458.
  Reis, L.M., Tyler, R.C., Muheisen, S., Raggio, V., Salviati, L., Han, D.P., Costakos, D., Yonath, H., Hall, S., Power, P., and Semina, E.V. 2013. Whole exome sequencing in dominant cataract identifies a new causative factor, CRYBA2, and a variety of novel alleles in known genes. Hum. Genet. 132:761‐770.
  Richards, C.S., Bale, S., Bellissimo, D.B., Das, S., Grody, W.W., Hegde, M.R., Lyon, E., Ward, B.E.; Molecular Subcommittee of the ACMG Laboratory Quality Assurance Committee. 2008. ACMG recommendations for standards for interpretation and reporting of sequence variations: Revisions 2007. Genet. Med. 10:294‐300.
  Ring, H.Z., Kwok, P.Y., and Cotton, R.G. 2006. Human Variome Project: An international collaboration to catalogue human genetic variation. Pharmacogenomics 7:969‐972.
  Rosenberg, S.M. and Hastings, P.J. 2004. Rebuttal: Adaptive mutation in Escherichia coli (Foster). J. Bacteriol. 186:4853.
  Rosenfeld, J.A., Mason, C.E., and Smith, T.M. 2012. Limitations of the human reference genome for personalized genomics. PLoS One 7:e40294.
  Roukos, D.H. and Ku, C.S. 2012. Clinical cancer genome and precision medicine. Ann. Surg. Oncol. 19:3646‐3650.
  Ruffalo, M., LaFramboise, T., and Koyutürk, M. 2011. Comparative analysis of algorithms for next‐generation sequencing read alignment. Bioinformatics 27:2790‐2806.
  Safran, M., Dalah, I., Alexander, J., Rosen, N., Iny Stein, T., Shmoish, M., Nativ, N., Bahir, I., Doniger, T., Krug, H., Sirota‐Madi, A., Olender, T., Golan, Y., Stelzer, G., Harel, A., and Lancet, D. 2010. GeneCards Version 3: The human gene integrator. Database (Oxford) 2010:baq020.
  Sasson, O., Kaplan, N., and Linial, M. 2006. Functional annotation prediction: All for one and one for all. Protein Sci. 15:1557‐1562.
  Saunders, C.J., Miller, N.A., Soden, S.E., Dinwiddie, D.L., Noll, A., Alnadi, N.A., Andraws, N., Patterson, M.L., Krivohlavek, L.A., Fellis, J., Humphray, S., Saffrey, P., Kingsbury, Z., Weir, J.C., Betley, J., Grocock, R.J., Margulies, E.H., Farrow, E.G., Artman, M., Safina, N.P., Petrikin, J.E., Hall, K.P., and Kingsmore, S.F. 2012. Rapid whole‐genome sequencing for genetic disease diagnosis in neonatal intensive care units. Sci. Transl. Med. 4:154ra135.
  Schnoes, A.M., Brown, S.D., Dodevski, I., and Babbitt, P.C. 2009. Annotation error in public databases: Misannotation of molecular function in enzyme superfamilies. PLoS Comput. Biol. 5:e1000605.
  Schrijver, I., Aziz, N., Farkas, D.H., Furtado, M., Gonzalez, A.F., Greiner, T.C., Grody, W.W., Hambuch, T., Kalman, L., Kant, J.A., Klein, R.D., Leonard, D.G., Lubin, I.M., Mao, R., Nagan, N., Pratt, V.M., Sobel, M.E., Voelkerding, K.V., and Gibson, J.S. 2012. Opportunities and challenges associated with clinical diagnostic genome sequencing: A report of the Association for Molecular Pathology. J. Mol. Diagn. 14:525‐540.
  Schwarz, J.M., Rödelsperger, C., Schuelke, M., and Seelow, D. 2010. MutationTaster evaluates disease‐causing potential of sequence alterations. Nat. Methods 7:575‐576.
  Shapiro, B. and Hofreiter, M. 2010. Analysis of ancient human genomes: using next generation sequencing, 20‐fold coverage of the genome of a 4,000‐year‐old human from Greenland has been obtained. Bioessays 32:388‐391.
  Sharp, A.J. 2009. Emerging themes and new challenges in defining the role of structural variation in human disease. Hum. Mutat. 30:135‐144.
  Shen, H., Li, J., Xu, C., Jiang, Y., Wu, Z., Zhao, F., Liao, L., Chen, J., Lin, Y., Tian, Q., Papasian, C.J., and Deng, H.W. 2013. Comprehensive characterization of human genome variation by high coverage whole‐genome sequencing of forty four Caucasians. PLoS One 8:e59494.
  Shen, Y., Wan, Z., Coarfa, C., Drabek, R., Chen, L., Ostrowski, E.A., Liu, Y., Weinstock, G.M., Wheeler, D.A., Gibbs, R.A., and Yu, F. 2010. A SNP discovery method to assess variant allele probability from next‐generation resequencing data. Genome Res. 20:273‐280.
  Sheth, N., Roca, X., Hastings, M.L., Roeder, T., Krainer, A.R., and Sachidanandam, R. 2006. Comprehensive splice‐site analysis using comparative genomics. Nucleic Acids Res. 34:3955‐3967.
  Sherry, S.T., Ward, M.H., Kholodov, M., Baker, J., Phan, L., Smigielski, E.M., and Sirotkin, K. 2001. dbSNP: the NCBI database of genetic variation. Nucleic Acids Res. 29:308‐311.
  Simpson, J.T., Wong, K., Jackman, S.D., Schein, J.E., Jones, S.J., and Birol, I. 2009. ABySS: A parallel assembler for short read sequence data. Genome Res. 19:1117‐1123.
  Smith, A.D., Xuan, Z., and Zhang, M.Q. 2008. Using quality scores and longer reads improves accuracy of Solexa read mapping. BMC Bioinformatics 9:128.
  Smith, B., Ashburner, M., Rosse, C., Bard, J., Bug, W., Ceusters, W., Goldberg, L.J., Eilbeck, K., Ireland, A., Mungall, C.J.; OBI Consortium, Leontis, N., Rocca‐Serra, P., Ruttenberg, A., Sansone, S.A., Scheuermann, R.H., Shah, N., Whetzel, P.L., and Lewis, S. 2007. The OBO Foundry: Coordinated evolution of ontologies to support biomedical data integration. Nat. Biotechnol. 25:1251‐1255.
  Smith, D.R., Quinlan, A.R., Peckham, H.E., Makowsky, K., Tao, W., Woolf, B., Shen, L., Donahue, W.F., Tusneem, N., Stromberg, M.P., Stewart, D.A., Zhang, L., Ranade, S.S., Warner, J.B., Lee, C.C., Coleman, B.E., Zhang, Z., McLaughlin, S.F., Malek, J.A., Sorenson, J.M., Blanchard, A.P., Chapman, J., Hillman, D., Chen, F., Rokhsar, D.S., McKernan, K.J., Jeffries, T.W., Marth, G.T., and Richardson, P.M. 2008. Rapid whole‐genome mutational profiling using next‐generation sequencing technologies. Genome Res. 18:1638‐1642.
  Smith, T.D., Robinson, H.M., and Cotton, R.G. 2012. The Human Variome Project Beijing meeting. J. Med. Genet. 49:284‐289.
  Smith, T.F. and Zhang, X. 1997. The challenges of genome sequence annotation or the devil is in the details. Nat. Biotechnol. 15:1222‐1223.
  Snape, K., Ruark, E., Tarpey, P., Renwick, A., Turnbull, C., Seal, S., Murray, A., Hanks, S., Douglas, J., Stratton, M.R., and Rahman, N. 2012. Predisposition gene identification in common cancers by exome sequencing: Insights from familial breast cancer. Breast Cancer Res. Treat. 134:429‐433.
  Sneddon, T.P. and Church, D.M. 2012. Online resources for genomic structural variation. Methods Mol. Biol. 838:273‐289.
  Stankiewicz, P. and Lupski, J.R. 2010. Structural variation in the human genome and its role in disease. Annu. Rev. Med. 61:437‐455.
  Stenson, P.D., Ball, E.V., Mort, M., Phillips, A.D., Shiel, J.A., Thomas, N.S., Abeysinghe, S., Krawczak, M., and Cooper, D.N. 2003. Human Gene Mutation Database (HGMD): 2003 update. Hum. Mutat. 21:577‐581.
  Stenson, P.D., Mort, M., Ball, E.V., Howells, K., Phillips, A.D., Thomas, N.S., and Cooper, D.N. 2009. The Human Gene Mutation Database: 2008 update. Genome Med. 1:13.
  Stubbs, A., McClellan, E.A., Horsman, S., Hiltemann, S.D., Palli, I., Nouwens, S., Koning, A.H., Hoogland, F., Reumers, J., Heijsman, D., Swagemakers, S., Kremer, A., Meijerink, J., Lambrechts, D., and van der Spek, P.J. 2012. Huvariome: A web server resource of whole genome next‐generation sequencing allelic frequencies to aid in pathological candidate gene selection. J. Clin. Bioinforma. 2:19.
  Sundquist, A., Ronaghi, M., Tang, H., Pevzner, P., and Batzoglou, S. 2007. Whole‐genome sequencing and assembly with high‐throughput, short‐read technologies. PLoS One 2:e484.
  Thomas, P.D., Mi, H., and Lewis, S. 2007. Ontology annotation: Mapping genomic regions to biological function. Curr. Opin. Chem. Biol. 11:4‐11.
  Tong, M.Y., Cassa, C.A., and Kohane, I.S. 2011. Automated validation of genetic variants from large databases: Ensuring that variant references refer to the same genomic locations. Bioinformatics 27:891‐893.
  Treangen, T.J. and Salzberg, S.L. 2011. Repetitive DNA and next‐generation sequencing: Computational challenges and solutions. Nat. Rev. Genet. 13:36‐46.
  Tucker, E.J., Mimaki, M., Compton, A.G., McKenzie, M., Ryan, M.T., and Thorburn, D.R. 2012. Next‐generation sequencing in molecular diagnosis: NUBPL mutations highlight the challenges of variant detection and interpretation. Hum. Mutat. 33:411‐418.
  Valencia, A. 2005. Automatic annotation of protein function. Curr. Opin. Struct. Biol. 15:267‐274.
  Venter, J.C., Adams, M.D., Myers, E.W., Li, P.W., Mural, R.J., Sutton, G.G., Smith, H.O., Yandell, M., Evans, C.A., Holt, R.A., Gocayne, J.D., Amanatides, P., et al. 2001. The sequence of the human genome. Science 291:1304‐1351.
  Vijay, N., Poelstra, J.W., Kunstner, A., and Wolf, J.B. 2013. Challenges and strategies in transcriptome assembly and differential gene expression quantification. A comprehensive in silico assessment of RNA‐seq experiments. Mol. Ecol. 22:620‐634.
  Wagner, M.J. 2013. Rare‐variant genome‐wide association studies: A new frontier in genetic analysis of complex traits. Pharmacogenomics 14:413‐424.
  Wang, K., Li, M., and Hakonarson, H. 2010. ANNOVAR: Functional annotation of genetic variants from high‐throughput sequencing data. Nucleic Acids Res. 38:e164.
  Wang, L., Tsutsumi, S., Kawaguchi, T., Nagasaki, K., Tatsuno, K., Yamamoto, S., Sang, F., Sonoda, K., Sugawara, M., Saiura, A., Hirono, S., Yamaue, H., Miki, Y., Isomura, M., Totoki, Y., Nagae, G., Isagawa, T., Ueda, H., Murayama‐Hosokawa, S., Shibata, T., Sakamoto, H., Kanai, Y., Kaneda, A., Noda, T., and Aburatani, H. 2012. Whole‐exome sequencing of human pancreatic cancers and characterization of genomic instability caused by MLH1 haploinsufficiency and complete deficiency. Genome Res. 22:208‐219.
  Wang, L.L., Li, Y., and Zhou, S.F. 2009. A bioinformatics approach for the phenotype prediction of nonsynonymous single nucleotide polymorphisms in human cytochromes P450. Drug Metab. Dispos. 37:977‐991.
  Wang, S.K., Hu, Y., Simmer, J.P., Seymen, F., Estrella, N.M., Pal, S., Reid, B.M., Yidirim, M., Bayram, M., Bartlett, J.D., and Hu, J.C. 2013. Novel KLK4 and MMP20 mutations discovered by whole‐exome sequencing. J. Dent. Res. 92:266‐271.
  Wappenschmidt, B., Becker, A.A., Hauke, J., Weber, U., Engert, S., Köhler, J., Kast, K., Arnold, N., Rhiem, K., Hahnen, E., Meindl, A., and Schmutzler, R.K. 2012. Analysis of 30 putative BRCA1 splicing mutations in hereditary breast and ovarian cancer families identifies exonic splice site mutations that escape in silico prediction. PLoS One 7:e50800.
  Waterston, R.H., Lander, E.S., and Sulston, J.E. 2003. More on the sequencing of the human genome. Proc. Natl. Acad. Sci. U.S.A. 100:3022‐3024; author reply 3025‐3026.
  Watt, S., Jiao, W., Brown, A.M., Petrocelli, T., Tran, B., Zhang, T., McPherson, J.D., Kamel‐Reid, S., Bedard, P.L., Onetto, N., Hudson, T.J., Dancey, J., Siu, L.L., Stein, L., and Ferretti, V. 2013. Clinical genomics information management software linking cancer genome sequence and clinical decisions. Genomics Apr 17. pii: S0888‐7543(13)0070‐0. doi: 10.1016/j.ygeno.2013.04.007. [Epub ahead of print].
  Weedon, M.N., Hastings, R., Caswell, R., Xie, W., Paszkiewicz, K., Antoniadi, T., Williams, M., King, C., Greenhalgh, L., Newbury‐Ecob, R., and Ellard, S. 2011. Exome sequencing identifies a DYNC1H1 mutation in a large pedigree with dominant axonal Charcot‐Marie‐Tooth disease. Am. J. Hum. Genet. 89:308‐312.
  Weir, B.A., Woo, M.S., Getz, G., Perner, S., Ding, L., Beroukhim, R., Lin, W.M., Province, M.A., Kraja, A., Johnson, L.A., Shah, K., Sato, M., et al. 2007. Characterizing the cancer genome in lung adenocarcinoma. Nature 450:893‐898.
  Wetzel, J., Kingsford, C., and Pop, M. 2011. Assessing the benefits of using mate‐pairs to resolve repeats in de novo short‐read prokaryotic assemblies. BMC Bioinformatics 12:95.
  Wheeler, D.A., Srinivasan, M., Egholm, M., Shen, Y., Chen, L., McGuire, A., He, W., Chen, Y.J., Makhijani, V., Roth, G.T., Gomes, X., Tartaro, K., Niazi, F., Turcotte, C.L., Irzyk, G.P., Lupski, J.R., Chinault, C., Song, X.Z., Liu, Y., Yuan, Y., Nazareth, L., Qin, X., Muzny, D.M., Margulies, M., Weinstock, G.M., Gibbs, R.A., and Rothberg, J.M. 2008. The complete genome of an individual by massively parallel DNA sequencing. Nature 452:872‐876.
  Wirth, B., Garbes, L., and Riessland, M. 2013. How genetic modifiers influence the phenotype of spinal muscular atrophy and suggest future therapeutic approaches. Curr. Opin. Genet. Dev. 23:330‐338.
  Wolfe, A.L., Felock, P.J., Hastings, J.C., Blau, C.U., and Hazuda, D.J. 1996. The role of manganese in promoting multimerization and assembly of human immunodeficiency virus type 1 integrase as a catalytically active complex on immobilized long terminal repeat substrates. J. Virol. 70:1424‐1432.
  Worthey, E.A., Mayer, A.N., Syverson, G.D., Helbling, D., Bonacci, B.B., Decker, B., Serpe, J.M., Dasu, T., Tschannen, M.R., Veith, R.L., Basehore, M.J., Broeckel, U., Tomita‐Mitchell, A., Arca, M.J., Casper, J.T., Margolis, D.A., Bick, D.P., Hessner, M.J., Routes, J.M., Verbsky, J.W., Jacob, H.J., and Dimmock, D.P. 2011. Making a definitive diagnosis: Successful clinical application of whole exome sequencing in a child with intractable inflammatory bowel disease. Genet. Med. 13:255‐262.
  Xi, R., Kim, T.M., and Park, P.J. 2010. Detecting structural variations in the human genome using next generation sequencing. Brief. Funct. Genomics 9:405‐415.
  Xiong, F., Gao, J., Li, J., Liu, Y., Feng, G., Fang, W., Chang, H., Xie, J., Zheng, H., Li, T., and He, L. 2009. Noncanonical and canonical splice sites: A novel mutation at the rare noncanonical splice‐donor cut site IVS4+1A>G of SEDL causes variable splicing isoforms in X‐linked spondyloepiphyseal dysplasia tarda. Eur. J. Hum. Genet. 17:510‐516.
  Yamaguchi, T., Hosomichi, K., Narita, A., Shirota, T., Tomoyasu, Y., Maki, K., and Inoue, I. 2011. Exome resequencing combined with linkage analysis identifies novel PTH1R variants in primary failure of tooth eruption in Japanese. J. Bone Miner. Res. 26:1655‐1661.
  Yu, T.W., Chahrour, M.H., Coulter, M.E., Jiralerspong, S., Okamura‐Ikeda, K., Ataman, B., Schmitz‐Abe, K., Harmin, D.A., Adli, M., Malik, A.N., D'Gama, A.M., Lim, E.T., Sanders, S.J., Mochida, G.H., Partlow, J.N., Sunu, C.M., Felie, J.M., Rodriguez, J., Nasir, R.H., Ware, J., Joseph, R.M., Hill, R.S., Kwan, B.Y., Al‐Saffar, M., Mukaddes, N.M., Hashmi, A., Balkhy, S., Gascon, G.G., Hisama, F.M., LeClair, E., Poduri, A., Oner, O., Al‐Saad, S., Al‐Awadi, S.A., Bastaki, L., Ben‐Omran, T., Teebi, A.S., Al‐Gazali, L., Eapen, V., Stevens, C.R., Rappaport, L., Gabriel, S.B., Markianos, K., State, M.W., Greenberg, M.E., Taniguchi, H., Braverman, N.E., Morrow, E.M., and Walsh, C.A. 2013. Using whole‐exome sequencing to identify inherited causes of autism. Neuron 77:259‐273.
  Zerbino, D.R. and Birney, E. 2008. Velvet: Algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 18:821‐829.
  Zhang, L., Zhang, J., Yang, J., Ying, D., Lau, Y.L., and Yang, W. 2013. PriVar: A toolkit for prioritizing SNVs and indels from next‐generation sequencing data. Bioinformatics 29:124‐125.
  Zhang, W., Chen, J., Yang, Y., Tang, Y., Shang, J., and Shen, B. 2011. A practical comparison of de novo genome assembly software tools for next‐generation sequencing technologies. PLoS One 6:e17915. doi: 10.1371/journal.pone.0017915.
  Zhang, Z., Burch, P.E., Cooney, A.J., Lanz, R.B., Pereira, F.A., Wu, J., Gibbs, R.A., Weinstock, G., and Wheeler, D.A. 2004. Genomic analysis of the nuclear receptor family: New insights into structure, regulation, and evolution from the rat genome. Genome Res. 14:580‐590.