Yale Web supplement accompanying:

Nucleotide-resolution analysis of structural variants using BreakSeq and a breakpoint library
Hugo Y. K. Lam*, Xinmeng Jasmine Mu*, Adrian M. Stütz, Andrea Tanzer, Philip D. Cayting, Michael Snyder, Philip M. Kim, Jan O. Korbel*,#,§, Mark B. Gerstein#,§
Nature Biotechnology. Epub: 2009 Dec 27


Structural Variants (SVs) are a major source of genomic variation. However, the resolution of most SV maps published to date has been insufficient for accurate breakpoint assignment. This has limited further analyses such as inferring SV ancestral states and formation mechanisms (e.g. NAHR, non-allelic homologous recombination). To this end, we collected ~2,000 SVs with breakpoints at nucleotide resolution and standardized their description in a library. We demonstrate how rapidly scanning the reads from personal genomes against this allows accurate identification of previously overlooked SVs, which we demonstrate by extensive PCR validations. Next, we characterize each breakpoint by inferring its ancestral state (through comparison against primates) and by classifying its formation mechanism. Finally, we analyze the breakpoint sequences with respect to genomic landmarks, chromosomal location, sequence motifs, and physical properties. We find, for example, that the occurrence of insertions and deletions is more balanced than previously reported and that NAHR-formed breakpoints are associated with relatively rigid and stable DNA helices.



Library Compilation:

  • Levy, S. et al. The diploid genome sequence of an individual human. PLoS Biol 5, e254 (2007).
  • Tuzun, E. et al. Fine-scale structural variation of the human genome. Nat Genet 37, 727-732 (2005).
  • Kidd, J.M. et al. Mapping and sequencing of structural variation from eight human genomes. Nature 453, 56-64 (2008).
  • Mills, R.E. et al. An initial map of insertion and deletion (INDEL) variation in the human genome. Genome Res 16, 1182-1190 (2006).
  • Perry, G.H. et al. The fine-scale and complex architecture of human copy-number variation. Am J Hum Genet 82, 685-695 (2008).
  • Wheeler, D.A. et al. The complete genome of an individual by massively parallel DNA sequencing. Nature 452, 872-876 (2008).
  • Korbel, J.O. et al. Paired-end mapping reveals extensive structural variation in the human genome. Science 318, 420-426 (2007).
  • Kim, P.M. et al. Analysis of copy number variants and segmental duplications in the human genome: Evidence for a change in the process of formation in recent evolutionary history. Genome Res 18, 1865-1874 (2008).

* These authors contributed equally to this work
# These authors co-directed this work 
§ To whom correspondence should be addressed