- Open Access
Mechanisms for human genomic rearrangements
PathoGeneticsvolume 1, Article number: 4 (2008)
Genomic rearrangements describe gross DNA changes of the size ranging from a couple of hundred base pairs, the size of an average exon, to megabases (Mb). When greater than 3 to 5 Mb, such changes are usually visible microscopically by chromosome studies. Human diseases that result from genomic rearrangements have been called genomic disorders. Three major mechanisms have been proposed for genomic rearrangements in the human genome. Non-allelic homologous recombination (NAHR) is mostly mediated by low-copy repeats (LCRs) with recombination hotspots, gene conversion and apparent minimal efficient processing segments. NAHR accounts for most of the recurrent rearrangements: those that share a common size, show clustering of breakpoints, and recur in multiple individuals. Non-recurrent rearrangements are of different sizes in each patient, but may share a smallest region of overlap whose change in copy number may result in shared clinical features among different patients. LCRs do not mediate, but may stimulate non-recurrent events. Some rare NAHRs can also be mediated by highly homologous repetitive sequences (for example, Alu, LINE); these NAHRs account for some of the non-recurrent rearrangements. Other non-recurrent rearrangements can be explained by non-homologous end-joining (NHEJ) and the Fork Stalling and Template Switching (FoSTeS) models. These mechanisms occur both in germ cells, where the rearrangements can be associated with genomic disorders, and in somatic cells in which such genomic rearrangements can cause disorders such as cancer. NAHR, NHEJ and FoSTeS probably account for the majority of genomic rearrangements in our genome and the frequency distribution of the three at a given locus may partially reflect the genomic architecture in proximity to that locus. We provide a review of the current understanding of these three models.
Genomic rearrangements describe mutational changes in the genome such as duplication, deletion, insertion, inversion, and translocation that are different from the traditional Watson-Crick base pair alterations . Genomic rearrangements can represent polymorphisms that are neutral in function, or they can also convey phenotypes via diverse mechanisms, including changing the copy number (that is, copy number variation or CNV) of dosage-sensitive genes, disrupting genes, creating fusion genes or other mechanisms (reviewed in ). The pathological conditions caused by genomic rearrangements are collectively defined as genomic disorders [1–3].
Typically, the term 'genomic rearrangements' is only used to describe gross DNA changes ranging from thousands to sometimes millions of base pairs that can cover clusters of different genes . Genomic rearrangements of this size have been considered to be clearly distinct from the small-scale gene mutations (for example, point mutations, indels) regarding not only the size of the rearranged DNA but also the underlying mechanisms for both the formation of the rearrangements and the conveying of phenotypes (that is, mechanisms upstream and downstream of the rearrangements). Monogenic point mutations usually reflect errors of DNA replication and/or repair [1, 2], whereas the gross genomic rearrangements are often caused by other mechanisms mediated or stimulated by genomic structural features (that is, genomic architecture) . Disease-causing genomic rearrangements can be recurrent, with a common size and fixed breakpoints (that is, breakpoints cluster); or non-recurrent with different sizes and distinct breakpoints for each event. The non-recurrent rearrangements share a common genomic region-of-overlap, the smallest region of overlap (SRO), that encompasses the locus associated with the conveyed genomic disorder (Figure 1).
Three major mechanisms have been proposed for genomic rearrangements in the human genome: non-allelic homologous recombination (NAHR), non-homologous end-joining (NHEJ) and the Fork Stalling and Template Switching (FoSTeS) models.
Recurrent genomic rearrangements are caused by NAHR
1. NAHR occurs preferentially at the so-called 'hotspots' inside low-copy repeats
A large number of DNA rearrangements of the same genomic interval have been observed in different individuals, that is they have a recurrent nature . Most recurrent genomic rearrangements are caused by NAHR between two low-copy repeats (LCRs, also called segmental duplications, SD) [4, 5]. LCRs are region-specific DNA blocks usually of 10 to 300 kilobase (kb) in size and of > 95% to 97% similarity to each other [5, 6]. Bailey and Eichler recently reviewed the distribution and evolution of mammalian LCRs (referred to as SD therein) .
Due to their high degree of sequence identity, non-allelic copies of LCRs, instead of the copies at the usual allelic positions, can sometimes be aligned in meiosis or mitosis. This so-called 'misalignment' and the subsequent crossover between them can result in genomic rearrangements in progeny cells. The non-allelic copies thus act as the mediators (that is, substrates) of the homologous recombination and they are responsible for the observed breakpoint clustering. When the two LCRs are located on the same chromosome and in direct orientation, NAHR between them causes duplication and/or deletion. When they are on the same chromosome but in opposite orientation, NAHR results in inversion of the fragment flanked by them  (Figure 2a). NAHR between repeats on different chromosomes can lead to chromosomal translocation .
Evidence has shown that the strand exchanges during NAHR are not distributed evenly along the LCRs, but cluster in narrow 'hotspots' [7–10]. DNA structures capable of inducing double-strand breaks (DSB) (such as palindromes, non-B conformation DNA, minisatellites and DNA transposons) have often been found near the NAHR hotspots, indicating a potential link between NAHR and DSB [11, 12]. At the same time, extensive linkage disequilibrium studies as well as detailed mapping of single loci clearly revealed that allelic homologous recombination (AHR) also has preferred hotspots [13–15].
Using sequencing-based approaches, De Raedt et al.  and Lindsay et al.  examined the fine structure of crossovers at the Neurofibromatosis type 1 (NF1; MIM162200) locus and Charcot-Marie-Tooth disease type 1A (CMT1A; MIM118220)/Hereditary Neuropathy with Liability to Pressure Palsies (HNPP; MIM162500) locus which often undergo NAHR. De Raedt et al. showed that NAHR hotspots can have strikingly similar positions in the LCR as the AHR hotspots in paralogous sequences . Lindsay et al. found that in the same sequence fragment, NAHR hotspots can be located just adjacent to AHR hotspots and share similar properties of the distribution of strand exchanges . These data provided evidence that the NAHR hotspots could be functionally closely related to AHR hotspots. Some of the NAHR and AHR hotspots still fall into the same regions in the current human genome; some of them may have overlapped in our ancestral genomes .
2. NAHR occurs in both meiotic and mitotic cells
NAHR in germ line cells leads to constitutional genomic rearrangements that can be manifested as genomic disorders [1, 18]. Genomic disorders can be either inherited or sporadic, depending on whether the rearrangement was transmitted through the germ line or occurred de novo . Prominent examples of inherited genomic disorders caused by NAHR include CMT1A and HNPP, caused by the recurrent duplication/deletion of a 1.4 Megabase (Mb) DNA fragment on chromosome 17p12; and sporadic genomic disorders include Potocki-Lupski syndrome (PTLS; MIM610883)/Smith-Magenis syndrome (SMS; MIM182290) caused by the reciprocal duplication/deletion on 17p11.2. The identification and detailed study of these rearrangements in patients has contributed significantly to our current knowledge on mechanisms of genomic rearrangements (the reports delineating the first recurrent disease-associated duplication rearrangement include [20, 21], recently reviewed in [1, 22]).
NAHR can also occur in mitosis, resulting in mosaic populations of somatic cells carrying genomic rearrangements. It is well appreciated that many cancers are related to somatic genomic rearrangements, some due to somatic NAHR [23, 24]. Also, in blood cells of healthy persons elaborate PCR assays have been able to detect mosaic duplication, deletion [25, 26] and inversion mediated by NAHR . CNVs have been shown between monozygotic twins, highlighting the potential occurrence of genomic rearrangements in somatic cells . Somatic NAHR can cause genomic disorders with mosaic manifestations, one example being the somatic NF1 deletions causing segmental neurofibromatosis . Interestingly, Dempsey et al. reported a patient with mosaicism for both deletion del(22)(q11.2q11.2) and the reciprocal duplication dup(22)(q11.2q11.2), which were probably caused by a mitotic NAHR event early in embryogenesis .
The same pairs of LCRs can mediate both mitotic and meiotic NAHR events. The LCRs called REPA and REPB mapping in 17p11.2 are important mediators of somatic NAHRs leading to the formation of the dicentric isochromosome i(17q) in human neoplasia [31, 32]; they also convey frequent meiotic NAHRs and cause this genomic locus to be highly variable in different populations . Mitotic NAHR may not share the same hotspots with the meiotic NAHR mediated by the same pair of LCRs, as suggested by the observation of Turner et al. in their sperm-typing assay, where the primer pairs amplifying across the meiotic recombination hotspots in sperm DNA could not amplify any recombinant products from the DNA of blood cells . The frequency of meiotic and mitotic NAHR on the same LCRs can be different as well . Furthermore, the frequency and LCR usage of mitotic NAHRs could, theoretically, even vary among different somatic tissues although, to our knowledge, no data are currently available on this topic. Future studies using precise techniques to examine more loci should reveal further details on the similarities and differences between meiotic and mitotic NAHR.
3. Minimal efficient processing segments are required for efficient NAHR
For NAHR to take place, there must be segments of a minimal length sharing extremely high similarity or identity between the LCRs, named minimal efficient processing segments (MEPS). The importance of MEPS for intra- and interchromosomal mitotic recombination was demonstrated by Waldman and Liskay using mouse cell culture  and by Rubnitz and Subramani using monkey cell culture . The placement of only two single-nucleotide mismatches, reducing the longest uninterrupted homology between two repeats from 232 to 134 base pairs (bp), resulted in a 20-fold reduction in intrachromosomal recombination . Also, the frequency of interchromosomal recombination drops sharply when the homology was reduced from 214 to 163 bp .
The MEPS in human meiosis appear to be in the range of 300 to 500 bp in length, as empirically estimated from the analysis of the genomic rearrangements in CMT1A/HNPP patients . The MEPS of mitotic NAHR may be different from meiotic NAHR. Steinmann et al.  identified nine somatic NF1 deletions conveyed by homology stretches shorter than 114 base pairs. Not all meiosis or mitosis events have the same demand of MEPS. With their single sperm/cell assay, Lam and Jeffreys  identified both meiotic and mitotic NAHR events between human alpha-globin genes mediated by matching fragments smaller than 50 bp . The modest demand on MEPS in this case could be related to the proximity between the two NAHR substrate repeats. The distance between two LCRs is known to be one of the genomic architectural features that influence the efficiency of NAHR [2, 5, 37], and it has been observed that larger-sized genomic rearrangements, utilizing LCRs located further apart, often correlate with larger LCRs [2, 5]. The repeats in the alpha-globin locus are only 5 kb away from each other, whereas the two LCRs of CMT1A/HNPP are separated by 1.4 Mb. Nevertheless, most of the rearrangements causing genomic disorders actually take place between LCRs which are 10 to 400 kb in length and have > 96% sequence identity [2, 37]. The most frequent microdeletion syndrome, DiGeorge/Velocardiofacial (DG/VCFS; MIM188400, 192340) (frequency 1/4,000–1/8,000), is mediated by LCRs on chromosome 22q11.2, of 240 kb in length and sharing 99.7% sequence identity [38, 39].
4. Reciprocal deletions and duplications do not occur at the same frequencies
The relative frequency of the reciprocal deletions and duplications from the NAHR events mediated by the same pair of LCRs is of both biological and clinical importance. In meiosis, NAHR can take place between paralogues on the same chromatid (intrachromatid), on sister chromatids (intrachromosomal or interchromatid) or on the homologous chromosomes (interchromosomal) [5, 18]. Between two directly oriented LCRs, interchromatid and interchromosomal rearrangements result in reciprocal duplication and deletion, whereas intrachromatid rearrangements can only lead to deletion (Figure 2b). Thus, at least theoretically, the frequency of deletions should be always higher than duplications. The difference between the frequency of deletions and duplications reflects the frequency of intrachromatid NAHR.
The prevalence of several reciprocal duplication/deletion syndromes such as CMT1A/HNPP, PTLS/SMS, dup22(q11.2q11.2)/DG/VCFS has been used to estimate the relative frequency of duplications and deletions mediated by the same pairs of LCRs. The pitfall of these calculations is that one or even both events might be embryonically lethal or phenotypically mild so that the carriers will not be clinically ascertained. Furthermore, selection would occur in both germ cells and the organism, and may act differently on duplication versus deletion syndromes. To overcome these challenges, two groups took an experimental approach that used a single-sperm PCR assay to measure the duplication and deletion events directly. Turner et al. analyzed four NAHR loci related to well-studied genomic disorders (Williams-Beuren Syndrome deletion (WBS; MIM194050) and 7q11.23 duplication (MIM609757); the AZFa deletion (azoospermia (MIM415000) associated) and its reciprocal duplication; the HNPP deletion and CMT1A duplication; and the SMS deletion and PTLS duplication) in the sperm populations of five persons. Strikingly, they found that all five persons consistently displayed an approximately 2:1 ratio of deletion versus duplication in all three autosomal loci. For the AZFa locus on the Y-chromosome, the observed deletion versus duplication ratio 4:1 was even higher . Thus, at least in meiosis, reciprocal duplications and deletions do not occur at equal frequency .
It is not clear how general this 'two deletions versus one duplication' rule is. Lam and Jeffreys [25, 26] also performed single-sperm assays on the alpha-globin locus in two persons. One person showed the same deletion and duplication frequency, while the other person had a higher rate of duplications than deletions at this locus. This discrepancy between the two studies could be due to experimental design, as Turner et al. specifically measured the NAHR events across the so-called 'hotspots' in the LCR whereas Lam and Jeffreys observed the entire globin locus and could thus also record NAHR events outside the hotspots and other non-NAHR rearrangements. However, it could also reflect true differences among different NAHR loci, probably predisposed by the local genomic architecture (LCR length, distance of LCRs and so on).
Pedigree analysis of the haplotypes flanking the LCRs has often been used to differentiate between intra- and interchromosomal rearrangements [40–45]. These studies have revealed different findings for different syndromes. However, the haplotype assay has the limitation of being unable to differentiate between intrachromatid and interchromatid events, so the comparison can only be made between intrachromosomal (intra- plus interchromatid) and interchromosomal NAHR.
The single-sperm assay, however, allows the assessment of the intrachromatid events by observing the difference between deletion and duplication frequencies. Turner et al.  concluded that intrachromatid NAHR dominates at all hotspots they examined and the interchromatid NAHRs are very rare, with a frequency 50-fold lower than interchromosomal NAHRs. That the deletion versus duplication ratio at AZFa locus on Y chromosome is even higher than at the autosomal loci is likely because of the lack of interchromosomal NAHR.
The conclusion of Turner et al. agrees with the study of the WBS locus by Bayes et al. . However, it awaits further confirmation from data at other loci before it can be accepted as a general rule. One should also bear in mind that this finding is only relevant for the NAHR events at the hotspots and does not describe any other types of rearrangements caused by different mechanisms.
5. NAHR can be different between males and females
There seem to be differences in NAHR frequency between male and female gametogenesis, as reflected by the different percentage of the two parental origins which were observed for several genomic disorders. The overwhelming majority of CMT1A duplications (nine in nine cases as reported in  and 26 in 28 cases in ) as well as 85% of spinal muscular atrophy (SMA; MIM253300) deletions  originate in spermatogenesis; whereas 80% of NF1 deletions are of maternal origin [8, 49]. These apparent differences in maternally and paternally originated rearrangements might be due to intrinsic differences in NAHR between male and female germ lines, or might also reflect different selection bias against the rearranged allele between male and female germ lines, or a combination of both. Epigenetic modifications in male and female gametogenesis and gametes might contribute to both processes. The observed differences between male and female rearrangements do not seem to affect all NAHR loci to the same extent: for SMS/PTLS, no significant parental differences have been observed [50, 51].
Whereas meiotic NAHRs causing genomic rearrangements either originate in or are inherited through the germ line of the previous generation, mitotic NAHRs occur in the somatic cells of the same individual who bears the rearrangements. It is intriguing that mitotic NAHR could also have a bias in females and males. Steinmann et al.  observed that 12 of their 13 segmental NF patients with deletions caused by somatic NAHR are females. The reason for this bias is not immediately obvious; it is not known if this bias is specific for the genomic locus or somatic tissues involved in the pathogenesis of NF, or whether it may reflect more general differences between male and female mitotic NAHR. Little data are available at the present time.
6. Using the NAHR mechanism to predict genomic disorders
The recognition of NAHR originated from the study of genomic disorders . It is thus exciting that our now greater understanding of NAHR mechanisms, combined with bioinformatic analyses of the human genome, allows the prediction of regions prone to genomic instability, thus uncovering novel genomic disorders.
First, where recurrent deletions mediated by LCRs have been observed, we can confidently predict the occurrence of reciprocal duplication at the same sites, and vice versa. In recent years, with the application of mechanistic insight, we have witnessed the defining of the Potocki-Lupski syndrome as the predicted reciprocal rearrangement of SMS, dup(22)(q11.2) as the reciprocal rearrangement of DG/VCFS, and dup(7)(q11.23) as the reciprocal rearrangement for Williams-Beuren syndrome deletion [50–55]. The above-mentioned sperm-typing data of Turner et al. further confirmed the co-existence of the reciprocal rearrangements by experiments, while pointing out that the reciprocal syndromes can have unequal frequencies compared with the prevalence of the deletion syndromes.
Our lab has reported a 5 Mb uncommon but recurrent deletion in six SMS patients, which utilized alternative LCRs as NAHR substrates . Although the reciprocal duplication of the common recurrent SMS deletion has been found in a number of cases and led to the definition of the PTLS syndrome, patients with the reciprocal duplication of the uncommon recurrent deletion have not yet been identified. It is thus of great interest that Turner et al.  did observe this duplication in their sperm assay, further underscoring the reciprocal nature of NAHR and affirming the anticipation that this duplication may also be found in patients. It should be pointed out that until now, we have only identified six uncommon recurrent deletions in our cohort of SMS patients; if the frequency of the reciprocal duplication is half that of the deletion, patients with the uncommon duplication should be even more rare.
Furthermore, the NAHR mechanisms based on LCRs have also led to the finding of a number of new genomic disorders. The majority of DG/VCFS patients have either a common 3 Mb or an atypical 1.5 Mb deletion on 22q11.2 mediated by LCR22-2 and LCR22-4, or LCR22-3a and LCR22-4, respectively ([38, 54, 57] and the references therein). The architecture of 22q11.2, however, also harbors additional LCR22s . It was thus anticipated that recombinations mediated by other LCRs might also occur in this region. Indeed, using array comparative genomic hybridization (aCGH) techniques, Ben-Shachar et al. found six deletions mediated by LCR22-4, -5 and -6 . These deletions are distal from the common DG/VCFS deletions and the patients have phenotypes overlapping with but distinct from DG/VCFS. These deletions were defined as the 22q11.2 distal deletion syndrome (MIM611867), a new genomic disorder https://decipher.sanger.ac.uk/perl/application?action=syndromes;syndrome_id=32. The reciprocal duplications of these distal deletions have also been reported .
Also applying the principles of NAHR mediated by LCR, Sharp et al. [37, 58] predicted microdeletion/microduplication rearrangements in new chromosomal loci that were previously not known to cause genomic syndromes. The authors  created a map of potential 'rearrangement hotspots' of the human genome, by localizing 130 sites of paired LCRs (SD) that are ≥ 10 kb in length, show ≥ 95% sequence identity and are separated by 50 kb to 10 Mb of intervening sequence . A specific bacterial artificial chromosome (BAC) array was then designed including BAC clones interrogating each of these 130 NAHR candidate sites . After ruling out the basal level of copy number polymorphisms in these sites by hybridizing a control population of 316 individuals , the authors analyzed the genomes of 290 idiopathic mental retardation patients and found deletions in four chromosomal loci (17q21.31, 1q21.1, 15q13, and 15q24) that are likely sites of recurrent rearrangements . Three of the rearrangements were indeed identified as new microdeletion syndromes, with further cases found in other populations [58–60].
The microdeletion syndrome involving 17q21.31 was also identified by two other groups with a traditional systematic whole-genome array assaying individuals with idiopathic mental retardation [61–63]. In another study, the candidate NAHR loci array of Sharp et al. was used to assess 155 fetuses with congenital anomalies and identified a deletion involving 17q12 in a fetus with dysplastic kidneys . They extended their study to include additional cohorts of patients and found that the deletion is also associated with congenital renal abnormalities and diabetes. The deletions are all in the range of under 1 Mb to 4 Mb in size, thus below the limit of the resolution of traditional cytogenetic detection [59, 61, 63].
Interestingly, the reciprocal duplication of the microdeletion in 17q12 (mediated by the same LCRs) was identified in two individuals with mental retardation and/or epilepsy . The reciprocal duplication of the 15q13 deletion has also been identified in a healthy person  and the reciprocal duplication of the 17q21.31 deletion was reported in a patient with phenotypes including severe psychomotor developmental delay and facial dysmorphism . The duplications corresponding to the remaining microdeletions will probably also be identified soon, although it is not known yet what kind of phenotypes will be related to them.
Some simple non-recurrent rearrangements can occur via NHEJ
NHEJ is one of the two major mechanisms used by eukaryotic cells to repair DSB and has been described in organisms from bacteria to mammals [66–68]. NHEJ is routinely utilized by human cells to repair both 'physiological' DSBs, such as in V(D)J recombinations, and 'pathological' DSBs, such as those caused by ionizing radiation or reactive oxygen species. Inherited defects in NHEJ account for about 15% of human severe combined immunodeficiency (SCID) . NHEJ is also currently considered to be the major mechanism rejoining translocated chromosomes in cancer .
NHEJ proceeds in four steps (Figure 3a): detection of DSB; molecular bridging of both broken DNA ends; modification of the ends to make them compatible and ligatable; and the final ligation step . This process determines the two important characteristics of NHEJ: first, neither LCRs nor MEPS are obligatorily required for NHEJ; and second, NHEJ leaves an 'information scar'  at the rejoining site as the pre-rejoining editing of the ends includes cleavage or addition of several nucleotides from or to the ends .
Nobile et al. and Toffolati et al. [72, 73] sequenced the breakpoints of 19 patients with muscular dystrophy due to non-recurrent deletions in introns 47 and 48 of the DMD gene. These deletions were not flanked by LCRs and the junctions showed microhomology (2 to 4 nucleotides) in seven cases, short insertions (1 to 5 nucleotides) in three cases and short duplications of surrounding fragments up to 25 bp in three cases. Other junctions either contained short sequences of unknown origin or did not show any microhomology, which might be due to the editing process in NHEJ. These events thus fit well with the features of the NHEJ mechanism. Remarkably,16 of the 38 (42%) breakpoints in these two publications fell within repetitive elements such as LTR, LINE, Alu, MIR and MER2 DNA elements; also, sequence motifs known to be capable of causing DSB or curving DNA, such as TTTAAA, are present in proximity to many of these junctions [72, 73].
Inoue et al. identified two apparently NHEJ-mediated deletions of the PLP1 (proteolipid protein) gene in Xq22 in patients with Pelizaeus-Merzbacher disease (PMD; MIM312080). Breakpoint analysis showed 12 base pair and 34 base pair sequences of unknown origin at the junction . Interestingly, the distal breakpoints of both deletions were located in a 32 kb LCR termed LCR-PMDB . Shaw and Lupski reported two non-recurrent SMS deletions apparently caused by NHEJ; the proximal breakpoints of both deletions are localized in an LCR (the proximal SMS-REP) . One of them occurred within a MER5B transposon element in the SMS-REP, while the other was located in proximity to a MIR3 element and an L2 LINE sequence. The distal breakpoint of the latter deletion was localized between an LIMC4 LINE element and an AluSc element . Many breakpoints of 17p translocations and other unusual-sized deletions also occurred within LCRs . Consistent with the finding of repetitive and DNA breaking elements at the NHEJ breakpoints by Toffolatti et al. and Nobile et al., the locations of the PLP1 deletions and SMS deletions as well as the 17p translocation and deletion breakpoints map within the LCRs and are close to other repetitive DNA elements. These findings suggest that although NHEJ is not directly mediated by nor strictly dependent on certain genomic architectural elements in the way that NAHR is dependent on LCRs, it may still be stimulated and regulated by the genomic architecture [4, 76].
Combined with the DSB homologous repair (HR) as a two-step mechanism, NHEJ was also used to explain duplications [77, 78]. Woodward et al. and Lee et al. observed non-recurrent duplications in the PLP1 region in the majority of PMD patients; these duplications are non-recurrent although some of them do show breakpoint grouping (not clustering) at one end (Figure 1c) [77, 78]. Most of the duplications are tandem in orientation. Padiath et al. observed similar non-recurrent tandem duplications in the LMNB1 (coding for Lamin B1) region in subjects with autosomal dominant leukodystrophy . The junctions sometimes show microhomology [77, 79], and sometimes have insertions of one to six nucleotides [77, 78]. Woodward et al. and Lee et al. proposed that in the first step of the rearrangement, a single DSB occurred in one strand; one of the broken ends then invaded and copied from the sister chromatid and caused the duplication. The ends were then rejoined via NHEJ [77, 78].
A DNA replication-based mechanism FoSTeS can account for complex genomic rearrangements
The study of rearrangement mechanisms obviously benefits from the development of new techniques to observe the rearrangements and breakpoints with a higher resolution. In the past, fluorescence in situ hybridization (FISH) has defined the duplications and deletions with resolution to about one BAC clone (150 to 200 kb) and accelerated the discovery of NAHR and NHEJ mechanisms. Recently, the advent of array-based CGH [reviewed in [80, 81]] has provided an unprecedented ability to observe the often complex details of genomic rearrangements, and has led to the proposal of the DNA replication-based FoSTeS model as the third major mechanism for human genomic rearrangements .
Lee et al. used a 44 K Agilent custom array to study the genomic region surrounding PLP1 in PMD patients . This array, with resolution of almost two interrogating oligonucleotides each kb, enabled the observation of non-recurrent rearrangements in PMD patients that were more complicated than simple duplication or deletion. The apparent duplications initially observed by FISH are often actually interrupted by triplicated or deleted fragments, or fragments with normal copy numbers. Subsequent mapping of breakpoints revealed further complexity of these rearrangements by showing that some of the fragments are inverted or translocated to another region. Microhomology of two to five nucleotides was found at each sequenced breakpoint junction . One of the PMD cases resulting from FoSTeS-mediated complex rearrangement of the PLP1 locus is shown in Figure 4.
It is difficult to explain this complexity by either the NAHR or NHEJ recombination mechanisms. Inspired by the findings in Escherichia coli , Lee et al. proposed the replication Fork Stalling and Template Switching (FoSTeS) Model (Figure 3b). According to this model, during DNA replication, the DNA replication fork stalls at one position, the lagging strand disengages from the original template, transfers and then anneals, by virtue of microhomology at the 3' end, to another replication fork in physical proximity (not necessarily adjacent in primary sequence), 'primes', and restarts the DNA synthesis . The invasion and annealing depends on the microhomology between the invaded site and the original site. Upon annealing, the transferred strand primes its own template-driven extension at the transferred fork. This priming results in a 'join point' rather than a breakpoint, signified by a transition from one segment of the genome to another – the template-driven juxtaposition of genomic sequences. Switching to another fork located downstream (forward invasion) would result in a deletion, whereas switching to a fork located upstream (backward invasion) results in a duplication. Depending on whether the lagging or leading strand in the new fork was invaded and copied, and the direction of the fork progression, the erroneously incorporated fragment from the new replication fork would be in direct or inverted orientation to its original position. This procedure of disengaging, invading/annealing and synthesis/extension could occur multiple times in series (that is, FoSTeS × 2, FoSTeS × 3, and so on) (Figure 5), likely reflecting the poor processivity of the involved DNA polymerase, and causing the observed complex rearrangements.
Array CGH data on several other genomic regions, including the SMS/PTLS locus [50, 84] (Lupski Lab, manuscript in preparation) and the MECP2 locus [85–88] have confirmed the complex nature of many other non-recurrent rearrangements, some of which were thought to be simple deletion or tandem duplication before the oligoarray technique was available. Likewise, the FoSTeS mechanism can potentially explain some of the complex rearrangements observed at the DMD locus . The FoSTeS model is currently the only major rearrangement mechanism that could explain these complex rearrangements. Furthermore, some complex chromosome rearrangements (CCR) unveiled by recent cytogenetic data can also be explained by FoSTeS . Intriguingly, some tandem duplications in the PLP1 and LMNB region [77, 79] which were previously explained by a model combining HR and NHEJ, especially those with microhomology at the junction [77, 79], can be more parsimoniously explained by the FoSTeS model including the strand switching template only once (FoSTeS × 1).
Interestingly, similar to the PLP1 region, the SMS/PTLS and MECP2 regions were also found to have very complex genomic architecture with multiple LCRs [1, 85, 86]. These LCRs, although they do not mediate FoSTeS directly, might be able to bring replication forks together to facilitate the replication fork switching event. Furthermore, highly enriched Alu repeats and high GC-content sequences were observed in proximity to the MECP2 complex recombination region . So, like NAHR and NHEJ, FoSTeS is probably also influenced by the local genomic architecture. Unlike NAHR or NHEJ, FoSTeS rearrangement is currently based on the translocation of the end of a single nascent strand, so the genomic architectures facilitating FoSTeS may function via a mechanism that does not involve DSB intermediates. Nevertheless, a microhomology-mediated break-induced replication (MMBIR) model has also been proposed, in which the rearrangement is initiated by a single-end double-strand DNA break resulting from a collapsed replication fork (Hastings et al. personal communication). As more and more sophisticated array techniques are being used in more and more laboratories, we look forward to the discovery of more complex rearrangements and using them to further verify and modify the current FoSTeS model.
Some gross genomic rearrangements and small-scale gene mutations might share similar mechanisms
The most significant difference between FoSTeS and the other two rearrangement mechanisms (NAHR, NHEJ) is that it is a replication-based mechanism; the rearrangement is induced by errors in the replication procedure. It has been thought that small monogenic genetic mutations often reflect errors of DNA replication and/or repair , whereas genomic rearrangements are thought to be caused by other mechanisms induced by or associated with structural features (genomic architecture) of the local genomic region . The FoSTeS mechanism suggests that large genomic rearrangement involving thousands or even millions of DNA base pairs can be due to replication errors as well, perhaps also stimulated by local genome architecture such as cruciforms (Figure 6).
Chen and colleagues [90–93] studied the breakpoints of 'smaller' DNA rearrangements (between 21 bp and up to 10 kb) including duplications, deletions, insertions, and inversions collected in the Human Gene Mutation Database (HGMD) . They found that many of them have a complex nature (similar to the complex nature of the 'large' rearrangements now being observed using array CGH), instead of being simple duplications and deletions. They proposed the serial replication slippage (SRS) model to explain these complex gene mutations. The SRS model is an extension of the classical replication slippage model ; it assumes that the 3' end of the nascent strand could dissociate from the original template and invade other templates on the basis of microhomology. Depending on whether the strand slippage occurs forwards or backwards, the nascent strand will have a deletion or duplication. Making use of reversed repeats, the nascent strand can also invade in the reverse orientation and thus incorporate an inverted segment. The slippage can happen serially, creating the complex rearrangements Chen et al. observed of small sizes between 21 bp and several kb.
The SRS model proposed for small gene mutations shares some general features with the FoSTeS model proposed for the larger rearrangements. Both models assume serial replication slippage, and both stress the importance of the genomic architectural elements such as palindromic DNA, stem-loop structures, repeats and so on, which may facilitate the initial stalling of the replication fork. While the SRS model assumes that replication slippage occurs on closely adjacent sites (possibly inside the same replication fork) and causes DNA rearrangements of small sizes, the FoSTeS model emphasizes that the template switch can occur over long distances (120 kb to 550 kb observed to date) to another replication fork (given the spatial closeness of the two forks) and cause DNA rearrangements on a much larger scale. Furthermore, FoSTeS × 1 could explain deletion and duplication events previously proposed to occur via NHEJ, in a way similar to the explanation of small deletions and duplication using the SRS model; the observed microhomology at the join point reflecting the priming event rather than a recombination/repair process. It is interesting to realize that although we have been talking about monogenic (often small) and genomic (often large) rearrangements in different contexts, some of them apparently have similar complexity and might be caused by very similar mechanisms.
NAHR was the first major DNA rearrangement mechanism identified to cause genomic disorders. NAHR occurs during both meiosis and mitosis and it requires two LCRs with sufficient length of high homology to act as recombination substrates (Figures 2 and 6). Based upon the principles or 'rules' elucidated by studies of this mechanism, new genomic disorders have been successfully predicted and uncovered. Although this LCR-based prominent theme of NAHR remains the same, recent research has shown that some details of NAHR mechanism, such as the frequency of the recombination and the length requirement of homology between the LCRs, can differ between males and females and between meiosis and mitosis.
NHEJ and FoSTeS were later employed to explain other genomic rearrangements. Both models are still awaiting more data for further elucidation and modification. FoSTeS is a unique mechanism compared with NAHR and NHEJ, especially in that it is a replication-based rearrangement pathway and does not necessarily rely on the pre-formation of DSB. Although still very limited, our preliminary data imply that FoSTeS might be a major mechanism for duplication CNV and thus a major driver of the Ohno 'gene duplication/divergence' evolutionary hypothesis . Indeed, FoSTeS might also have been the driving force in the origin of the LCRs in the human genome. It is well known that DNA polymerases have an intrinsic error rate leading to base substitution, a fact which is central to genome stability, disease origins and evolution of species. It is tempting to speculate that there may be an endogenous polymerase error rate for FoSTeS as well, analogous to the base substitution error rate. A related question would be whether or not disorders that are frequently sporadic and occur via FoSTeS are associated with advanced paternal age, as are point mutations that are due to DNA replication errors . It has been proposed that carriers of hereditary non-polyposis colon cancer (HNPCC, MIM120435) with mutations in genes involved in the DNA mismatch repair pathway may be more susceptible to somatic genome rearrangements caused by NAHR events . One could also hypothesize that some other individuals could be more prone to genomic rearrangements mediated by FoSTeS because of mutations/functional polymorphisms in the DNA replication machinery.
It has been clearly shown that both NHEJ and FoSTeS can be indeed stimulated by local genomic architecture, but no direct association of specific DNA elements with either model (such as LCRs associated with NAHR) has been experimentally identified. It is an interesting question to which degree NHEJ and FoSTeS are structurally determined or enhanced by specific genome architecture and whether some day we may be able to predict regions of human genome instability caused by NHEJ and FoSTeS events, as we have predicted NAHR events and the related genomic disorders. Currently limited data suggest that a palindrome or cruciform may stimulate FoSTeS (Figure 6).
There are still many unsolved, exciting questions regarding the mechanisms of human genomic rearrangements in general. Evidence is emerging that genomic rearrangements, despite their likely common basic mechanisms, might be differently regulated between germ line and somatic cells, between embryogenesis and adulthood, and between cancer cells, stem cells, and differentiated cells [98, 99]. It is well known that other genome activities (such as transcription) can be fundamentally different in different cellular settings. It is thus tempting to relate the differences in genomic arrangements within these developmental contexts and cellular environments to the differences of other genome-involving processes, and to ask the question of whether there is an interaction or some kind of crosstalk between genomic rearrangement and other cellular processes. We know that NHEJ rearrangements are physiologically relevant in generating antibody diversity ; are there other 'programmed' rearrangements including inversions  which are employed in the development or regulation of other biological events? Finally, are there other mechanisms for genomic rearrangements in addition to the three discussed in this review?
For the latter question, some data are starting to emerge from two genome-wide structural variation studies. Korbel et al.  and Kidd et al.  used the paired-end-mapping (PEM)  and the fosmid-based end-sequencing-pair (ESP)  methods respectively, to systematically identify structural variants (SVs) in human genomes. Korbel et al. identified 1297 SVs including 853 deletions, 322 insertions and 122 inversions, and sequenced the breakpoints of 188 SV indels and 14 inversions. It is very interesting that almost all of the SVs bear signatures of either NAHR (surrounded by LCRs or repetitive sequences such as SINEs, LINEs), NHEJ or FoSTeS (microhomology at the junction), or retrotranspositions (mostly L1 elements). (Retrotransposition causes rearrangements in the genome via RNA-mediated mechanisms and is not the subject of this review.) Very few SVs do not fall into any of the three categories (Korbel, personal communications). Kidd et al. inferred mechanisms from breakpoints analysis for 227 SV indels and 34 inversions, and similarly identified evidence for NAHR, NHEJ or FoSTeS mechanisms. There are differences between the results of the two papers. The calculated ratio of NAHR-mediated events in SV indels, for example, is 14% according to Korbel et al., but much higher (39%) in Kidd et al. These differences may be due to the differences in their methodology or design; that of Kidd et al. is likely more efficient in detecting larger variations. Nevertheless, it seems that the three major rearrangement mechanisms – NAHR, NHEJ and FoSTeS – can explain the majority of the DNA rearrangements occurring in our genomes.
It is also of interest that the sequence analysis of both studies indicated that a portion of NAHR events utilize repetitive elements (SINEs, LINEs, LTRs), rather than LCRs as homology substrates. This finding is consistent with our previous data  showing that some non-recurrent deletions of SMS patients can be mediated by NAHR between Alu sequences. These Alus are from the evolutionarily youngest subfamilies AluS and AluY, and share a high degree of homology with each other. This homology apparently fulfills the conditions for MEPS and is enough to enable occasional non-allelic homology mediated recombination between two Alu sequences. However, the length of homology between two Alu sequences is much shorter than that between two usual LCRs, which may explain the lower frequency of the Alu-mediated recombination events than the LCR-mediated NAHRs.
Both PEM and ESP are based on the sequencing of small fragments (~3 kb for PEM and up to 40 kb for ESP) of the individual genomes and then comparing the distance between both ends of the fragments with the value of the reference genome. It should be noted that large duplications that can not be spanned by these small fragments might be underrepresented in the SVs identified by PEM and ESP because of the design of the methodology. Furthermore, these approaches: (i) may not readily detect complex genomic rearrangements, and (ii) the computational "filtering" accompanying the match of shotgun and short sequence reads to the reference genome may result in lack of identification of breakpoint sequences. On the other hand, this strategy is very powerful in identifying DNA sequence read information at the breakpoints of the deletion and inversion SVs. Future developments of even more sophisticated and sensitive genome-wide assay technologies will provide a more extensive overview of the structural variants in our genome and greatly facilitate the research on the mechanisms for CNV and other genomic rearrangements.
Lupski JR, Stankiewicz P: Genomic disorders: molecular mechanisms for rearrangements and conveyed phenotypes. PLoS Genet. 2005, 1: e49-10.1371/journal.pgen.0010049.
Lupski JR: Genomic disorders: structural features of the genome can lead to DNA rearrangements and human disease traits. Trends Genet. 1998, 14: 417-422. 10.1016/S0168-9525(98)01555-8.
Lupski JR, Stankiewicz P: Genomic Disorders. 2006, Totowa, New Jersey: Humana Press
Shaw CJ, Lupski JR: Implications of human genome architecture for rearrangement-based disorders: the genomic basis of disease. Hum Mol Genet. 2004, 13 (Spec No 1): R57-64. 10.1093/hmg/ddh073.
Stankiewicz P, Lupski JR: Genome architecture, rearrangements and genomic disorders. Trends Genet. 2002, 18: 74-82. 10.1016/S0168-9525(02)02592-1.
Bailey JA, Eichler EE: Primate segmental duplications: crucibles of evolution, diversity and disease. Nat Rev Genet. 2006, 7: 552-564. 10.1038/nrg1895.
Reiter LT, Murakami T, Koeuth T, Pentao L, Muzny DM, Gibbs RA, Lupski JR: A recombination hotspot responsible for two inherited peripheral neuropathies is located near a mariner transposon-like element. Nat Genet. 1996, 12: 288-297. 10.1038/ng0396-288.
López-Correa C, Dorschner M, Brems H, Lázaro C, Clementi M, Upadhyaya M, Dooijes D, Moog U, Kehrer-Sawatzki H, Rutkowski JL, Fryns JP, Marynen P, Stephens K, Legius E: Recombination hotspot in NF1 microdeletion patients. Hum Mol Genet. 2001, 10: 1387-1392. 10.1093/hmg/10.13.1387.
Kurotaki N, Stankiewicz P, Wakui K, Niikawa N, Lupski JR: Sotos syndrome common deletion is mediated by directly oriented subunits within inverted Sos-REP low-copy repeats. Hum Mol Genet. 2005, 14: 535-542. 10.1093/hmg/ddi050.
Bi W, Park SS, Shaw CJ, Withers MA, Patel PI, Lupski JR: Reciprocal crossovers and a positional preference for strand exchange in recombination events resulting in deletion or duplication of chromosome 17p11.2. Am J Hum Genet. 2003, 73: 1302-1315. 10.1086/379979.
Lupski JR: Hotspots of homologous recombination in the human genome: not all homologous sequences are equal. Genome Biol. 2004, 5: 242-10.1186/gb-2004-5-10-242.
Wells RD: Non-B DNA conformations, mutagenesis and disease. Trends Biochem Sci. 2007, 32: 271-278. 10.1016/j.tibs.2007.04.003.
Greenawalt DM, Cui X, Wu Y, Lin Y, Wang HY, Luo M, Tereshchenko IV, Hu G, Li JY, Chu Y, Azaro MA, Decoste CJ, Chimge NO, Gao R, Shen L, Shih WJ, Lange K, Li H: Strong correlation between meiotic crossovers and haplotype structure in a 2.5-Mb region on the long arm of chromosome 21. Genome Res. 2006, 16: 208-214. 10.1101/gr.4641706.
Jeffreys AJ, Neumann R, Panayi M, Myers S, Donnelly P: Human recombination hot spots hidden in regions of strong marker association. Nat Genet. 2005, 37: 601-606. 10.1038/ng1565.
Tiemann-Boege I, Calabrese P, Cochran DM, Sokol R, Arnheim N: High-resolution recombination patterns in a region of human chromosome 21 measured by sperm typing. PLoS Genet. 2006, 2: e70-10.1371/journal.pgen.0020070.
Raedt TD, Stephens M, Heyns I, Brems H, Thijs D, Messiaen L, Stephens K, Lazaro C, Wimmer K, Kehrer-Sawatzki H, Vidaud D, Kluwe L, Marynen P, Legius E: Conservation of hotspots for recombination in low-copy repeats associated with the NF1 microdeletion. Nat Genet. 2006, 38: 1419-1423. 10.1038/ng1920.
Lindsay SJ, Khajavi M, Lupski JR, Hurles ME: A chromosomal rearrangement hotspot can be identified from population genetic variation and is coincident with a hotspot for allelic recombination. Am J Hum Genet. 2006, 79: 890-902. 10.1086/508709.
Turner DJ, Miretti M, Rajan D, Fiegler H, Carter NP, Blayney ML, Beck S, Hurles ME: Germline rates of de novo meiotic deletions and duplications causing several genomic disorders. Nat Genet. 2008, 40: 90-95. 10.1038/ng.2007.40.
Lupski JR: Genomic rearrangements and sporadic disease. Nat Genet. 2007, 39 (Suppl 7): S43-47. 10.1038/ng2084.
Lupski JR, de Oca-Luna RM, Slaugenhaupt S, Pentao L, Guzzetta V, Trask BJ, Saucedo-Cardenas O, Barker DF, Killian JM, Garcia CA, Chakravarti A, Patel PI: DNA duplication associated with Charcot-Marie-Tooth disease type 1A. Cell. 1991, 66: 219-232. 10.1016/0092-8674(91)90613-4.
Raeymaekers P, Timmerman V, Nelis E, De Jonghe P, Hoogendijk JE, Baas F, Barker DF, Martin JJ, De Visser M, Bolhuis PA, Van Broeckhoven C, HMSN Collaborative Research Group: Duplication in chromosome 17p11.2 in Charcot-Marie-Tooth neuropathy type 1a (CMT 1a). The HMSN Collaborative Research Group. Neuromuscul Disord. 1991, 1: 93-97. 10.1016/0960-8966(91)90055-W.
Lupski JR, Timmerman V: The CMT1A duplication: a historical perspective viewed from two sides of an ocean. Edited by: Lupski JR, Stankiewicz P. 2006, Totowa, New Jersey: Humana Press, 3-17.
Darai-Ramqvist E, Sandlund A, Müller S, Klein G, Imreh S, Kost-Alimova M: Segmental duplications and evolutionary plasticity at tumor chromosome break-prone regions. Genome Res. 2008, 18: 370-379. 10.1101/gr.7010208.
Fridlyand J, Snijders AM, Ylstra B, Li H, Olshen A, Segraves R, Dairkee S, Tokuyasu T, Ljung BM, Jain AN, McLennan J, Ziegler J, Chin K, Devries S, Feiler H, Gray JW, Waldman F, Pinkel D, Albertson DG: Breast tumor copy number aberration phenotypes and genomic instability. BMC Cancer. 2006, 6: 96-10.1186/1471-2407-6-96.
Lam KW, Jeffreys AJ: Processes of copy-number change in human DNA: the dynamics of α-globin gene deletion. Proc Natl Acad Sci USA. 2006, 103: 8921-8927. 10.1073/pnas.0602690103.
Lam KW, Jeffreys AJ: Processes of de novo duplication of human α-globin genes. Proc Natl Acad Sci USA. 2007, 104: 10950-10955. 10.1073/pnas.0703856104.
Flores M, Morales L, Gonzaga-Jauregui C, Domínguez-Vidaña R, Zepeda C, Yañez O, Gutiérrez M, Lemus T, Valle D, Avila MC, Blanco D, Medina-Ruiz S, Meza K, Ayala E, García D, Bustos P, González V, Girard L, Tusie-Luna T, Dávila G, Palacios R: Recurrent DNA inversion rearrangements in the human genome. Proc Natl Acad Sci USA. 2007, 104: 6099-6106. 10.1073/pnas.0701631104.
Bruder CE, Piotrowski A, Gijsbers AA, Andersson R, Erickson S, de Ståhl TD, Menzel U, Sandgren J, von Tell D, Poplawski A, Crowley M, Crasto C, Partridge EC, Tiwari H, Allison DB, Komorowski J, van Ommen GJ, Boomsma DI, Pedersen NL, den Dunnen JT, Wirdefeldt K, Dumanski JP: Phenotypically concordant and discordant monozygotic twins display different DNA copy-number-variation profiles. Am J Hum Genet. 2008, 82: 763-771. 10.1016/j.ajhg.2007.12.011.
Steinmann K, Cooper DN, Kluwe L, Chuzhanova NA, Senger C, Serra E, Lazaro C, Gilaberte M, Wimmer K, Mautner VF, Kehrer-Sawatzki H: Type 2 NF1 deletions are highly unusual by virtue of the absence of nonallelic homologous recombination hotspots and an apparent preference for female mitotic recombination. Am J Hum Genet. 2007, 81: 1201-1220. 10.1086/522089.
Dempsey MA, Schwartz S, Waggoner DJ: Mosaicism del(22)(q11.2q11.2)/dup(22)(q11.2q11.2) in a patient with features of 22q11.2 deletion syndrome. Am J Med Genet A. 2007, 143: 1082-1086.
Barbouti A, Stankiewicz P, Nusbaum C, Cuomo C, Cook A, Höglund M, Johansson B, Hagemeijer A, Park SS, Mitelman F, Lupski JR, Fioretos T: The breakpoint region of the most common isochromosome, i(17q), in human neoplasia is characterized by a complex genomic architecture with large, palindromic, low-copy repeats. Am J Hum Genet. 2004, 74: 1-10. 10.1086/380648.
Mendrzyk F, Korshunov A, Toedt G, Schwarz F, Korn B, Joos S, Hochhaus A, Schoch C, Lichter P, Radlwimmer B: Isochromosome breakpoints on 17p in medulloblastoma are flanked by different classes of DNA sequence repeats. Genes Chromosomes Cancer. 2006, 45: 401-410. 10.1002/gcc.20304.
Carvahlo CM, Lupski JR: Copy number variation at the breakpoint region of the most common isochromosome i(17q) in human neoplasia. Genome Res. 2008, Epub ahead of publication
Waldman AS, Liskay RM: Dependence of intrachromosomal recombination in mammalian cells on uninterrupted homology. Mol Cell Biol. 1988, 8: 5350-5357.
Rubnitz J, Subramani S: The minimum amount of homology required for homologous recombination in mammalian cells. Mol Cell Biol. 1984, 4: 2253-2258.
Reiter LT, Hastings PJ, Nelis E, De Jonghe P, Van Broeckhoven C, Lupski JR: Human meiotic recombination products revealed by sequencing a hotspot for homologous strand exchange in multiple HNPP deletion patients. Am J Hum Genet. 1998, 62: 1023-1033. 10.1086/301827.
Sharp AJ, Locke DP, McGrath SD, Cheng Z, Bailey JA, Vallente RU, Pertz LM, Clark RA, Schwartz S, Segraves R, Oseroff VV, Albertson DG, Pinkel D, Eichler EE: Segmental duplications and copy-number variation in the human genome. Am J Hum Genet. 2005, 77: 78-88. 10.1086/431652.
McDermid HE, Morrow BE: Genomic disorders on 22q11. Am J Hum Genet. 2002, 70: 1077-1088. 10.1086/340363.
Scambler PJ: The 22q11 deletion syndromes. Hum Mol Genet. 2000, 9: 2421-2426. 10.1093/hmg/9.16.2421.
Shaw CJ, Bi W, Lupski JR: Genetic proof of unequal meiotic crossovers in reciprocal deletion and duplication of 17p11.2. Am J Hum Genet. 2002, 71: 1072-1081. 10.1086/344346.
López Correa C, Brems H, Lázaro C, Marynen P, Legius E: Unequal meiotic crossover: a frequent cause of NF1 microdeletions. Am J Hum Genet. 2000, 66: 1969-1974. 10.1086/302920.
Saitta SC, Harris SE, Gaeth AP, Driscoll DA, McDonald-McGinn DM, Maisenbacher MK, Yersak JM, Chakraborty PK, Hacker AM, Zackai EH, Ashley T, Emanuel BS: Aberrant interchromosomal exchanges are the predominant cause of the 22q11.2 deletion. Hum Mol Genet. 2004, 13: 417-428. 10.1093/hmg/ddh041.
Trost D, Wiebe W, Uhlhaas S, Schwindt P, Schwanitz G: Investigation of meiotic rearrangements in DGS/VCFS patients with a microdeletion 22q11.2. J Med Genet. 2000, 37: 452-454. 10.1136/jmg.37.6.452.
Bayés M, Magano LF, Rivera N, Flores R, Pérez Jurado LA: Mutational mechanisms of Williams-Beuren syndrome deletions. Am J Hum Genet. 2003, 73: 131-151. 10.1086/376565.
Robinson WP, Dutly F, Nicholls RD, Bernasconi F, Peñaherrera M, Michaelis RC, Abeliovich D, Schinzel AA: The mechanisms involved in formation of deletions and duplications of 15q11-q13. J Med Genet. 1998, 35: 130-136.
Palau F, Löfgren A, De Jonghe P, Bort S, Nelis E, Sevilla T, Martin JJ, Vilchez J, Prieto F, Van Broeckhoven C: Origin of the de novo duplication in Charcot-Marie-Tooth disease type 1A: unequal nonsister chromatid exchange during spermatogenesis. Hum Mol Genet. 1993, 2: 2031-2035. 10.1093/hmg/2.12.2031.
Lopes J, Vandenberghe A, Tardieu S, Ionasescu V, Lévy N, Wood N, Tachi N, Bouche P, Latour P, Brice A, LeGuern E: Sex-dependent rearrangements resulting in CMT1A and HNPP. Nat Genet. 1997, 17: 136-137. 10.1038/ng1097-136.
Wirth B, Schmidt T, Hahnen E, Rudnik-Schöneborn S, Krawczak M, Müller-Myhsok B, Schönling J, Zerres K: De novo rearrangements found in 2% of index patients with spinal muscular atrophy: mutational mechanisms, parental origin, mutation rate, and implications for genetic counseling. Am J Hum Genet. 1997, 61: 1102-1111. 10.1086/301608.
Lázaro C, Gaona A, Ainsworth P, Tenconi R, Vidaud D, Kruyer H, Ars E, Volpini V, Estivill X: Sex differences in mutational rate and mutational mechanism in the NF1 gene in neurofibromatosis type 1 patients. Hum Genet. 1996, 98: 696-699. 10.1007/s004390050287.
Potocki L, Bi W, Treadwell-Deering D, Carvalho CM, Eifert A, Friedman EM, Glaze D, Krull K, Lee JA, Lewis RA, Mendoza-Londono R, Robbins-Furman P, Shaw C, Shi X, Weissenberger G, Withers M, Yatsenko SA, Zackai EH, Stankiewicz P, Lupski JR: Characterization of Potocki-Lupski syndrome (dup(17)(p11.2p11.2)) and delineation of a dosage-sensitive critical interval that can convey an autism phenotype. Am J Hum Genet. 2007, 80: 633-649. 10.1086/512864.
Potocki L, Chen KS, Park SS, Osterholm DE, Withers MA, Kimonis V, Summers AM, Meschino WS, Anyane-Yeboa K, Kashork CD, Shaffer LG, Lupski JR: Molecular mechanism for duplication 17p11.2- the homologous recombination reciprocal of the Smith-Magenis microdeletion. Nat Genet. 2000, 24: 84-87. 10.1038/71743.
Berg JS, Brunetti-Pierri N, Peters SU, Kang SH, Fong CT, Salamone J, Freedenberg D, Hannig VL, Prock LA, Miller DT, Raffalli P, Harris DJ, Erickson RP, Cunniff C, Clark GD, Blazo MA, Peiffer DA, Gunderson KL, Sahoo T, Patel A, Lupski JR, Beaudet AL, Cheung SW: Speech delay and autism spectrum behaviors are frequently associated with duplication of the 7q11.23 Williams-Beuren syndrome region. Genet Med. 2007, 9: 427-44.
Ensenauer RE, Adeyinka A, Flynn HC, Michels VV, Lindor NM, Dawson DB, Thorland EC, Lorentz CP, Goldstein JL, McDonald MT, Smith WE, Simon-Fayard E, Alexander AA, Kulharya AS, Ketterling RP, Clark RD, Jalal SM: Microduplication 22q11.2, an emerging syndrome: clinical, cytogenetic, and molecular analysis of thirteen patients. Am J Hum Genet. 2003, 73: 1027-1040. 10.1086/378818.
Ou Z, Berg JS, Yonath H, Enciso VB, Miller DT, Picker J, Lenzi T, Keegan CE, Sutton VR, Belmont J, Chinault AC, Lupski JR, Cheung SW, Roeder E, Patel A: Microduplications of 22q11.2 are frequently inherited and are associated with variable phenotypes. Genet Med. 2008, 10: 267-277.
Somerville MJ, Mervis CB, Young EJ, Seo EJ, del Campo M, Bamforth S, Peregrine E, Loo W, Lilley M, Pérez-Jurado LA, Morris CA, Scherer SW, Osborne LR: Severe expressive-language delay related to duplication of the Williams-Beuren locus. N Engl J Med. 2005, 353: 1694-1701. 10.1056/NEJMoa051962.
Shaw CJ, Withers MA, Lupski JR: Uncommon deletions of the Smith-Magenis syndrome region can be recurrent when alternate low-copy repeats act as homologous recombination substrates. Am J Hum Genet. 2004, 75: 75-8. 10.1086/422016.
Ben-Shachar S, Ou Z, Shaw CA, Belmont JW, Patel MS, Hummel M, Amato S, Tartaglia N, Berg J, Sutton VR, Lalani SR, Chinault AC, Cheung SW, Lupski JR, Patel A: 22q11.2 distal deletion: a recurrent genomic disorder distinct from DiGeorge syndrome and velocardiofacial syndrome. Am J Hum Genet. 2008, 82: 214-22. 10.1016/j.ajhg.2007.09.014.
Sharp AJ, Hansen S, Selzer RR, Cheng Z, Regan R, Hurst JA, Stewart H, Price SM, Blair E, Hennekam RC, Fitzpatrick CA, Segraves R, Richmond TA, Guiver C, Albertson DG, Pinkel D, Eis PS, Schwartz S, Knight SJ, Eichler EE: Discovery of previously unidentified genomic disorders from the duplication architecture of the human genome. Nat Genet. 2006, 38: 1038-1042. 10.1038/ng1862.
Sharp AJ, Mefford HC, Li K, Baker C, Skinner C, Stevenson RE, Schroer RJ, Novara F, De Gregori M, Ciccone R, Broomer A, Casuga I, Wang Y, Xiao C, Barbacioru C, Gimelli G, Bernardina BD, Torniero C, Giorda R, Regan R, Murday V, Mansour S, Fichera M, Castiglia L, Failla P, Ventura M, Jiang Z, Cooper GM, Knight SJ, Romano C, Zuffardi O, Chen C, Schwartz CE, Eichler EE: A recurrent 15q13.3 microdeletion syndrome associated with mental retardation and seizures. Nat Genet. 2008, 40: 322-328. 10.1038/ng.93.
Sharp AJ, Selzer RR, Veltman JA, Gimelli S, Gimelli G, Striano P, Coppola A, Regan R, Price SM, Knoers NV, Eis PS, Brunner HG, Hennekam RC, Knight SJ, de Vries BB, Zuffardi O, Eichler EE: Characterization of a recurrent 15q24 microdeletion syndrome. Hum Mol Genet. 2007, 16: 567-572. 10.1093/hmg/ddm016.
Koolen DA, Vissers LE, Pfundt R, de Leeuw N, Knight SJ, Regan R, Kooy RF, Reyniers E, Romano C, Fichera M, Schinzel A, Baumer A, Anderlid BM, Schoumans J, Knoers NV, van Kessel AG, Sistermans EA, Veltman JA, Brunner HG, de Vries BB: A new chromosome 17q21.31 microdeletion syndrome associated with a common inversion polymorphism. Nat Genet. 2006, 38: 999-1001. 10.1038/ng1853.
Lupski JR: Genome structural variation and sporadic disease traits. Nat Genet. 2006, 38: 974-976. 10.1038/ng0906-974.
Shaw-Smith C, Pittman AM, Willatt L, Martin H, Rickman L, Gribble S, Curley R, Cumming S, Dunn C, Kalaitzopoulos D, Porter K, Prigmore E, Krepischi-Santos AC, Varela MC, Koiffmann CP, Lees AJ, Rosenberg C, Firth HV, de Silva R, Carter NP: Microdeletion encompassing MAPT at chromosome 17q21.3 is associated with developmental delay and learning disability. Nat Genet. 2006, 38: 1032-1037. 10.1038/ng1858.
Mefford HC, Clauin S, Sharp AJ, Moller RS, Ullmann R, Kapur R, Pinkel D, Cooper GM, Ventura M, Ropers HH, Tommerup N, Eichler EE, Bellanne-Chantelot C: Recurrent reciprocal genomic rearrangements of 17q12 are associated with renal disease, diabetes, and epilepsy. Am J Hum Genet. 2007, 81: 1057-1069. 10.1086/522591.
Kirchhoff M, Bisgaard AM, Duno M, Hansen FJ, Schwartz M: A 17q21.31 microduplication, reciprocal to the newly described 17q21.31 microdeletion, in a girl with severe psychomotor developmental delay and dysmorphic craniofacial features. Eur J Med Genet. 2007, 50: 256-263. 10.1016/j.ejmg.2007.05.001.
Lieber MR, Ma Y, Pannicke U, Schwarz K: Mechanism and regulation of human non-homologous DNA end-joining. Nat Rev Mol Cell Biol. 2003, 4: 712-720. 10.1038/nrm1202.
Roth DB, Porter TN, Wilson JH: Mechanisms of nonhomologous recombination in mammalian cells. Mol Cell Biol. 1985, 5: 2599-2607.
Weterings E, van Gent DC: The mechanism of non-homologous end-joining: a synopsis of synapsis. DNA Repair (Amst). 2004, 3: 1425-1435. 10.1016/j.dnarep.2004.06.003.
Schwarz K, Ma Y, Pannicke U, Lieber MR: Human severe combined immune deficiency and DNA repair. Bioessays. 2003, 25: 1061-1070. 10.1002/bies.10344.
Lieber MR, Lu H, Gu J, Schwarz K: Flexibility in the order of action and in the enzymology of the nuclease, polymerases, and ligase of vertebrate non-homologous DNA end joining: relevance to cancer, aging, and the immune system. Cell Res. 2008, 18: 125-133. 10.1038/cr.2007.108.
Lieber MR: The mechanism of human nonhomologous DNA end joining. J Biol Chem. 2008, 283: 1-5. 10.1074/jbc.R700039200.
Nobile C, Toffolatti L, Rizzi F, Simionati B, Nigro V, Cardazzo B, Patarnello T, Valle G, Danieli GA: Analysis of 22 deletion breakpoints in dystrophin intron 49. Hum Genet. 2002, 110: 418-421. 10.1007/s00439-002-0721-7.
Toffolatti L, Cardazzo B, Nobile C, Danieli GA, Gualandi F, Muntoni F, Abbs S, Zanetti P, Angelini C, Ferlini A, Fanin M, Patarnello T: Investigating the mechanism of chromosomal deletion: characterization of 39 deletion breakpoints in introns 47 and 48 of the human dystrophin gene. Genomics. 2002, 80: 523-530. 10.1016/S0888-7543(02)96861-8.
Inoue K, Osaka H, Thurston VC, Clarke JT, Yoneyama A, Rosenbarker L, Bird TD, Hodes ME, Shaffer LG, Lupski JR: Genomic rearrangements resulting in PLP1 deletion occur by nonhomologous end joining and cause different dysmyelinating phenotypes in males and females. Am J Hum Genet. 2002, 71: 838-853. 10.1086/342728.
Shaw CJ, Lupski JR: Non-recurrent 17p11.2 deletions are generated by homologous and non-homologous mechanisms. Hum Genet. 2005, 116: 1-7. 10.1007/s00439-004-1204-9.
Stankiewicz P, Shaw CJ, Dapper JD, Wakui K, Shaffer LG, Withers M, Elizondo L, Park SS, Lupski JR: Genome architecture catalyzes nonrecurrent chromosomal rearrangements. Am J Hum Genet. 2003, 72: 1101-1116. 10.1086/374385.
Woodward KJ, Cundall M, Sperle K, Sistermans EA, Ross M, Howell G, Gribble SM, Burford DC, Carter NP, Hobson DL, Garbern JY, Kamholz J, Heng H, Hodes ME, Malcolm S, Hobson GM: Heterogeneous duplications in patients with Pelizaeus-Merzbacher disease suggest a mechanism of coupled homologous and nonhomologous recombination. Am J Hum Genet. 2005, 77: 966-987. 10.1086/498048.
Lee JA, Inoue K, Cheung SW, Shaw CA, Stankiewicz P, Lupski JR: Role of genomic architecture in PLP1 duplication causing Pelizaeus-Merzbacher disease. Hum Mol Genet. 2006, 15: 2250-2265. 10.1093/hmg/ddl150.
Padiath QS, Saigoh K, Schiffmann R, Asahara H, Yamada T, Koeppen A, Hogan K, Ptáèek LJ, Fu YH: Lamin B1 duplications cause autosomal dominant leukodystrophy. Nat Genet. 2006, 38: 1114-1123. 10.1038/ng1872.
Emanuel BS, Saitta SC: From microscopes to microarrays: dissecting recurrent chromosomal rearrangements. Nat Rev Genet. 2007, 8: 869-883. 10.1038/nrg2136.
Stankiewicz P, Beaudet AL: Use of array CGH in the evaluation of dysmorphology, malformations, developmental delay, and idiopathic mental retardation. Curr Opin Genet Dev. 2007, 17: 182-192. 10.1016/j.gde.2007.04.009.
Lee JA, Carvalho CM, Lupski JR: A DNA replication mechanism for generating nonrecurrent rearrangements associated with genomic disorders. Cell. 2007, 131: 1235-1247. 10.1016/j.cell.2007.11.037.
Slack A, Thornton PC, Magner DB, Rosenberg SM, Hastings PJ: On the mechanism of gene amplification induced under stress in Escherichia coli. PLoS Genet. 2006, 2: e48-10.1371/journal.pgen.0020048.
Vissers LE, Stankiewicz P, Yatsenko SA, Crawford E, Creswick H, Proud VK, de Vries BB, Pfundt R, Marcelis CL, Zackowski J, Bi W, van Kessel AG, Lupski JR, Veltman JA: Complex chromosome 17p rearrangements associated with low-copy repeats in two patients with congenital anomalies. Hum Genet. 2007, 121: 697-709. 10.1007/s00439-007-0359-6.
Bauters M, Van Esch H, Friez MJ, Boespflug-Tanguy O, Zenker M, Vianna-Morgante AM, Rosenberg C, Ignatius J, Raynaud M, Hollanders K, Govaerts K, Vandenreijt K, Niel F, Blanc P, Stevenson RE, Fryns JP, Marynen P, Schwartz CE, Froyen G: Non-recurrent MECP2 duplications mediated by genomic architecture-driven DNA breaks and break-induced replication repair. Genome Res. 2008, 18: 847-858. 10.1101/gr.075903.107.
del Gaudio D, Fang P, Scaglia F, Ward PA, Craigen WJ, Glaze DG, Neul JL, Patel A, Lee JA, Irons M, Berry SA, Pursley AA, Grebe TA, Freedenberg D, Martin RA, Hsich GE, Khera JR, Friedman NR, Zoghbi HY, Eng CM, Lupski JR, Beaudet AL, Cheung SW, Roa BB: Increased MECP2 gene copy number as the result of genomic duplication in neurodevelopmentally delayed males. Genet Med. 2006, 8: 784-792.
Meins M, Lehmann J, Gerresheim F, Herchenbach J, Hagedorn M, Hameister K, Epplen JT: Submicroscopic duplication in Xq28 causes increased expression of the MECP2 gene in a boy with severe mental retardation and features of Rett syndrome. J Med Genet. 2005, 42: e12-10.1136/jmg.2004.023804.
Van Esch H, Bauters M, Ignatius J, Jansen M, Raynaud M, Hollanders K, Lugtenberg D, Bienvenu T, Jensen LR, Gécz J, Moraine C, Marynen P, Fryns JP, Froyen G: Duplication of the MECP2 region is a frequent cause of severe mental retardation and progressive neurological symptoms in males. Am J Hum Genet. 2005, 77: 442-453. 10.1086/444549.
Zhang Z, Takeshima Y, Awano H, Nishiyama A, Okizuka Y, Yagi M, Matsuo M: Tandem duplications of two separate fragments of the dystrophin gene in a patient with Duchenne muscular dystrophy. J Hum Genet. 2008, 53: 215-219. 10.1007/s10038-007-0235-1.
Chen JM, Chuzhanova N, Stenson PD, Férec C, Cooper DN: Intrachromosomal serial replication slippage in trans gives rise to diverse genomic rearrangements involving inversions. Hum Mutat. 2005, 26: 362-373. 10.1002/humu.20230.
Chen JM, Chuzhanova N, Stenson PD, Férec C, Cooper DN: Complex gene rearrangements caused by serial replication slippage. Hum Mutat. 2005, 26: 125-134. 10.1002/humu.20202.
Chen JM, Chuzhanova N, Stenson PD, Férec C, Cooper DN: Meta-analysis of gross insertions causing human genetic disease: novel mutational mechanisms and the role of replication slippage. Hum Mutat. 2005, 25: 207-221. 10.1002/humu.20133.
Sheen CR, Jewell UR, Morris CM, Brennan SO, Férec C, George PM, Smith MP, Chen JM: Double complex mutations involving F8 and FUNDC2 caused by distinct break-induced replication. Hum Mutat. 2007, 28: 1198-1206. 10.1002/humu.20591.
Stenson PD, Ball EV, Mort M, Phillips AD, Shiel JA, Thomas NS, Abeysinghe S, Krawczak M, Cooper DN: Human Gene Mutation Database (HGMD): 2003 update. Hum Mutat. 2003, 21: 577-581. 10.1002/humu.10212.
Streisinger G, Okada Y, Emrich J, Newton J, Tsugita A, Terzaghi E, Inouye M: Frameshift mutations and the genetic code. This paper is dedicated to Professor Theodosius Dobzhansky on the occasion of his 66th birthday. Cold Spring Harb Symp Quant Biol. 1966, 31: 77-84.
Ohno S: Gene duplication and the uniqueness of vertebrate genomes circa 1970–1999. Semin Cell Dev Biol. 1999, 10: 517-522. 10.1006/scdb.1999.0332.
Lupski JR, Roth JR, Weinstock GM: Chromosomal duplications in bacteria, fruit flies, and humans. Am J Hum Genet. 1996, 58: 21-27.
Voet T, Vanneste E, Ampe M, Konings P, Le Caignec C, Melotte C, Debrock S, Schuit F, Moreau Y, Verbeke G, Fryns JP, D'Hooghe T, Vermeesch JR: Chromosomal rearrangements arise at high frequency during early human embryogeneis. Welcome Trust Genomic Disorders Workshop. 2008, Hinxton, UK
Bradley A: Keynote lecture. Welcome Trust Genomic Disorders Workshop. 2008, Hinxton, UK
Korbel JO, Urban AE, Affourtit JP, Godwin B, Grubert F, Simons JF, Kim PM, Palejev D, Carriero NJ, Du L, Taillon BE, Chen Z, Tanzer A, Saunders AC, Chi J, Yang F, Carter NP, Hurles ME, Weissman SM, Harkins TT, Gerstein MB, Egholm M, Snyder M: Paired-end mapping reveals extensive structural variation in the human genome. Science. 2007, 318: 420-426. 10.1126/science.1149504.
Kidd JM, Cooper GM, Donahue WF, Hayden HS, Sampas N, Graves T, Hansen N, Teague B, Alkan C, Antonacci F, Haugen E, Zerr T, Yamada NA, Tsang P, Newman TL, Tüzün E, Cheng Z, Ebling HM, Tusneem N, David R, Gillett W, Phelps KA, Weaver M, Saranga D, Brand A, Tao W, Gustafson E, McKernan K, Chen L, Malig M, Smith JD, Korn JM, McCarroll SA, Altshuler DA, Peiffer DA, Dorschner M, Stamatoyannopoulos J, Schwartz D, Nickerson DA, Mullikin JC, Wilson RK, Bruhn L, Olson MV, Kaul R, Smith DR, Eichler EE: Mapping and sequencing of structural variation from eight human genomes. Nature. 2008, 453: 56-64. 10.1038/nature06862.
The authors would like to thank our colleagues Drs. Pawel Stankiewicz, Jan Korbel, Jonathan Berg and Bernice Morrow for their critical reading and intellectual input. WG is a Feodor-Lynen Research Fellow generously supported by the Alexander-von-Humboldt Stiftung. Work in the Lupski laboratory has been sponsored by the National Institutes of Health, the March of Dimes and the Charcot-Marie-Tooth Association.
The authors declare that they have no competing interests.
WG and JRL wrote the review manuscript. FZ participated in the discussion and helped to edit the figures. All authors read and approved the final manuscript.