Abstract
Several new genomic disorders caused by copy number variation (CNV) of genes whose dosage is critical for the physiological function of the nervous system have been recently identified. Dup(7)(q11.23) patients carry duplications of the genomic region deleted in Williams-Beuren syndrome, they are characterized by prominent speech delay. The phenotypes of Potocki-Lupski syndrome and MECP2 duplication syndrome were neuropsychologically examined in detail, which revealed autism as an endophenotype and a prominent behavioral feature of these disorders. Tandem duplication of LMNB1 was reported to cause adult-onset autosomal dominant leukodystrophy. PAFAH1B1/LIS1 and YWHAE, which were deleted in isolated lissencephaly (PAFAH1B1/LIS1 alone) and Miller-Dieker syndrome (both genes), were found to be duplicated in patients with developmental delay. Finally, two novel microdeletion syndromes affecting 17q21.31 and 15q13.3, as well as their reciprocal duplications, were also identified. In this review, we provide an overview of the phenotypic manifestation of these syndromes and the rearrangements causing them.
Introduction
Much progress has been made in the research on CNV and neurological diseases since our last review on this topic (Lee and Lupski, 2006) two years ago. Some known disorders were further characterized and several new genomic disorders with major neurological manifestations were identified, some of which are caused by rearrangements of genomic regions which were not related to any genomic disorder and others caused by rearrangements reciprocal to genomic intervals of known disorders. A few of these new findings are summarized and discussed in this review. Both autism and schizophrenia have been shown to be genomic disorders, they will be discussed elsewhere in this issue.
Williams-Beuren region reciprocal duplication
A significant recent finding in neurogenetics is the characterization of the reciprocal duplication involving the Williams-Beuren syndrome (WBS, MIM194050) region. WBS is caused by genomic deletions in 7q11.23 that typically encompass 25–30 genes. WBS has the prevalence of 1/7,500 to 1/20,000 and is among the most well-characterized microdeletion syndromes (Greenberg, 1990; Osborne, 1999; Tassabehji, 2003). The majority of WBS patients carry a common recurrent 1.55 Mb deletion, which apparently occurs via non-allelic homologous recombination (NAHR) (Stankiewicz and Lupski, 2002; Bayes et al., 2003). The reciprocal duplication of this deletion has long been expected; but it was not until 2005 that this duplication was finally identified in a single patient with severe speech delay (Somerville et al., 2005). Since then, 16 further duplications in the WBS critical region (WBCR) have been reported (Somerville et al., 2005; Kriek et al., 2006; Berg et al., 2007; Depienne et al., 2007; Kirchhoff et al., 2007a; Torniero et al., 2007, 2008; Merritt and Lindor, 2008; Orellana et al., 2008), enabling the definition and the first characterizations of the dup(7)(q11.23) syndrome (MIM 609757).
Phenotypic manifestations of dup(7)(q11.23)
Overall, dup(7)(q11.23) syndrome has milder and more variable manifestations compared to WBS. WBS patients have a distinctive and characteristic facial phenotype, whereas the dup(7)(q11.23) patients, though having some mild dysmorphology, do not share a common gestalt (Somerville et al., 2005; Kriek et al., 2006; Berg et al., 2007; Depienne et al., 2007; Kirchhoff et al., 2007a; Torniero et al., 2007, 2008; Merritt and Lindor, 2008; Orellana et al., 2008). Most WBS patients display cardiovascular and connective tissue anomalies (the most common form being supravalcular aortic stenosis SVAS) resulting from haploinsufficiency of the ELN gene inside WBCR (Fig. 1) which codes for the protein elastin. Intragenic deletions and point mutations in ELN have been found in patients with similar cardiovascular phenotypes but without WBS deletions, confirming ELN as the substrate of the cardiovascular components of WBS phenotypes (Morris and Mervis, 2000). Interestingly, no organ anomalies were observed in most of the dup(7)(q11.23) patients. One potential explanation is that the development of the cardiovascular and connective tissues is only sensitive to decreased, but not to an abnormally increased dosage of ELN. Alternatively, the variation in gene expression level is quantitatively not as significant when the ELN gene is changed from two copies to three copies in duplication patients instead of from two copies to one copy in the case of deletions.
The most significant phenotypic component observed in almost all dup(7)(q11.23) patients is their moderate to severe language delay, paired with normal to only mildly impaired nonverbal and visuospatial skills (Somerville et al., 2005; Kriek et al., 2006; Berg et al., 2007; Depienne et al., 2007; Kirchhoff et al., 2007a; Torniero et al., 2007, 2008; Merritt and Lindor, 2008; Orellana et al., 2008). This pattern is the opposite of the typical WBS patients who display fluent expressive language alongside poor visuospatial skills (Mervis and Klein-Tasman, 2000). This contrast is of extraordinary interest, as it indicates the existence of one or more genes in WBCR that are highly dosage sensitive (in case of both increased and decreased gene amount) for the development of the neuronal circuits and pathways involved in human speech and language ability (Fisher, 2005; Tassabehji and Donnai, 2006). Moreover, speech delay is such a common general pediatric problem; other than sometimes being related to hearing impairment, there are relatively few insights into its biological causes.
Mechanisms for the deletions and duplications in dup(7)(q11.23)
The WBS region displays a highly complex genomic structure; it contains three groups (A, B, C) of low copy repeats (LCRs) (Stankiewicz and Lupski, 2002) ranging from 44 to 146 kb in size. Each group includes at least three highly (up to 99.5%) homologous copies of LCRs which are localized in three different modules (centromeric, medial and telomeric) (Cusco et al., 2008) (Fig. 1). In 89% of WBS cases, the deletions are 1.55 Mb in length and arise via NAHR between the two LCRs Bc and Bm (Bayes et al., 2003; Cusco et al., 2008). An uncommon but recurrent deletion of 1.84 Mb has also been identified in 8% of WBS patients (Cusco et al., 2008); this deletion is caused by NAHR between an alternative LCR pair Ac and Am (Bayes et al., 2003) (Fig. 1).
Besides these two recurrent deletions, a number of nonrecurrent deletions of atypical sizes have also been identified (Wu et al., 1998; Botta et al., 1999; Tassabehji et al., 1999, 2005; Korenberg et al., 2000; Gagliardi et al., 2003; Heller et al., 2003; Karmiloff-Smith et al., 2003; Howald et al., 2006; Schubert and Laccone, 2006). These atypical deletions, and the differences in their phenotypic manifestations, have been very informative in delineating the WBS endophenotypes and especially in searching for genes associated with language and cognition (Tassabehji, 2003). Different from the two recurrent deletions, most of these nonrecurrent deletions do not have both breakpoints localized in LCRs, so they could have occurred via either non homologous end joining (NHEJ) (Shaw and Lupski, 2005) or replication fork stalling and template switching (FoSTeS) (Lee et al., 2007), as well as NAHR mediated by elements other than LCRs (such as the Alu repeats; Babcock et al., 2003; Shaw and Lupski, 2005). As little information on the exact breakpoints of these nonrecurrent deletions is available, no conclusion can be drawn yet as to which mechanism indeed causes these deletions.
Sixteen duplications in the WBS region have been reported to date. The breakpoints of the two duplications described by Somerville et al. and Torniero et al. are localized in Bm and Bc (Fig. 1); these two duplications are thus the bona fide reciprocal duplications of the 1.55 Mb WBS common recurrent deletion. The breakpoints of the duplications described in Kriek et al. (2006), Berg et al. (2007), Depienne et al. (2007), Kirchhoff et al. (2007a), Merritt and Lindor (2008), Orellana et al. (2008) and Torniero et al. (2008) were not exactly mapped. They could be in any LCR or in the non-duplicated region between the LCRs Am and Bt; the duplications could thus be reciprocal to the common recurrent deletion, uncommon recurrent deletion (if the breakpoints are in Ac and Am), or perhaps some may represent nonrecurrent duplications. The two duplications carried by case 4 reported in Berg et al. (2007) and case 2 in Kriek et al. (2006) are clearly NOT reciprocal to the recurrent WBS deletions; one of them extends beyond WBCS into the telomeric direction, another one affects only one single gene FKBP6 in WBCS. The identification of more duplications and the mapping of their breakpoints will help us further understand the mechanisms for these rearrangements. Furthermore, the phenotypic comparison among duplications with different sizes, as well as between reciprocal deletions and duplications will be very useful in further delineation of the dup(7)(q11.23) syndrome and understanding the substrates and pathways of the dosage sensitivity in the WBS critical region.
Using a PCR-based sperm typing assay, Turner et al. (2008) mapped the frequency of the 1.55 Mb common WBS deletion and its reciprocal duplication in meiosis; they observed a two deletion to one duplication ratio in the frequencies of the reciprocal events. Considering the prevalence of WBS and the fact that the overwhelming majority of the WBS deletions are indeed due to the 1.55 Mb deletion, one should expect more duplication cases to be found soon. However, the phenotype of dup(7)(q11.23) syndrome seems to be much milder and does not include the multiple organ anomalies observed in WBS; the chance that these individuals are medically ascertained and receive comprehensive genetic tests is thus lower than that for the WBS patients.
Flanking CNVs and SVs associated with the deletions and duplications in dup(7)(q11.23)
Only a few cases of parental inheritance have been reported for the WBS deletion; otherwise, almost all WBS deletions are sporadic de novo events occurring in the meiosis of one of the parents (the transmitting progenitor) (Cusco et al., 2008). Several structural variants of the WBS region were observed to be frequently associated with the transmitting progenitors. A 2 Mb inversion mediated by the inversely oriented centromeric and telomeric LCR modules, and including the whole WBCR, occurs more frequently in the transmitting parents than non-transmitting parents and control persons (Osborne et al., 2001; Bayes et al., 2003). An inversion does not change the copy number of genes and is not a true copy number variation (CNV); it is, however, a very important and frequent element of genomic structural variation (SV), as also observed by disease-independent genome-wide analyses of structural variations (Flores et al., 2007; Korbel et al., 2007; Kidd et al., 2008). Most recently, Cusco et al. (2008) identified several CNV polymorphisms of the LCR blocks, the most frequent ones being deletion and duplication of the centromeric LCRs mediated by NAHR between Cc and Cm. The deletion allele was found to be significantly more frequent in transmitting progenitors than non-transmitting progenitors and control individuals.
In four of the 16 dup(7)(q11.23) cases described to date, not enough information on inheritance pattern is available. Quite different from the sporadic nature of WBS, among the remaining 12 informative dup(7)(q11.23) cases, one inherited the duplication from his father (Kriek et al., 2006); three from their mothers (Berg et al., 2007; Torniero et al., 2008); 4/12 (33%) is astonishingly high compared to the familial WBS cases. This difference may be related to the milder phenotype of dup(7)(q11.23) compared to WBS (Berg et al., 2007). It will be very interesting to investigate if the CNV alleles associated with the transmission of the deletions also occur in higher frequency in the transmitting progenitors of the duplications. The theoretical answer would be yes.
MECP2 duplication syndrome
The X-linked gene MECP2 located in Xq28 encodes the methyl-CpG-binding protein 2 (MeCP2). By functioning as both a transcription activator and repressor, MeCP2 regulates the expression of a wide range of genes in CNS (Chahrour et al., 2008). Point mutations and intragenic deletions in MECP2 have been found to be the major cause of the neurodevelopmental disorder Rett syndrome (RTT, MIM312750). In females, the manifestation of Rett syndrome includes stereotypic hand wringing, loss of speech, acquired microcephaly, mental retardation with autistic features, seizures, ataxia, and breathing dysrhythmias. The only males manifesting bona fide Rett syndrome when carrying the same mutations are those with an 47, XXY karyotype (reviewed in Moretti and Zoghbi, 2006). Males with normal karyotype and carrying the same mutations causing Rett syndrome in females suffer from severe encephalopathy and infantile death (Zeev et al., 2002). Hypomorphic mutations barely causing any phenotype in females cause mental retardation, tremors and a variety of neuropsychiatric features in males (Meloni et al., 2000; Cohen et al., 2002; Kleefstra et al., 2002).
A number of deletions involving MECP2 have been identified in female patients with Rett syndrome and one male patient with severe neonatal encephalopathy (Schollen et al., 2003; Laccone et al., 2004; Ravn et al., 2005; Hardwick et al., 2007; Scala et al., 2007). Skewed X inactivation (SCI) was identified in some, but not all informative deletion patients. Although some deletions include a part of the downstream flanking gene IRAK1(Laccone et al., 2004; Pan et al., 2006; Hardwick et al., 2007; Scala et al., 2007), most of them are intragenic and involve parts of exon 3 and/or exon 4 of MECP2 (Ravn et al., 2005); none of the larger (>1 kb) deletions has been reported to be recurrent. Interestingly, 12 of 21 nonrecurrent deletions with precisely mapped boundaries have one of their breakpoints in a 150 bp genomic region, which is also a hotspot for the smaller deletions (20–500 bp) confined within exon 4 (reviewed in Hardwick et al., 2007). This phenomenon was named breakpoint grouping (Lee et al., 2006), it has been described for the nonrecurrent duplications at the Pelizeaus-Merzbacher Disease (PMD, MIM312080) locus (Lee et al., 2006) and the nonrecurrent deletions at the Smith-Magenis syndrome (SMS; MIM182290) locus (Stankiewicz et al., 2003). Similar to these loci, the MECP2 locus also occurs in a region with highly complex genomic architecture characterized through multiple LCRs (del Gaudio et al., 2006). The occurrence of breakpoint grouping further highlights the important roles of genomic architecture in the formation of genomic rearrangements.
Recently, a number of genomic duplications involving MECP2 have also been identified in males manifesting a progressive neurodevelopmental syndrome (Van Esch et al., 2005; Meins et al., 2005; del Gaudio et al., 2006; Friez et al., 2006; Bauters et al., 2008; Carvalho et al., submitted). The common phenotypes include severe to profound mental retardation, poor speech development, infantile hypotonia, recurrent infections, epilepsy and progressive spasticity. Tavyev et al. (submitted), for the first time, performed detailed neuropsychiatric examinations including Autism Diagnostic Observation Schedule (ADOS) and Autism Diagnostic Interview-Revised (ADI-R) on nine males carrying MECP2 duplication and demonstrated that autism is a defining feature of the MECP2 duplication syndrome (Tavyev et al., submitted). The neuronal phenotype of MECP2 duplication (MIM 300260) is thus remarkably similar to that of Rett Syndrome caused by loss of function of MECP2 with both disorders displaying mental retardation and autism. Analyses at cellular and molecular levels, however, showed that the detailed pathomechanisms underlying the two syndromes are completely different (Chahrour et al., 2008). Interestingly, neuropsychological examination of the mothers of the duplication patients, who are carriers of the duplication, showed that they display significant psychiatric symptoms including generalized anxiety, depression, and compulsion that preceded the birth of their children, although a nearly 100% skewing of X inactivation favoring the duplication chromosome was observed (Tavyev et al., submitted). However, the skewed X inactivation was measured in the blood and does not necessarily reflect what is occurring in the brain.
No recurrent MECP2 duplication has been observed yet, the boundaries of the duplications are scattered around a 3 Mb region flanking MECP2. High-resolution array CGH and subsequent PCR experiments revealed that some of the duplications have complex structure (Bauters et al., 2008; Carvalho et al., submitted) and may have arisen via the FoSTeS mechanism. FoSTeS has also been observed for rearrangements at the PMD locus, similarly localized in a highly complex genomic region (Lee et al., 2007). Surprisingly, the duplications in the MECP2 region always encompass much larger genomic fragments than the deletions. The smallest duplication in the MECP2 region identified to date is 154 kb (Carvalho et al., submitted) and involves at least the complete open reading frames of IRAK1 and MECP2. This smallest duplication already exceeds the largest MECP2 region deletions reported to date (Ravn et al., 2005; Hardwick et al., 2007).
The difference in the size of MECP2 region duplications and deletions is curious. It might be due to the different mechanisms underlying the origin of these rearrangements and reflects a true difference in the size of the deletions and duplication in this region; alternatively, it can be related to phenotypic selection, as has been postulated for the PMD associated PLP1(proteolipid protein 1) duplications (Inoue et al., 2002). The rearrangements not observed may have an early lethal or extremely mild phenotype, keeping the carriers from being medically ascertained. Skewed-X inactivation, which has been observed in all duplication carriers and some deletion patients, may also play a role in the phenotypic manifestation of these rearrangements.
ADLD and LMNB1 duplication
After Charcot-Marie-Tooth disease type 1A (CMT1A, MIM118220) (Lupski et al., 1991; Raeymaekers et al., 1991) and Pelizeaus-Merzbacher Disease (PMD, MIM312080) (Inoue, 2005), the adult-onset autosomal dominant leukodystrophy (ADLD, MIM169500) became the third member in the group of demyelinating disorders that can be caused by abnormal gene dosage (Padiath et al., 2006). ADLD is a very rare (only five families have been reported worldwide), slowly progressive and fatal condition which manifests in the fourth to sixth decade of life (Schwankhaus et al., 1994). The characteristic early autonomic abnormalities include bowel/bladder dysfunction, impotence in male patients, decreased sweating and others. Later in the course of their disease, ADLD patients develop ataxia and signs of pyramidal tract involvement (Brown et al., 1987). Symmetrical demyelination in the CNS can be found in MRI scans and neuropathological examination (Schwankhaus et al., 1988). If not for the symmetry, such lesions could be mistaken for those observed in multiple sclerosis.
Via linkage analysis, Fu and colleagues identified a locus on 5q31 which is linked to ADLD in a five-generation kindred (Coffeen et al., 2000). In another segregating kindred, ADLD was reported as being linked to 5q23 (Marklund et al., 2006). Later, Fu and colleagues detected a tandem duplication of a 170 kb genomic fragment at 5q31 that segregates with the disease in the ADLD family (Padiath et al., 2006). The duplication does not disrupt any known gene and encompasses the complete ORF of two genes, of them only LMNB1 (encoding Lamin B1) is expressed in the brain. Indeed, increased expression of LMNB1,at both RNA and protein levels, was found in brain tissue of the patients bearing the duplication. Also, overexpressing human LMNB1 or its Drosophila orthologue resulted in neurodegenerative phenotypes in flies. It is thus highly likely that the duplication of LMNB1 is causative for ADLD.
Lamin B1 is a member of the intermediate filament family of proteins and is part of the nuclear lamina underlying the nuclear envelope of eukaryotic cells (Worman and Bonne, 2007). Mutations in its close paralogue, LMNA/C, have been found to be causative for more than ten different disorders. Interestingly, some of the conditions result from haploinsufficiency of LMNA (Worman and Bonne, 2007). Time-lapse videomicroscopy showed that the protein Lamin B1 serves a fundamental role within the nuclear envelope by anchoring the nucleus to the cytoskeleton (Ji et al., 2007). Silencing of LMNB1 via si RNA causes cell death in human cells and C. elegans (Liu et al., 2000; Harborth et al., 2001). Homozygous Lmnb1 knockout mice die at birth, with defects in lung and bone (Vergnes et al., 2004). Intriguingly, the heterozygous knockout mice were reported to be fertile and ‘phenotypically indistinguishable from wildtype mice’ (Vergnes et al., 2004). It is not known how the increased dosage of Lamin B1 causes the phenotypes in ADLD.
Duplication in the 17p13 lissencephaly and Miller-Dieker syndrome region
Heterozygote deletions in the chromosomal region 17p13 have long been known to cause two distinct disorders with partially overlapping phenotypes: isolated lissencephaly sequence (ILS, MIM607432) and Miller-Dieker syndrome (MDS, MIM 247200). Analyses of the phenotype-genotype associations of these two disorders (Cardoso et al., 2003) have highlighted the importance of two genes for the phenotypic manifestation of the deletion: PAFAH1B1/LIS1 and YWHAE(coding for the protein 14-3-3ε). The deletions involving only PAFAH1B1/LIS1 (Reiner et al., 1993) but not YWHAE cause ILS, characterized by lissencephaly and minimal or no other dysmorphic features (Cardoso et al., 2003). ILS can be also caused by point mutations in the PAFAH1B1/LIS1 gene (Uyanik et al., 2007). When the deletion encompasses both PAFAH1B1/LIS1 and YWHAE, MDS is manifested, with lissencephaly plus significant facial dysmorphism, as well as occasionally other congenital anomalies (Dobyns et al., 1991). In addition, the lissencephaly phenotype in MDS is in general more severe than in ILS (Dobyns et al., 1991; Cardoso et al., 2003). Interestingly, two patients carrying deletions of only YWHAE but not PAFAH1B1/LIS1 have also been reported, who have no lissencephaly, but only mental retardation (Cardoso et al., 2003).
PAFAH1B1/LIS1 encodes a 45-kDa noncatalytic subunit of platelet-activating factor acetylhydrolase 1B (Hattori et al., 1994). YWHAE, or 14-3-3 ε, encodes the ε isoform of the 14-3-3 proteins, which bind to phosphoserine/phosphothreonine motifs in numerous binding partners and play important roles in a multitude of cellular functions (Kjarland et al., 2006). PAFAH1B1/LIS1 and YWHAE/14-3-3ε function in the same physiological pathway affecting neuronal migration (Toyo-oka et al., 2003; Wynshaw-Boris, 2007). Reduced dosage of either Pafah1b1/Lis1 or Ywhae/14-3-3ε in mouse results in neuronal migration defects and development impairments. The phenotypes are more severe in double heterozygous knockout mice (Pafah1b1 +/–, Ywhae +/–) when the dosage of both genes is decreased (Toyo-oka et al., 2003), which is consistent with the human data.
Interestingly, Mei et al. also reported a heterozygous duplication in PAFAH1B1/LIS1 in a patient with lissencephaly grade 2 (Mei et al., 2008). This duplication only includes exon 3–5 of PAFAH1B1/LIS1 and likely disrupts its ORF and reduces the effective gene dosage. The pathological effect of this duplication would be thus probably similar to that of the loss-of-function deletions in PAFAH1B1/LIS1. Duplications truly increasing gene dosages were only detected very recently. Applying array CGH techniques, Bi et al. (2009) detected heterozygote duplication of either PAFAH1B1/LIS1 or YWHAE/14-3-3 εalone, as well as a large duplication involving both YWHAE/14-3-3 εand PAFAH1B1/LIS1 in seven unrelated cases. RT-PCR confirmed the enhanced expression level of these two genes at increased gene dosage. Increased PAFAH1B1/LIS1 dosage causes failure to thrive, moderate to severe developmental delay, small brain and mild brain structural abnormalities. Duplication of YWHAE/14-3-3 εis associated with dysmorphic faces, macrosomia and developmental delay. None of the duplication patients showed a lissencephaly phenotype. Bi et al. also showed that transgenic mice overexpressing Pafah1b1/Lis1 mimicked part of the phenotypes observed in human, confirming the dosage-sensitivity of PAFAH1B1/LIS1 at both decreased and increased expression.
The size and localization of most ILS and MDS deletions were analyzed via FISH (Cardoso et al., 2003; Chabchoub et al., 2006; Mei et al., 2008) and sequence-tag mapping (Cardoso et al., 2003), only one case was analyzed with BAC-array (Chabchoub et al., 2006). These methods are not optimal for detecting complex rearrangements or to map the exact boundaries of the rearrangements. It is thus not possible to conclude if the cases with similar deletions (Cardoso et al., 2003) indeed represent recurrent rearrangements. Mei et al. obtained the breakpoints of deletions in four lissencephaly patients via long-range PCR (Mei et al., 2008). All obtained breakpoints are localized in or adjacent to Alu sequence. Sequence analysis showed that two of the deletions probably occurred via Alu-mediated NAHR, with one of them found in two patients, as has been previously shown for nonrecurrent SMS deletions in 17p11.2 (Shaw and Lupski, 2005). It is not clear from the reported data if it is a true recurrent deletion or if these two patients inherited this deletion from a common ancestor. Another deletion apparently resulted from an NHEJ mechanism, with no homology found at the junction but a piece of ‘filler-DNA’ of 24 bp (Mei et al., 2008). The high-density oligo array with 2–3 oligos interrogating each kb DNA as well as subsequent PCR reactions enabled Bi et al. to observe several complex rearrangements with the duplications interrupted by triplication or normal copy number. These complex rearrangements are likely the result of the recently proposed replication Fork Stalling and Template Switching (FoSTeS) mechanism (Lee et al., 2007). Breakpoint analyses are being performed for further information regarding the mechanisms underlying these rearrangements.
17q21.31 microdeletion syndrome
A new genomic disorder affecting the nervous system, the microdeletion syndrome 17q21.31 (MIM 610443) was identified by three groups simultaneously in 2006 (Koolen et al., 2006; Sharp et al., 2006; Shaw-Smith et al., 2006), after several single cases with similar deletions had been reported earlier (Park et al., 1992; Khalifa et al., 1993; Varela et al., 2006). In a recent multi-center clinical and molecular study on 22 patients, Koolen et al. (2008) estimated the prevalence of the 17q21.31 microdeletion syndrome to be 1/16,000. Common phenotypes observed in almost all affected individuals include developmental delay, hypotonia, a friendly/amiable behavior and mild but characteristic dysmorphic facial features (Koolen et al., 2006, 2008; Sharp et al., 2006; Shaw-Smith et al., 2006). The deletions are of 500–650 kb in size; the minimal critical region was refined to a 424 kb genomic segment (Sharp et al., 2006; Koolen et al., 2008) and encompasses six genes including CRHR1 (coding for corticotrophin releasing hormone receptor 1) and MAPT (coding for microtubule-associated protein tau), both highly expressed in brain (Sharp et al., 2006). Gain-of-function mutations in MAPT are known to cause autosomal dominant forms of frontal temporal dementia with parkinsonism (MIM 600274) (Hutton et al., 1998; D’Souza et al., 1999; Rademakers et al., 2004); common genetic variation of MAPT is associated with progressive supranuclear palsy (PSNP, MIM 601104) (Pittman et al., 2005) and Alzheimer’s disease (Myers et al., 2005). Haploinsufficiency of MAPT has never been reported in humans before (Koolen et al., 2006). Interestingly, in mice lacking the Tau/Mapt gene (Harada et al., 1994), muscle weakness, hyperactivity in a novel environment and impairment of contextual fear conditioning were found (Ikegami et al., 2000), in parallel to the motor developmental defect and learning disability observed in del(17)(q21.31) patients (Koolen et al., 2006; Shaw-Smith et al., 2006). These data render a possible role of MAPT in the pathomechanisms underlying this new genomic disorder microdeletion 17q21.31.
The deletions in 17q21.31 always occur de novo; their breakpoints were mapped in large flanking LCR blocks of highly complex structure (Koolen et al., 2006; Sharp et al., 2006; Shaw-Smith et al., 2006). Two major haplotypes named H1 and H2 exist for this region (Stefansson et al., 2005). The H2 lineage harbors a 900 kb inversion covering the 17q21.31 deletion region and is under positive selection in European populations, where it has a frequency of 20%. One of the profound differences between these two haplotypes is that the inversion places the two LCRs directly mediating the 17q21.31 deletions in the same orientation and thus facilitates/enables the NAHR rearrangements (Stankiewicz and Lupski, 2002) which delete the flanked interval (Lupski, 2006; Sharp et al., 2006). Indeed, haploype analyses in all three reports found that the transmitting parent of the deletion patient obligatorily carries either H1/H2 heterozygote or H2/H2 homozygote genotype. No transmitting parent without the H2 haplotype has ever been identified.
Interestingly, a duplication of the critical region deleted in the 17q21.31 deletion syndrome has also been identified in a girl with severe psychomotor developmental delay and dysmorphic craniofacial features (Kirchhoff et al., 2007b). Like the transmitting parents of the deletion patients, the transmitting father of this girl also carries the H2 haplotype (H1/H2 heterozygote) (Kirchhoff et al., 2007b). It would be very interesting to compare the phenotype of the duplication patients with the patients carrying isolated gain-of-function mutations in MAPT in detail, to evaluate the contribution of MAPT to the duplication phenotypes. However, this comparison would be much more informative after more duplication patients are described and more precise knowledge can be gained about the phenotypic manifestation of the 17q21.31 duplication.
15q13.3 microdeletion syndrome
Using their BAC array specifically designed to interrogate genome-wide 130 NAHR candidate sites flanked by LCRs (Sharp et al., 2005, 2006), Sharp et al. (2008) identified another new genomic disorder, the microdeletion syndrome 15q13.3 (MIM612001), which is associated with mild to moderate mental retardation, seizure and/or abnormal EEG findings, mild and variable facial and digital dysmorphism. The recurrent deletion is 1.5 Mb long and encompasses six genes. The proximal and distal breakpoints of the deletions are localized in two large LCR/SD blocks designated BP4 and BP5, respectively (Sharp et al., 2008). These data, and the apparent recurrent nature of the deletions both suggest that these deletions in 15q13.3 are caused by NAHR mediated by BP4 and BP5.
Although the genomic localization of the deleted region is adjacent to the common deletion region in Prader-Willi syndrome (PWS, MIM176270) and Angelman syndrome (AS, MIM105830), none of the six included genes is known to be imprinted (Sharp et al., 2008). One of these genes, CHRNA7 (cholinergic receptor, neuronal nicotinic, alpha polypeptide 7), encodes the alpha7 subunit of an ion channel, nicotinic acetylcholine receptor, that mediates neuronal signal transmission (Berg and Conroy, 2002). CHRNA7 has been an interesting candidate gene for genetic epilepsies for many years; it is the paralogue of CHRNA4, the first gene ever found to be mutated in an idiopathic epilepsy syndrome, autosomal dominant nocturnal frontal lobe epilepsy (ADNFLE, MIM600513) (Steinlein et al., 1995). Mutations in the gene coding for another subunit of the nicotinic acetylcholine receptors, CHRNB2 (De Fusco et al., 2000; Phillips et al., 2001) and another paralogue of CHRNA7, CHRNA2 (Aridon et al., 2006), were also found to cause epilepsy syndromes. Linkage studies pinpointed CHRNA7 as a possible susceptibility factor for both juvenile myoclonic epilepsy type 2 (EJM2, MIM604827) (Elmslie et al., 1997) and benign epilepsy of childhood with centrotemporal spikes (MIM 117100) (Neubauer et al., 1997). Furthermore, knockout mice lacking Chrna7 show a hypersynchronous neocortical EEG phenotype (Orr-Urtreger et al., 1997). However, pathological point mutations of CHRNA7 have not been found in any epilepsy syndromes (Taske et al., 2002; Ortrud Steinlein, personal communication). The finding of the microdeletion 15q13.3 syndrome now sheds new light onto CHRNA7 and its possible role in epileptogenesis. Interestingly, a partial pseudogene of CHRNA7, CHRFAM7A is located less than 2 Mb away (Steinlein and Bertrand, 2008), not only adding difficulties to mutational analyses of CHRNA7, but also rendering the genomic architecture of this locus even more complex, as these two genes CHRNA7 and CHRFAM7A can function as substrates for NAHR themselves.
Theoretically, the occurrence of the reciprocal duplication can be mostly expected for the region undergoing an NAHR-mediated recurrent deletion (Stankiewicz and Lupski, 2002). Indeed, Sharp et al. identified an apparent reciprocal duplication of the 1.5 Mb deletion in a healthy control individual, indicating that the increased copy number of the critical genes in this region may not affect the neuronal system in a way as dramatic as the decreased copy number (Sharp et al., 2008). Interestingly, the LCR/SDs in the two NAHR-mediating LCR blocks BP4 and BP5 have inversed orientation in the draft sequence of the UCSC sequence browser (www.genome.ucsc.edu). According to the mechanisms of NAHR (Lupski, 1998; Stankiewicz and Lupski, 2002), there is probably also an inverted haplotype in this region, which would place BP4 and BP5 into the same orientation and facilitate the NAHR rearrangements. Indeed, this inversion was found in the transmitting progenitors of both 15q13.3 microdeletion carrying patients whose parent of origin could be determined. Further investigations are still needed for a statistically relevant conclusion because of the high frequency of this inversion in the normal population.
Conclusions
Since the 1998 review that coined and defined the term genomic disorders (Lupski, 1998), a multitude of genomic disorders caused by genomic rearrangements instead of the traditional Watson-Crick base pair changes have been identified. Many of these disorders are due to altered gene dosage, or copy number variation (CNV), of one or more dosage-sensitive genes inside the rearranged region. The recent development of new techniques such as array-CGH to observe the rearrangements and their breakpoints with a higher resolution have greatly facilitated and catalyzed the identification and characterization of CNVs and novel genomic disorders; they have also yielded insights into molecular mechanisms for the rearrangements as well as the biological nature of the dosage-sensitivity of the rearranged regions which result in diseases (reviewed by Lupski and Stankiewicz, 2005; Emanuel and Saitta, 2007; Stankiewicz and Beaudet, 2007; Gu et al., 2008).
In this review, we summarized some of the most recently defined genomic disorders affecting the nervous system. These disorders can be neurodevelopmental (as WBS and the reciprocal duplication), neurodegenerative (as ADLD), or neuropsychiatric diseases (as autism; Sebat et al., 2007; Kumar et al., 2008; Weiss et al., 2008; and schizophrenia: Lupski, 2008; Stefansson et al., 2008; The International Schizophrenia Consortium, 2008; Walsh et al., 2008; Xu et al., 2008) which are described elsewhere in this issue). From the rearrangement point of view, CNV-based genomic disorders could be due to duplication, deletion or sometimes complex rearrangements including both events in different genomic segments (Lee et al., 2007). The mechanisms underlying these pathological rearrangements, as for any other genomic rearrangements, can be NAHR, NHEJ or FoSTeS (Lupski and Stankiewicz, 2005; Lee et al., 2007; Gu et al., 2008). Not many publications have reported NHEJ or FoSTeS as the presumptive mechanism underlying a pathological rearrangement, which is partially because the characteristics of NHEJ and FoSTeS are not resolved by many traditional techniques and are not yet as well known among the geneticists as NAHR and are often not recognized, as in the case of the atypical nonrecurrent deletions in WBS described in the first part of this review. High resolution genome analysis tools and direct sequencing of breakpoints are often required to reveal complex rearrangements. We expect to see more NHEJ- or FoSTeS-mediated pathological rearrangements reported in the future, especially in the analyses of the ‘atypical’ or nonrecurrent rearrangements.
It should be emphasized that the identification of a new genomic disorder is by no means accomplished after the original finding of overlapping genomic rearrangements and common phenotypes at a certain genomic locus. The reciprocal duplications to the Smith-Magenis Syndrome deletions were already described by Potocki et al. (2000) as a new genomic disorder identified in seven individuals in 2000, but it was not until 2007 that the same group and their collaborators have recruited more patients and performed comprehensive, multidisciplinary clinical and molecular analyses on 35 patients bearing this duplication, thus giving the first comprehensive clinical description of this disorder, now named the Potocki-Lupski syndrome (PTLS, MIM610883) (Potocki et al., 2007). The clinical data refined the key phenotypes of PTLS, which include infantile hypotonia, failure to thrive, mild to moderate mental retardation and autism spectrum disorders. It was not until this comprehensive clinical study with 35 patients that the autistic features and structural cardiovascular anomalies were also identified to be among the key manifestations of PTLS. The number of patients and the state-of-art array CGH technique also enabled the precise mapping of the rearrangements and further delineations of the phenotypes, which narrowed down the critical region of PTLS to a 1.3 Mb region containing RAI1, the critical dosage-sensitive gene for SMS (Slager et al., 2003; Bi et al., 2004, 2006; Girirajan et al., 2005). We look forward to similar developments in the research of all newly defined genomic disorders, to the new clinical and biological insight that the future studies of these disorders will bring us.
For many genomic disorders with distinct abnormalities, the reciprocal rearrangements have partially milder phenotypes (such as the facial dysmorphology in the reciprocal duplication of the WBS) or a reduced penetrance (as in the reciprocal duplication of the WBS deletion and the microdeletion 15q13.3 syndrome), implicating that dosage sensitivity does not always apply for both increased or decreased dosage of the same gene. In some other cases, the reciprocal rearrangements have partially opposite or reciprocal endophenotype (as the difference in the speech vs. visuospatial development between WBS and dup(7)(q11.23) syndrome), indicating a quantitative read out system downstream of a critical gene. There are also instances when duplications and deletions involving the same gene manifest similar phenotypes, although through completely different molecular pathways, as in deletions and duplications of MECP2. Whether and how too much or too little gene product will be recognized and/or tolerated by the organism is not a trivial fact, it reflects the function and regulation of an individual gene and the pathways in which it is involved. Careful analysis and comparison of the phenotypes of reciprocal genomic disorders/conditions would thus likely provide interesting insight into the biology of the genes underlying their pathomechanisms.
References
Work in the Lupski laboratory has been sponsored by the National Institutes of Health, the March of Dimes and the Charcot-Marie-Tooth Association. W.G. is a Feodor-Lynen Research Fellow generously supported by the Alexander-von-Humboldt Stiftung.