Development of bioinformatics resources for display and analysis of copy number and other structural variants in the human genomeZhang J.a · Feuk L.a · Duggan G.E.a · Khaja R.a · Scherer S.W.a, b
aThe Centre for Applied Genomics and the Program in Genetics and Genomic Biology, The Hospital for Sick Children and bDepartment of Molecular and Medical Genetics, University of Toronto, Toronto, Ontario (Canada)
The discovery of an abundance of copy number variants (CNVs; gains and losses of DNA sequences >1 kb) and other structural variants in the human genome is influencing the way research and diagnostic analyses are being designed and interpreted. As such, comprehensive databases with the most relevant information will be critical to fully understand the results and have impact in a diverse range of disciplines ranging from molecular biology to clinical genetics. Here, we describe the development of bioinformatics resources to facilitate these studies. The Database of Genomic Variants (http://projects.tcag.ca/variation/) is a comprehensive catalogue of structural variation in the human genome. The database currently contains 1,267 regions reported to contain copy number variation or inversions in apparently healthy human cases. We describe the current contents of the database and how it can serve as a resource for interpretation of array comparative genomic hybridization (array CGH) and other DNA copy imbalance data. We also present the structure of the database, which was built using a new data modeling methodology termed Cross-Referenced Tables (XRT). This is a generic and easy-to-use platform, which is strong in handling textual data and complex relationships. Web-based presentation tools have been built allowing publication of XRT data to the web immediately along with rapid sharing of files with other databases and genome browsers. We also describe a novel tool named eFISH (electronic fluorescence in situ hybridization) (http://projects.tcag.ca/efish/), a BLAST-based program that was developed to facilitate the choice of appropriate clones for FISH and CGH experiments, as well as interpretation of results in which genomic DNA probes are used in hybridization-based experiments.
Request reprints from Stephen W. Scherer
The Centre for Applied Genomics
Program in Genetics and Genomic Biology
The Hospital for Sick Children
MaRS Centre – East Tower, 101 College Street, Room 14-701
Toronto, Ontario, M5G 1L7 (Canada)
telephone: +1-416-813-7613; fax: +1-416-813-8319
J.Z. and L.F. contributed equally to this work.
Supported by Genome Canada, the McLaughlin Centre for Molecular Medicine, and the Hospital for Sick Children Foundation. L.F. is supported by the Swedish Medical Research Council. S.W.S. is an investigator of the Canadian Institutes of Health Research and an International Scholar of the Howard Hughes Medical Institute.
Manuscript received: 23 March 2006
Accepted in revised form for publication by A. Geurts van Kessel,: 15 May 2006.
Number of Print Pages : 10
Number of Figures : 6, Number of Tables : 1, Number of References : 26
Cytogenetic and Genome Research
Vol. 115, No. 3-4, Year 2006 (Cover Date: November 2006)
Journal Editor: Schmid, M. (Würzburg)
ISSN: 1424–8581 (print), 1424–859X (Online)
For additional information: http://www.karger.com/CGR