SNP-Based Analysis of Genetic Substructure in the German PopulationSteffens M.a · Lamina C.b · Illig T.b · Bettecken T.d · Vogler R.e · Entz P.f · Suk E.-K.f · Toliat M.R.f · Klopp N.b · Caliebe A.g · König I.R.h · Köhler K.i · Lüdemann J.j · Diaz Lacava A.a · Fimmers R.a · Lichtner P.d · Ziegler A.h · Wolf A.g · Krawczak M.g · Nürnberg P.f · Hampe J.e · Schreiber S.e · Meitinger T.c, d · Wichmann H.-E.b · Roeder K.k · Wienker T.F.a · Baur M.P.a
aInstitute of Medical Biometry, Informatics and Epidemiology, Rheinische Friedrich-Wilhelms-University, Bonn, bInstitute of Epidemiology, GSF National Research Center, Munich-Neuherberg, cInstitute of Human Genetics, Technical University Munich, Munich, dInstitute of Human Genetics, GSF National Research Center, Munich-Neuherberg, eDepartment of General Internal Medicine, Christian-Albrechts-University, Kiel, fGene Mapping Center at the Max-Delbrück-Center, Berlin, gInstitute of Medical Statistics and Informatics, Christian-Albrechts-University, Kiel, hInstitute of Medical Biometry and Statistics, University of Lübeck, Lübeck, iDepartment of Genetic Epidemiology, University of Göttingen, Göttingen, jInstitute of Clinical Chemistry and Laboratory Medicine, University of Greifswald, Greifswald, Germany; kDepartment of Statistics, Carnegie Mellon University, Pittsburgh, Pa., USA
Do you have an account?
- Rent for 48h to view
- Buy Cloud Access for unlimited viewing via different devices
- Synchronizing in the ReadCube Cloud
- Printing and saving restrictions apply
Rental: USD 8.50
Cloud: USD 20.00
Article / Publication Details
Objective: To evaluate the relevance and necessity to account for the effects of population substructure on association studies under a case-control design in central Europe, we analysed three samples drawn from different geographic areas of Germany. Two of the three samples, POPGEN (n = 720) and SHIP (n = 709), are from north and north-east Germany, respectively, and one sample, KORA (n = 730), is from southern Germany. Methods: Population genetic differentiation was measured by classical F-statistics for different marker sets, either consisting of genome-wide selected coding SNPs located in functional genes, or consisting of selectively neutral SNPs from ‘genomic deserts’. Quantitative estimates of the degree of stratification were performed comparing the genomic control approach [Devlin B, Roeder K: Biometrics 1999;55:997–1004], structured association [Pritchard JK, Stephens M, Donnelly P: Genetics 2000;155:945–959] and sophisticated methods like random forests [Breiman L: Machine Learning 2001;45:5–32]. Results: F-statistics showed that there exists a low genetic differentiation between the samples along a north-south gradient within Germany (FST(KORA/POPGEN): 1.7 · 10–4; FST(KORA/SHIP): 5.4 · 10–4; FST(POPGEN/SHIP): –1.3 · 10–5). Conclusion: Although the FST -values are very small, indicating a minor degree of population structure, and are too low to be detectable from methods without using prior information of subpopulation membership, such as STRUCTURE [Pritchard JK, Stephens M, Donnelly P: Genetics 2000;155:945–959], they may be a possible source for confounding due to population stratification.
© 2006 S. Karger AG, Basel
- Deng HW: Population admixture may appear to mask, change or reverse genetic effects of genes underlying complex traits. Genetics 2001;159:1319–1323.
- Shriver MD, Kennedy GC, Parra EJ, Lawson HA, Sonpar V, Huang J, Akey JM, Jones KW: The genomic distribution of population substructure in four populations using 8,525 autosomal SNPs. Hum Genomics 2004;1:274–286.
- Devlin B, Roeder K: Genomic control for association studies. Biometrics 1999;55:997–1004.
- Pritchard JK, Stephens M, Donnelly P: Inference of population structure using multilocus genotype data. Genetics 2000;155:945–959.
Breiman L: Random Forests. Machine Learning 2001;45:5–32.
- Wahlund S: Zusammensetzung von Populationen und Korrelationserscheinungen vom Standpunkt der Vererbungslehre aus betrachtet. Hereditas 1928;11:65–106.
Jorgenson E, Liu X, Witte JS: Case-control analyses: Geneopardy! Genet Epidemiol 2005;29(suppl 1):S86–S90.
- Wacholder S, Rothman N, Caporaso N: Counterpoint: bias from population stratification is not a major threat to the validity of conclusions from epidemiological studies of common polymorphisms and cancer. Cancer Epidemiol Biomarkers Prev 2002;11:513–520.
- Wacholder S, Rothman N, Caporaso N: Population stratification in epidemiologic studies of common genetic variants and cancer: quantification of bias. J Natl Cancer Inst 2000;92:1151–1158.
- Thomas DC, Witte JS: Point: population stratification: A problem for case-control studies of candidate-gene associations? Cancer Epidemiol Biomarkers Prev 2002;11:505–512.
- Marchini J, Cardon LR, Phillips MS, Donnelly P: The effects of human population structure on large genetic association studies. Nat Genet 2004;36:512–517.
- Devlin B, Bacanu SA, Roeder K: Genomic Control to the extreme. Nat Genet 2004;36:1129–1130.
- Devlin B, Roeder K, Bacanu SA: Unbiased methods for population-based association studies. Genet Epidemiol 2001;21:273–284.
- Bacanu SA, Devlin B, Roeder K: The power of genomic control. Am J Hum Genet 2000;66:1933–1944.
- Helgason A, Yngvadottir B, Hrafnkelsson B, Gulcher J, Stefansson K: An Icelandic example of the impact of population structure on association studies. Nat Genet 2005;37:90–95.
- Campbell CD, Ogburn EL, Lunetta KL, Lyon HN, Freedman ML, Groop LC, Altshuler D, Ardlie KG, Hirschhorn JN: Demonstrating stratification in a European American population. Nat Genet 2005;37:868–872.
- Pritchard JK: Deconstructing maize population structure. Nat Genet 2001;28:203–204.
- Romualdi C, Balding D, Nasidze IS, Risch G, Robichaux M, Sherry ST, Stoneking M, Batzer MA, Barbujani G: Patterns of human diversity, within and among continents, inferred from biallelic DNA polymorphisms. Genome Res 2002;12:602–612.
- Pritchard JK, Rosenberg NA: Use of unlinked genetic markers to detect population stratification in association studies. Am J Hum Genet 1999;65:220–228.
- Bamshad MJ, Wooding S, Watkins WS, Ostler CT, Batzer MA, Jorde LB: Human population genetic structure and inference of group membership. Am J Hum Genet 2003;72:578–589.
- Hao K, Li C, Rosenow C, Wong WH: Detect and adjust for population stratification in population-based association study using genomic control markers: an application of Affymetrix Genechip Human Mapping 10K array. Eur J Hum Genet 2004;12:1001–1006.
- Yang BZ, Zhao H, Kranzler HR, Gelernter J: Practical population group assignment with selected informative markers: characteristics and properties of Bayesian clustering via STRUCTURE. Genet Epidemiol 2005;28:302–312.
- Maraganore DM, de AM, Lesnick TG, Strain KJ, Farrer MJ, Rocca WA, Pant PV, Frazer KA, Cox DR, Ballinger DG: High-resolution whole-genome association study of Parkinson disease. Am J Hum Genet 2005;77:685–693.
Rosenberg NA, Mahajan S, Ramachandran S, Zhao C, Pritchard JK, Feldman MW: Clines, Clusters, and the Effect of Study Design on the Inference of Human Population Structure. PLoS Genet 2005;1:e70.
- Holle R, Happich M, Lowel H, Wichmann HE, OKSG: KORA – a research platform for population based health research. Gesundheitswesen 2005;67:19–25.
- Wichmann HE, Gieger C, Illig T, otKSG: KORA-gen – Resource for population genetics, controls and a broad spectrum of disease phenotypes. KORA-gen – Ressource für Bevölkerungsgenetik, Kontrolle und ein breites Spektrum an Krankheitsphänotypen. Gesundheitswesen 2005;67:26–30.
- Krawczak M, Nikolaus S, von Eberstein H, Croucher PJ, El Mokhtari NE, Schreiber S: PopGen: Population-based recruitment of patients and controls for the analysis of complex genotype-phenotype relationships. Community Genet 2006;9:55-61.
John U, Greiner B, Hensel E, Ludemann J, Piek M, Sauer S, Adam C, Born G, Alte D, Greiser E, Haertel U, Hense HW, Haerting J, Willich S, Kessler C: Study of Health In Pomerania (SHIP): A health examination survey in an east German region: Objectives and design. Soz Präventivmed 2001;46:186–194.
- Luedemann J, Schminke U, Berger K, Piek M, Willich SN, Doring A, John U, Kessler C: Association between behavior-dependent cardiovascular risk factors and asymptomatic carotid atherosclerosis in a general population. Stroke 2002;33:2929–2935.
- Abecasis GR, Cherny SS, Cookson WO, Cardon LR: GRR: Graphical representation of relationship errors. Bioinformatics 2001;17:742–743.
- Wright S: Coefficients of inbreeding and relationship. Am Nat 1922;56:330–338.
- Wright S: Systems of mating. I. The biometric relations between parent and offspring. Genetics 1921;6:111–178.
- Wright S: The genetical structure of populations. Annals of Eugenics 1951;15:323–354.
- Goudet J: Fstat version 1.2: A computer program to calculate F-statistics. J Hered 1995;86:485–486.
- Liu K, Muse SV: PowerMarker: an integrated analysis environment for genetic marker analysis. Bioinformatics 2005;21:2128–2129.
Weir BS: Genetic Data Analysis II. Sunderland, Sinauer Associates, 1996.
- Clopper CJ, Pearson E: The use of confidence or fiducial limits illustrated in the case of binomial. Biometrica 1934;26:404–413.
- Barbujani G, Magagni A, Minch E, Cavalli-Sforza LL: An apportionment of human DNA diversity. Proc Natl Acad Sci USA 1997;94:4516–4519.
- Jorde LB, Watkins WS, Bamshad MJ, Dixon ME, Ricker CE, Seielstad MT, Batzer MA: The distribution of human genetic diversity: A comparison of mitochondrial, autosomal, and Y-chromosome data. Am J Hum Genet 2000;66:979–988.
- Edwards AW: Human genetic diversity: Lewontin’s fallacy. Bioessays 2003;25:798–801.
Cavalli-Sforza LL, Menozzi P, Piazza A: The History and Geography of Human Genes. Princeton University Press, 1994.
- Morton NE: The future of genetic epidemiology. Ann Med 1992;24:557–562.
- Chakraborty R, Jin L: A unified approach to study hypervariable polymorphisms: Statistical considerations of determining relatedness and population distances. EXS 1993;67:153–175.
- Köhler K, Bickeböller H: Case-Control Association Tests Correcting for Population Stratification. Ann Hum Genet 2006;70:98–115.
Voight BF, Pritchard JK: Confounding from cryptic relatedness in case-control association studies. PLoS Genet 2005;1:e32.
- Freedman ML, Reich D, Penney KL, McDonald GJ, Mignault AA, Patterson N, Gabriel SB, Topol EJ, Smoller JW, Pato CN, Pato MT, Petryshen TL, Kolonel LN, Lander ES, Sklar P, Henderson B, Hirschhorn: Assessing the impact of population stratification on genetic association studies. Nat Genet 2004;36:388–393.
Article / Publication Details
Copyright / Drug Dosage / DisclaimerCopyright: All rights reserved. No part of this publication may be translated into other languages, reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying, recording, microcopying, or by any information storage and retrieval system, without permission in writing from the publisher.
Drug Dosage: The authors and the publisher have exerted every effort to ensure that drug selection and dosage set forth in this text are in accord with current recommendations and practice at the time of publication. However, in view of ongoing research, changes in government regulations, and the constant flow of information relating to drug therapy and drug reactions, the reader is urged to check the package insert for each drug for any changes in indications and dosage and for added warnings and precautions. This is particularly important when the recommended agent is a new and/or infrequently employed drug.
Disclaimer: The statements, opinions and data contained in this publication are solely those of the individual authors and contributors and not of the publishers and the editor(s). The appearance of advertisements or/and product references in the publication is not a warranty, endorsement, or approval of the products or services advertised or of their effectiveness, quality or safety. The publisher and the editor(s) disclaim responsibility for any injury to persons or property resulting from any ideas, methods, instructions or products referred to in the content or advertisements.