Power and Sample Size Calculations for Genetic Case/Control Studies Using Gene-Centric SNP Maps: Application to Human Chromosomes 6, 21, and 22 in Three Populationsde La Vega F.M. · Gordon D. · Su X. · Scafe C. · Isaac H. · Gilbert D.A. · Spier E.G.
aApplied Biosystems, Foster City, Calif., and bLaboratory of Statistical Genetics, Rockefeller University, New York, N.Y., USA
Power and sample size calculations are critical parts of any research design for genetic association. We present a method that utilizes haplotype frequency information and average marker-marker linkage disequilibrium on SNPs typed in and around all genes on a chromosome. The test statistic used is the classic likelihood ratio test applied to haplotypes in case/control populations. Haplotype frequencies are computed through specification of genetic model parameters. Power is determined by computation of the test’s non-centrality parameter. Power per gene is computed as a weighted average of the power assuming each haplotype is associated with the trait. We apply our method to genotype data from dense SNP maps across three entire chromosomes (6, 21, and 22) for three different human populations (African-American, Caucasian, Chinese), three different models of disease (additive, dominant, and multiplicative) and two trait allele frequencies (rare, common). We perform a regression analysis using these factors, average marker-marker disequilibrium, and the haplotype diversity across the gene region to determine which factors most significantly affect average power for a gene in our data. Also, as a ‘proof of principle’ calculation, we perform power and sample size calculations for all genes within 100 kb of the PSORS1 locus (chromosome 6) for a previously published association study of psoriasis. Results of our regression analysis indicate that four highly significant factors that determine average power to detect association are: disease model, average marker-marker disequilibrium, haplotype diversity, and the trait allele frequency. These findings may have important implications for the design of well-powered candidate gene association studies. Our power and sample size calculations for the PSORS1 gene appear consistent with published findings, namely that there is substantial power (>0.99) for most genes within 100 kb of the PSORS1 locus at the 0.01 significance level.
Francisco M. De La Vega, PhD
850 Lincoln Centre Dr.
Foster City, CA 94404 (USA)
Tel. +1 650 638 6989, Fax +1 650 554 2577, E-Mail email@example.com
Received: November 29, 2004
Accepted after revision: July 12, 2005
Published online: September 2, 2005
Number of Print Pages : 18
Number of Figures : 6, Number of Tables : 4, Number of References : 76
Human Heredity (International Journal of Human and Medical Genetics)
Vol. 60, No. 1, Year 2005 (Cover Date: 2005)
Journal Editor: Devoto, M. (Wilmington, Del.)
ISSN: 0001–5652 (print), 1423–0062 (Online)
For additional information: http://www.karger.com/hhe