Login to MyKarger

New to MyKarger? Click here to sign up.

Login with Facebook

Forgot your password?

Authors, Editors, Reviewers

For Manuscript Submission, Check or Review Login please go to Submission Websites List.

Submission Websites List

Institutional Login
(Shibboleth or Open Athens)

For the academic login, please select your country in the dropdown list. You will be redirected to verify your credentials.

Original Paper

Free Access

Two-Stage Extreme Phenotype Sequencing Design for Discovering and Testing Common and Rare Genetic Variants: Efficiency and Power

Kang G.a, c · Lin D.a · Hakonarson H.b · Chen J.a

Author affiliations

aDepartment of Biostatistics and Epidemiology, University of Pennsylvania, and bCenter for Applied Genomics, The Joseph Stokes Jr. Research Institute, The Children’s Hospital of Philadelphia, Philadelphia, Pa., and cDepartment of Biostatistics, St. Jude Children’s Research Hospital, Memphis, Tenn., USA

Corresponding Author

Jinbo Chen, PhD

Department of Biostatistics and Epidemiology

University of Pennsylvania Perelman School of Medicine

Philadelphia, PA 19104 (USA)

Tel. +1 215 746 3915, E-Mail jinboche@mail.med.upenn.edu

Related Articles for ""

Hum Hered 2012;73:139–147

Do you have an account?

Login Information

Contact Information

I have read the Karger Terms and Conditions and agree.


Next-generation sequencing technology provides an unprecedented opportunity to identify rare susceptibility variants. It is not yet financially feasible to perform whole-genome sequencing on a large number of subjects, and a two-stage design has been advocated to be a practical option. In stage I, variants are discovered by sequencing the whole genomes of a small number of carefully selected individuals. In stage II, the discovered variants of a large number of individuals are genotyped to assess associations. Individuals with extreme phenotypes are typically selected in stage I. Using simulated data for unrelated individuals, we explore two important aspects of this two-stage design: the efficiency of discovering common and rare single-nucleotide polymorphisms (SNPs) in stage I and the impact of incomplete SNP discovery in stage I on the power of testing associations in stage II. We applied a sum test and a sum of squared score test for gene-based association analyses evaluating the power of the two-stage design. We obtained the following results from extensive simulation studies and analysis of the GAW17 dataset. When individuals with trait values more extreme than the 99.7–99th quantile were included in stage I, the two-stage design could achieve the same power as or even higher than the one-stage design if the rare causal variants had large effect sizes. In such design, fewer than half of the total SNPs including more than half of the causal SNPs were discovered, which included nearly all SNPs with minor allele frequencies (MAFs) ≥5%, more than half of the SNPs with MAFs between 1% and 5%, and fewer than half of the SNPs with MAFs <1%. Although a one-stage design may be preferable to identify multiple rare variants having small to moderate effect sizes, our observations support using the two-stage design as a cost-effective option for next-generation sequencing studies.

© 2012 S. Karger AG, Basel

Article / Publication Details

First-Page Preview
Abstract of Original Paper

Received: July 05, 2011
Accepted: February 10, 2012
Published online: June 07, 2012
Issue release date: July 2012

Number of Print Pages: 9
Number of Figures: 4
Number of Tables: 1

ISSN: 0001-5652 (Print)
eISSN: 1423-0062 (Online)

For additional information: http://www.karger.com/HHE

Copyright / Drug Dosage / Disclaimer

Copyright: All rights reserved. No part of this publication may be translated into other languages, reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying, recording, microcopying, or by any information storage and retrieval system, without permission in writing from the publisher.
Drug Dosage: The authors and the publisher have exerted every effort to ensure that drug selection and dosage set forth in this text are in accord with current recommendations and practice at the time of publication. However, in view of ongoing research, changes in government regulations, and the constant flow of information relating to drug therapy and drug reactions, the reader is urged to check the package insert for each drug for any changes in indications and dosage and for added warnings and precautions. This is particularly important when the recommended agent is a new and/or infrequently employed drug.
Disclaimer: The statements, opinions and data contained in this publication are solely those of the individual authors and contributors and not of the publishers and the editor(s). The appearance of advertisements or/and product references in the publication is not a warranty, endorsement, or approval of the products or services advertised or of their effectiveness, quality or safety. The publisher and the editor(s) disclaim responsibility for any injury to persons or property resulting from any ideas, methods, instructions or products referred to in the content or advertisements.