IDEAS home Printed from https://ideas.repec.org/a/plo/pgen00/1000481.html
   My bibliography  Save this article

Discovery of Rare Variants via Sequencing: Implications for the Design of Complex Trait Association Studies

Author

Listed:
  • Bingshan Li
  • Suzanne M Leal

Abstract

There is strong evidence that rare variants are involved in complex disease etiology. The first step in implicating rare variants in disease etiology is their identification through sequencing in both randomly ascertained samples (e.g., the 1,000 Genomes Project) and samples ascertained according to disease status. We investigated to what extent rare variants will be observed across the genome and in candidate genes in randomly ascertained samples, the magnitude of variant enrichment in diseased individuals, and biases that can occur due to how variants are discovered. Although sequencing cases can enrich for casual variants, when a gene or genes are not involved in disease etiology, limiting variant discovery to cases can lead to association studies with dramatically inflated false positive rates.Author Summary: One focus of human genetics is localizing genes that are involved in the etiology of complex diseases. Although emphasis has been placed on mapping common variants, recent studies have demonstrated that rare variants also play an important role in complex trait etiology and their identification should have a greater impact on risk assessment, disease prevention, and treatment due to their large genetic effects. Genome-wide association studies are used to identify common variants by genotyping tagSNPs that are proxies for common causal variants. This study design is not adequately powered for association studies of rare variants; instead, causal variants must be identified and then analyzed. With the development of sequencing technologies, it is feasible to sequence candidate genes and, soon, entire genomes to obtain data on rare variants for complex disease association studies. We investigated several questions that are germane to the discovery of rare variants within a sample; for example, proportion of variants discovered within a random sample and enrichment of causal variants within samples of cases compared to a random sample. We also demonstrate that when an excess of cases are sequenced to discover variants and the remaining samples are genotyped, this design strategy can lead to inflated false positive rates.

Suggested Citation

  • Bingshan Li & Suzanne M Leal, 2009. "Discovery of Rare Variants via Sequencing: Implications for the Design of Complex Trait Association Studies," PLOS Genetics, Public Library of Science, vol. 5(5), pages 1-9, May.
  • Handle: RePEc:plo:pgen00:1000481
    DOI: 10.1371/journal.pgen.1000481
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1000481
    Download Restriction: no

    File URL: https://journals.plos.org/plosgenetics/article/file?id=10.1371/journal.pgen.1000481&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pgen.1000481?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pgen00:1000481. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosgenetics (email available below). General contact details of provider: https://journals.plos.org/plosgenetics/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.