IDEAS home Printed from https://ideas.repec.org/a/bpj/sagmbi/v6y2007i1n4.html
   My bibliography  Save this article

Using Duplicate Genotyped Data in Genetic Analyses: Testing Association and Estimating Error Rates

Author

Listed:
  • Tintle Nathan L

    (Hope College)

  • Gordon Derek

    (Rutgers University)

  • McMahon Francis J

    (NIH)

  • Finch Stephen J

    (Stony Brook University)

Abstract

Although researchers use duplicate genotyped data to calculate an inconsistency rate, there is no power analysis to assess the value of the duplicate data. In this paper, we present a model in which the genotyping error rate is related to the inconsistency rate. We extend the g genotype by h phenotype chi-squared test to incorporate the duplicate genotyped data. When a subject is inconsistently genotyped (that is, has two observed genotypes), our procedure is to allocate 0.5 units to each of the two genotypes. We specify the multivariate analysis of variance (MANOVA) test comparing these extended counts. We provide freely available software for this test and also for a permutation test used on small samples. A simulation study shows that the asymptotic null distribution of the MANOVA test holds when the total number of subjects, N, is at least 300. We also document with a simulation study that the asymptotic distribution of this test under various alternative hypotheses is a satisfactory approximation to the simulated power. In all cases, the power of the MANOVA test using the duplicate genotyped data is greater than the power of the chi-squared test ignoring the duplicate data. Power increases ranged from 0.776% to 4.652% for 80% powered tests and 0.292% to 2.028% for 95% powered tests. Researchers now can compute the value of the duplicate genotyped data as part of the design of the study.

Suggested Citation

  • Tintle Nathan L & Gordon Derek & McMahon Francis J & Finch Stephen J, 2007. "Using Duplicate Genotyped Data in Genetic Analyses: Testing Association and Estimating Error Rates," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 6(1), pages 1-29, February.
  • Handle: RePEc:bpj:sagmbi:v:6:y:2007:i:1:n:4
    DOI: 10.2202/1544-6115.1251
    as

    Download full text from publisher

    File URL: https://doi.org/10.2202/1544-6115.1251
    Download Restriction: For access to full text, subscription to the journal or payment for the individual article is required.

    File URL: https://libkey.io/10.2202/1544-6115.1251?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Attia John & Thakkinstian Ammarin & McElduff Patrick & Milne Elizabeth & Dawson Somer & Scott Rodney J & Klerk Nicholas de & Armstrong Bruce & Thompson John, 2010. "Detecting Genotyping Error Using Measures of Degree of Hardy-Weinberg Disequilibrium," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 9(1), pages 1-17, January.
    2. Ahn Kwangmi & Gordon Derek & Finch Stephen J, 2009. "Increase of Rejection Rate in Case-Control Studies with the Differential Genotyping Error Rates," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 8(1), pages 1-9, May.
    3. Airat Bekmetjev & Dirk VanBruggen & Brian McLellan & Benjamin DeWinkle & Eric Lunderberg & Nathan Tintle, 2012. "The Cost-Effectiveness of Reclassification Sampling for Prevalence Estimation," PLOS ONE, Public Library of Science, vol. 7(2), pages 1-6, February.
    4. Borchers Bryce & Brown Marshall & McLellan Brian & Bekmetjev Airat & Tintle Nathan L, 2009. "Incorporating Duplicate Genotype Data into Linear Trend Tests of Genetic Association: Methods and Cost-Effectiveness," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 8(1), pages 1-20, May.
    5. Fridley, Brooke L. & Turner, Stephen T. & Chapman, Arlene B. & Rodin, Andrei S. & Boerwinkle, Eric & Bailey, Kent R., 2008. "Reproducibility of genotypes as measured by the affymetrix GeneChip® 100K Human Mapping Array set," Computational Statistics & Data Analysis, Elsevier, vol. 52(12), pages 5367-5374, August.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bpj:sagmbi:v:6:y:2007:i:1:n:4. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.degruyter.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.