IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v52y2008i12p5367-5374.html
   My bibliography  Save this article

Reproducibility of genotypes as measured by the affymetrix GeneChip® 100K Human Mapping Array set

Author

Listed:
  • Fridley, Brooke L.
  • Turner, Stephen T.
  • Chapman, Arlene B.
  • Rodin, Andrei S.
  • Boerwinkle, Eric
  • Bailey, Kent R.

Abstract

Genotyping errors that are undetected in genome-wide association studies using single nucleotide polymorphisms (SNPs) may degrade the likelihood of detecting true positive associations. To estimate the frequency of genotyping errors and assess the reproducibility of genotype calls, we analyzed two sets of duplicate data, one dataset containing twenty blind duplicates and another dataset containing twenty-eight nonrandom duplicates, from a genome-wide association study using Affymetrix GeneChip®100 K Human Mapping Arrays. For the twenty blind duplicates the overall agreement in genotyping calls as measured with the Kappa statistics, was 0.997, with a discordancy rate of 0.27%. For the twenty-eight nonrandom duplicates, the overall agreement was lower, 0.95, with a higher discordancy rate of 4.53%. The accuracy and probability of concordancy were inversely related to the genotyping uncertainty score, i.e., as the genotyping uncertainty score increased, the concordancy and probability of concordant calls decreased. Lowering of the uncertainty score threshold for rejection of genotype calls from the Affymetrix recommended value of 0.25 to 0.20 resulted in an increased predicted accuracy from 92.6% to 95% with a slight increase in the "No Call" rate from 1.81% to 2.33%. Hence, we suggest using a lower uncertainty score threshold, say 0.20, which will result in higher accuracy in calls at a modest decrease in the call rate.

Suggested Citation

  • Fridley, Brooke L. & Turner, Stephen T. & Chapman, Arlene B. & Rodin, Andrei S. & Boerwinkle, Eric & Bailey, Kent R., 2008. "Reproducibility of genotypes as measured by the affymetrix GeneChip® 100K Human Mapping Array set," Computational Statistics & Data Analysis, Elsevier, vol. 52(12), pages 5367-5374, August.
  • Handle: RePEc:eee:csdana:v:52:y:2008:i:12:p:5367-5374
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167-9473(08)00281-8
    Download Restriction: Full text for ScienceDirect subscribers only.
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Tintle Nathan L & Gordon Derek & McMahon Francis J & Finch Stephen J, 2007. "Using Duplicate Genotyped Data in Genetic Analyses: Testing Association and Estimating Error Rates," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 6(1), pages 1-29, February.
    2. Gordon Derek & Yang Yaning & Haynes Chad & Finch Stephen J & Mendell Nancy R & Brown Abraham M & Haroutunian Vahram, 2004. "Increasing Power for Tests of Genetic Association in the Presence of Phenotype and/or Genotype Error by Use of Double-Sampling," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 3(1), pages 1-35, October.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Allison, David B. & Visscher, Peter M. & Rosa, Guilherme J.M. & Amos, Christopher I., 2009. "Statistical genetics & statistical genomics: Where biology, epistemology, statistics, and computation collide," Computational Statistics & Data Analysis, Elsevier, vol. 53(5), pages 1531-1534, March.
    2. Borchers Bryce & Brown Marshall & McLellan Brian & Bekmetjev Airat & Tintle Nathan L, 2009. "Incorporating Duplicate Genotype Data into Linear Trend Tests of Genetic Association: Methods and Cost-Effectiveness," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 8(1), pages 1-20, May.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Borchers Bryce & Brown Marshall & McLellan Brian & Bekmetjev Airat & Tintle Nathan L, 2009. "Incorporating Duplicate Genotype Data into Linear Trend Tests of Genetic Association: Methods and Cost-Effectiveness," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 8(1), pages 1-20, May.
    2. Attia John & Thakkinstian Ammarin & McElduff Patrick & Milne Elizabeth & Dawson Somer & Scott Rodney J & Klerk Nicholas de & Armstrong Bruce & Thompson John, 2010. "Detecting Genotyping Error Using Measures of Degree of Hardy-Weinberg Disequilibrium," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 9(1), pages 1-17, January.
    3. Airat Bekmetjev & Dirk VanBruggen & Brian McLellan & Benjamin DeWinkle & Eric Lunderberg & Nathan Tintle, 2012. "The Cost-Effectiveness of Reclassification Sampling for Prevalence Estimation," PLOS ONE, Public Library of Science, vol. 7(2), pages 1-6, February.
    4. Ahn Kwangmi & Gordon Derek & Finch Stephen J, 2009. "Increase of Rejection Rate in Case-Control Studies with the Differential Genotyping Error Rates," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 8(1), pages 1-9, May.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:52:y:2008:i:12:p:5367-5374. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.