IDEAS home Printed from https://ideas.repec.org/a/nat/nathum/v9y2025i3d10.1038_s41562-024-02061-w.html
   My bibliography  Save this article

The impact of self-report inaccuracy in the UK Biobank and its interplay with selective participation

Author

Listed:
  • Tabea Schoeler

    (University of Lausanne
    University College London
    Swiss Institute of Bioinformatics)

  • Jean-Baptiste Pingault

    (University College London
    King’s College London)

  • Zoltán Kutalik

    (University of Lausanne
    Swiss Institute of Bioinformatics
    University Center for Primary Care and Public Health)

Abstract

Although the use of short self-report measures is common practice in biobank initiatives, such a phenotyping strategy is inherently prone to reporting errors. To explore challenges related to self-report errors, we first derived a reporting error score in the UK Biobank (UKBB; n = 73,127), capturing inconsistent self-reporting in time-invariant phenotypes across multiple measurement occasions. We then performed genome-wide scans on the reporting error score, applied downstream analyses (linkage disequilibrium score regression and Mendelian randomization) and compared its properties to the UKBB participation propensity. Finally, we improved phenotype resolution for 24 measures and inspected the changes in genomic findings. We found that reporting error was present across all 33 assessed self-report measures, with repeatability levels as low as 47% (childhood body size). Reporting error was not independent from UKBB participation, evidenced by the negative genetic correlation between the two outcomes (rg = −0.77), their shared causes (for example, education) and the loss in self-report accuracy following participation bias correction. Across all analyses, the impact of reporting error ranged from reduced power (for example, for gene discovery) to biased estimates (for example, if present in the exposure variable) and attenuation of genome-wide quantities (for example, 21% relative attenuation in SNP heritability for childhood height). Our findings highlight that both self-report accuracy and selective participation are competing biases and sources of poor reproducibility for biobank-scale research.

Suggested Citation

  • Tabea Schoeler & Jean-Baptiste Pingault & Zoltán Kutalik, 2025. "The impact of self-report inaccuracy in the UK Biobank and its interplay with selective participation," Nature Human Behaviour, Nature, vol. 9(3), pages 584-594, March.
  • Handle: RePEc:nat:nathum:v:9:y:2025:i:3:d:10.1038_s41562-024-02061-w
    DOI: 10.1038/s41562-024-02061-w
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41562-024-02061-w
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1038/s41562-024-02061-w?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Lauren J. Beesley & Bhramar Mukherjee, 2022. "Statistical inference for association studies using electronic health records: handling both selection bias and outcome misclassification," Biometrics, The International Biometric Society, vol. 78(1), pages 214-226, March.
    2. Cawley, John & Maclean, Johanna Catherine & Hammer, Mette & Wintfeld, Neil, 2015. "Reporting error in weight and its implications for bias in economic models," Economics & Human Biology, Elsevier, vol. 19(C), pages 27-44.
    3. Donald M Lyall & Breda Cullen & Mike Allerhand & Daniel J Smith & Daniel Mackay & Jonathan Evans & Jana Anderson & Chloe Fawns-Ritchie & Andrew M McIntosh & Ian J Deary & Jill P Pell, 2016. "Cognitive Test Scores in UK Biobank: Data Reduction in 480,416 Participants and Longitudinal Stability in 20,346 Participants," PLOS ONE, Public Library of Science, vol. 11(4), pages 1-10, April.
    4. Clare Bycroft & Colin Freeman & Desislava Petkova & Gavin Band & Lloyd T. Elliott & Kevin Sharp & Allan Motyer & Damjan Vukcevic & Olivier Delaneau & Jared O’Connell & Adrian Cortes & Samantha Welsh &, 2018. "The UK Biobank resource with deep phenotyping and genomic data," Nature, Nature, vol. 562(7726), pages 203-209, October.
    5. Franco, Annie & Malhotra, Neil & Simonovits, Gabor & Zigerell, L. J., 2017. "Developing Standards for Post-Hoc Weighting in Population-Based Survey Experiments," Journal of Experimental Political Science, Cambridge University Press, vol. 4(2), pages 161-172, July.
    6. Tabea Schoeler & Doug Speed & Eleonora Porcu & Nicola Pirastu & Jean-Baptiste Pingault & Zoltán Kutalik, 2023. "Participation bias in the UK Biobank distorts genetic associations and downstream analyses," Nature Human Behaviour, Nature, vol. 7(7), pages 1216-1227, July.
    7. Ronald de Vlaming & Aysu Okbay & Cornelius A Rietveld & Magnus Johannesson & Patrik K E Magnusson & André G Uitterlinden & Frank J A van Rooij & Albert Hofman & Patrick J F Groenen & A Roy Thurik & Ph, 2017. "Meta-GWAS Accuracy and Power (MetaGAP) Calculator Shows that Hiding Heritability Is Partially Due to Imperfect Genetic Correlations across Studies," PLOS Genetics, Public Library of Science, vol. 13(1), pages 1-23, January.
    8. Abdel Abdellaoui & Karin J. H. Verweij, 2021. "Dissecting polygenic signals from genome-wide association studies on human behaviour," Nature Human Behaviour, Nature, vol. 5(6), pages 686-694, June.
    9. Andrew D. Grotzinger & Mijke Rhemtulla & Ronald Vlaming & Stuart J. Ritchie & Travis T. Mallard & W. David Hill & Hill F. Ip & Riccardo E. Marioni & Andrew M. McIntosh & Ian J. Deary & Philipp D. Koel, 2019. "Genomic structural equation modelling provides insights into the multivariate genetic architecture of complex traits," Nature Human Behaviour, Nature, vol. 3(5), pages 513-525, May.
    10. Jessica Tyrrell & Jie Zheng & Robin Beaumont & Kathryn Hinton & Tom G. Richardson & Andrew R. Wood & George Davey Smith & Timothy M. Frayling & Kate Tilling, 2021. "Genetic predictors of participation in optional components of UK Biobank," Nature Communications, Nature, vol. 12(1), pages 1-13, December.
    11. repec:plo:pone00:0013929 is not listed on IDEAS
    12. Gianmarco Mignogna & Caitlin E. Carey & Robbee Wedow & Nikolas Baya & Mattia Cordioli & Nicola Pirastu & Rino Bellocco & Kathryn Fiuza Malerbi & Michel G. Nivard & Benjamin M. Neale & Raymond K. Walte, 2023. "Patterns of item nonresponse behaviour to survey questionnaires are systematic and associated with genetic loci," Nature Human Behaviour, Nature, vol. 7(8), pages 1371-1387, August.
    13. repec:plo:pmed00:1001779 is not listed on IDEAS
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Caitlin E. Carey & Rebecca Shafee & Robbee Wedow & Amanda Elliott & Duncan S. Palmer & John Compitello & Masahiro Kanai & Liam Abbott & Patrick Schultz & Konrad J. Karczewski & Samuel C. Bryant & Caro, 2024. "Principled distillation of UK Biobank phenotype data reveals underlying structure in human variation," Nature Human Behaviour, Nature, vol. 8(8), pages 1599-1615, August.
    2. Evelina T. Akimova & Tobias Wolfram & Xuejie Ding & Felix C. Tropf & Melinda C. Mills, 2025. "Polygenic prediction of occupational status GWAS elucidates genetic and environmental interplay in intergenerational transmission, careers and health in UK Biobank," Nature Human Behaviour, Nature, vol. 9(2), pages 391-405, February.
    3. Akimova, Evelina T. & Wolfram, Tobias & Ding, Xuejie & Tropf, Felix C. & Mills, Melinda C., 2024. "Polygenic prediction of occupational status GWAS elucidates genetic and environmental interplay in intergenerational transmission, careers and health in UK Biobank," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 9(2), pages 391-405.
    4. Tabea Schoeler & Doug Speed & Eleonora Porcu & Nicola Pirastu & Jean-Baptiste Pingault & Zoltán Kutalik, 2023. "Participation bias in the UK Biobank distorts genetic associations and downstream analyses," Nature Human Behaviour, Nature, vol. 7(7), pages 1216-1227, July.
    5. Sjoerd Alten & Benjamin W. Domingue & Jessica Faul & Titus Galama & Andries T. Marees, 2025. "Correcting for volunteer bias in GWAS increases SNP effect sizes and heritability estimates," Nature Communications, Nature, vol. 16(1), pages 1-11, December.
    6. Gianmarco Mignogna & Caitlin E. Carey & Robbee Wedow & Nikolas Baya & Mattia Cordioli & Nicola Pirastu & Rino Bellocco & Kathryn Fiuza Malerbi & Michel G. Nivard & Benjamin M. Neale & Raymond K. Walte, 2023. "Patterns of item nonresponse behaviour to survey questionnaires are systematic and associated with genetic loci," Nature Human Behaviour, Nature, vol. 7(8), pages 1371-1387, August.
    7. Hans Kippersluis & Pietro Biroli & Rita Dias Pereira & Titus J. Galama & Stephanie Hinke & S. Fleur W. Meddens & Dilnoza Muslimova & Eric A. W. Slob & Ronald Vlaming & Cornelius A. Rietveld, 2023. "Overcoming attenuation bias in regressions using polygenic indices," Nature Communications, Nature, vol. 14(1), pages 1-16, December.
    8. Jonsdottir, Gudrun A. & Einarsson, Gudmundur & Thorleifsson, Gudmar & Magnusson, Sigurdur H. & Gunnarsson, Arni F. & Frigge, Michael L. & Gisladottir, Rosa S. & Unnsteinsdottir, Unnur & Gunnarsson, Bj, 2021. "Genetic propensities for verbal and spatial ability have opposite effects on body mass index and risk of schizophrenia," Intelligence, Elsevier, vol. 88(C).
    9. Gökberk Alagöz & Else Eising & Yasmina Mekki & Giacomo Bignardi & Pierre Fontanillas & Michel G. Nivard & Michelle Luciano & Nancy J. Cox & Simon E. Fisher & Reyna L. Gordon, 2025. "The shared genetic architecture and evolution of human language and musical rhythm," Nature Human Behaviour, Nature, vol. 9(2), pages 376-390, February.
    10. Daniel J. Benjamin & David Cesarini & Patrick Turley & Alexander Strudwick Young, 2024. "Social-Science Genomics: Progress, Challenges, and Future Directions," NBER Working Papers 32404, National Bureau of Economic Research, Inc.
    11. Hyeokmoon Kweon & Casper A. P. Burik & Yuchen Ning & Rafael Ahlskog & Charley Xia & Erik Abner & Yanchun Bao & Laxmi Bhatta & Tariq O. Faquih & Maud Feijter & Paul Fisher & Andrea Gelemanović & Alexan, 2025. "Associations between common genetic variants and income provide insights about the socio-economic health gradient," Nature Human Behaviour, Nature, vol. 9(4), pages 794-805, April.
    12. Clara Albiñana & Zhihong Zhu & Andrew J. Schork & Andrés Ingason & Hugues Aschard & Isabell Brikell & Cynthia M. Bulik & Liselotte V. Petersen & Esben Agerbo & Jakob Grove & Merete Nordentoft & David , 2023. "Multi-PGS enhances polygenic prediction by combining 937 polygenic scores," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
    13. Marie C. Sadler & Alexander Apostolov & Caterina Cevallos & Chiara Auwerx & Diogo M. Ribeiro & Russ B. Altman & Zoltán Kutalik, 2025. "Leveraging large-scale biobank EHRs to enhance pharmacogenetics of cardiometabolic disease medications," Nature Communications, Nature, vol. 16(1), pages 1-18, December.
    14. Dixon, Padraig & Hollingworth, William & Harrison, Sean & Davies, Neil M. & Davey Smith, George, 2020. "Mendelian Randomization analysis of the causal effect of adiposity on hospital costs," Journal of Health Economics, Elsevier, vol. 70(C).
    15. Jordi Manuello & Joosung Min & Paul McCarthy & Fidel Alfaro-Almagro & Soojin Lee & Stephen Smith & Lloyd T. Elliott & Anderson M. Winkler & Gwenaëlle Douaud, 2024. "The effects of genetic and modifiable risk factors on brain regions vulnerable to ageing and disease," Nature Communications, Nature, vol. 15(1), pages 1-11, December.
    16. Siquan Zhou & Yujie Xu & Jingyuan Xiong & Guo Cheng, 2025. "Cross-trait multivariate GWAS confirms health implications of pubertal timing," Nature Communications, Nature, vol. 16(1), pages 1-14, December.
    17. Mattia Marchi & Anne Alkema & Charley Xia & Chris H. L. Thio & Li-Yu Chen & Winni Schalkwijk & Gian M. Galeazzi & Silvia Ferrari & Luca Pingani & Hyeokmoon Kweon & Sara Evans-Lacko & W. David Hill & M, 2024. "Investigating the impact of poverty on mental illness in the UK Biobank using Mendelian randomization," Nature Human Behaviour, Nature, vol. 8(9), pages 1771-1783, September.
    18. Mingyang Li & Xixi Dang & Yiwei Chen & Zhifan Chen & Xinyi Xu & Zhiyong Zhao & Dan Wu, 2024. "Cognitive processing speed and accuracy are intrinsically different in genetic architecture and brain phenotypes," Nature Communications, Nature, vol. 15(1), pages 1-11, December.
    19. Abdellaoui, Abdel & Martin, Hilary C. & Rutherford, Adam & Kolk, Martin & Muthukrishna, Michael & Tropf, Felix & Mills, Melinda C. & Zietsch, Brendan & Verweij, Karin J.H. & Visscher, Peter M., 2025. "Socio-economic status is a social construct with heritable components and genetic consequences: a social construct with heritable components and genetic consequences," LSE Research Online Documents on Economics 127662, London School of Economics and Political Science, LSE Library.
    20. Matteo Di Scipio & Mohammad Khan & Shihong Mao & Michael Chong & Conor Judge & Nazia Pathan & Nicolas Perrot & Walter Nelson & Ricky Lali & Shuang Di & Robert Morton & Jeremy Petch & Guillaume Paré, 2023. "A versatile, fast and unbiased method for estimation of gene-by-environment interaction effects on biobank-scale datasets," Nature Communications, Nature, vol. 14(1), pages 1-15, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:nathum:v:9:y:2025:i:3:d:10.1038_s41562-024-02061-w. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.