IDEAS home Printed from https://ideas.repec.org/a/spr/stpapr/v65y2024i2d10.1007_s00362-023-01405-4.html

A method of correction for heaping error in the variables using validation data

Author

Listed:
  • Amar S. Ahmad

    (New York University)

  • Munther Al-Hassan

    (Dubai Men’s College)

  • Hamid Y. Hussain

    (Dubai Health Authority)

  • Nirmin F. Juber

    (New York University)

  • Fred N. Kiwanuka

    (Dubai Men’s College)

  • Mohammed Hag-Ali

    (Higher Colleges of Technology)

  • Raghib Ali

    (New York University)

Abstract

When self-reported data are used to estimate the mean and variance of a distribution, or the parameters of a regression, the estimates are often biased. This is because interviewees tend to heap their answers at certain values. The aim of the paper is to examine the bias-inducing effect of heaping error in self-reported data, and to study the effect of the heaping error on the mean and variance of a distribution as well as on the regression parameters. As a result, a new method is introduced that uses validation data to correct the bias caused by heaping error. Using publicly available data and simulation studies, the paper shows that the newly developed method is practical and can easily be applied to correct the bias in the estimated mean and variance, as well as in the estimated regression parameters computed from self-reported data. Hence, the method of correction presented in this paper allows researchers to draw accurate conclusions leading to the right decisions, e.g. regarding health care planning and delivery.
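The bias described in the abstract can be illustrated with a small simulation. The sketch below is not the authors' method: it assumes a hypothetical heaping mechanism (respondents round down to a multiple of 5 with probability 0.7), normally distributed true values, and a simple mean-shift correction estimated from a validation subsample in which both the true and the reported value are observed. All function names and parameter values are illustrative assumptions.

```python
import math
import random
import statistics


def heap_down(value, step, prob, rng):
    """Assumed heaping rule: with probability `prob`, report `value`
    rounded down to a multiple of `step`; otherwise report it exactly."""
    if rng.random() < prob:
        return step * math.floor(value / step)
    return value


def simulate(n=20000, n_val=2000, step=5, prob=0.7, seed=1):
    rng = random.Random(seed)
    # Hypothetical true values (e.g. body weight in kg).
    true = [rng.gauss(70.0, 12.0) for _ in range(n)]
    reported = [heap_down(x, step, prob, rng) for x in true]

    # Validation subsample: true and reported values are both known.
    val_errors = [r - t for r, t in zip(reported[:n_val], true[:n_val])]
    mean_error = statistics.fmean(val_errors)

    # Main sample: only the reported values would be observed in practice.
    main_true = true[n_val:]
    main_reported = reported[n_val:]
    naive_mean = statistics.fmean(main_reported)

    return {
        "true_mean": statistics.fmean(main_true),
        "naive_mean": naive_mean,
        # Generic mean-shift correction, not the paper's estimator.
        "corrected_mean": naive_mean - mean_error,
        "true_var": statistics.variance(main_true),
        "naive_var": statistics.variance(main_reported),
    }


if __name__ == "__main__":
    for name, value in simulate().items():
        print(f"{name}: {value:.3f}")
```

Under these assumptions the naive mean of the reported values falls below the true mean, the variance of the reported values differs from the true variance, and subtracting the average heaping error measured in the validation subsample moves the mean estimate back toward the truth. The paper's own correction also covers the variance and the regression parameters, which this sketch does not attempt.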

Suggested Citation

  • Amar S. Ahmad & Munther Al-Hassan & Hamid Y. Hussain & Nirmin F. Juber & Fred N. Kiwanuka & Mohammed Hag-Ali & Raghib Ali, 2024. "A method of correction for heaping error in the variables using validation data," Statistical Papers, Springer, vol. 65(2), pages 687-704, April.
  • Handle: RePEc:spr:stpapr:v:65:y:2024:i:2:d:10.1007_s00362-023-01405-4
    DOI: 10.1007/s00362-023-01405-4

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00362-023-01405-4
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.



    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Arulampalam, Wiji & Corradi, Valentina & Gutknecht, Daniel, 2017. "Modeling heaped duration data: An application to neonatal mortality," Journal of Econometrics, Elsevier, vol. 200(2), pages 363-377.
    2. Homonoff, Tatiana & Spreen, Thomas Luke & St. Clair, Travis, 2020. "Balance sheet insolvency and contribution revenue in public charities," Journal of Public Economics, Elsevier, vol. 186(C).
    3. Luis R. Martinez & Jonas Jessen & Guo Xu, 2023. "A Glimpse of Freedom: Allied Occupation and Political Resistance in East Germany," American Economic Journal: Applied Economics, American Economic Association, vol. 15(1), pages 68-106, January.
    4. Hope Corman & Dhaval Dave & Nancy E. Reichman, 2018. "Evolution of the Infant Health Production Function," Southern Economic Journal, John Wiley & Sons, vol. 85(1), pages 6-47, July.
    5. David Madden, 2002. "Do Tobacco Taxes Influence Starting and Quitting Smoking? A Discrete Choice Approach Using Evidence from a Sample of Irish Women," Working Papers 200205, School of Economics, University College Dublin.
    6. Kim, Jinyoung & Kim, Seonghoon & Koh, Kanghyock, 2022. "Labor market institutions and the incidence of payroll taxation," Journal of Public Economics, Elsevier, vol. 209(C).
    7. Oskar Skans & Linus Liljeberg, 2014. "The wage effects of subsidized career breaks," Empirical Economics, Springer, vol. 47(2), pages 593-617, September.
    8. Jorma J. Schäublin, 2022. "Swiss pension funds: funding ratio, discount rate, and asset allocation," Swiss Journal of Economics and Statistics, Springer;Swiss Society of Economics and Statistics, vol. 158(1), pages 1-23, December.
    9. Luc Behaghel & Maria Florencia Pinto, 2024. "Extended maternity leave and children's long‐term development," Scandinavian Journal of Economics, Wiley Blackwell, vol. 126(2), pages 224-253, April.
    10. Naci Akdemir & Serkan Yenal, 2021. "How Phishers Exploit the Coronavirus Pandemic: A Content Analysis of COVID-19 Themed Phishing Emails," SAGE Open, vol. 11(3), pages 21582440211, July.
    11. Erich Battistin & Raffaele Miniaci & Guglielmo Weber, 2003. "What Do We Learn from Recall Consumption Data?," Journal of Human Resources, University of Wisconsin Press, vol. 38(2).
    12. Alessio Gaggero & Joan Gil & Dolores Jiménez-Rubio & Eugenio Zucchelli, 2021. "Health information and lifestyle behaviours: the impact of a diabetes diagnosis," UB School of Economics Working Papers 2021/406, University of Barcelona School of Economics.
    13. Adam C. Sales & Ben B. Hansen, 2020. "Limitless Regression Discontinuity," Journal of Educational and Behavioral Statistics, vol. 45(2), pages 143-174, April.
    14. Drouvelis, Michalis & Marx, Benjamin M., 2022. "Can charitable appeals identify and exploit belief heterogeneity?," Journal of Economic Behavior & Organization, Elsevier, vol. 198(C), pages 631-649.
    15. Dang, Hai-Anh H. & Trinh, Trong-Anh, 2021. "Does the COVID-19 lockdown improve global air quality? New cross-national evidence on its unintended consequences," Journal of Environmental Economics and Management, Elsevier, vol. 105(C).
    16. Luciana Juvenal & Paulo Santos Monteiro, 2024. "Risky Gravity," Journal of the European Economic Association, European Economic Association, vol. 22(4), pages 1590-1627.
    17. Alagöz, Nazli, 2024. "Promotion and technological change in the music industry," Other publications TiSEM 511ceba0-62a0-4c60-a76c-f, Tilburg University, School of Economics and Management.
    18. Cammeraat, Emile & Jongen, Egbert L. W. & Koning, Pierre, 2017. "Preventing NEETs during the Great Recession: The Effects of a Mandatory Activation Program for Young Welfare Recipients," IZA Discussion Papers 11090, Institute of Labor Economics (IZA).
    19. Machin, Stephen & McNally, Sandra & Ruiz-Valenzuela, Jenifer, 2020. "Entry through the narrow door: The costs of just failing high stakes exams," Journal of Public Economics, Elsevier, vol. 190(C).
    20. Dahlberg, Matz & Mani, Kevin & Öhman, Mattias & Wanhainen, Anders, 2016. "Health Information and Well-Being: Evidence from an Asymptomatic Disease," Working Paper Series 2016:2, Uppsala University, Department of Economics.
