IDEAS home Printed from https://ideas.repec.org/a/uwp/jhriss/v44y2009i1p1-24.html

Detecting Problems in Survey Data Using Benford’s Law

Author

Listed:
  • George Judge
  • Laura Schechter

Abstract

"It is 15:00 in Nairobi. Do you know where your enumerators are??" Good quality data is paramount for applied economic research. If the data are distorted, corresponding conclusions may be incorrect. We demonstrate how Benford’s law, the distribution that first digits of numbers in certain data sets should follow, can be used to test for data abnormalities. We conduct an analysis of nine commonly used data sets and find that much data from developing countries is of poor quality while data from the United States seems to be of better quality. Female and male respondents give data of similar quality.

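As an illustration of the idea in the abstract, here is a minimal sketch of a first-digit check against Benford's law, under which the first significant digit d appears with probability log10(1 + 1/d). The abstract does not say which test statistic the authors use, so the Pearson chi-square goodness-of-fit statistic below is an assumption, and the names `first_digit` and `benford_chi_square` are hypothetical.

```python
import math
from collections import Counter

# Benford's law: P(d) = log10(1 + 1/d) for first significant digit d = 1..9.
BENFORD = {d: math.log10(1 + 1 / d) for d in range(1, 10)}

# 5% critical value of the chi-square distribution with 8 degrees of freedom
# (9 digit categories minus 1).
CHI2_CRIT_5PCT_DF8 = 15.507


def first_digit(x):
    """Return the first significant digit of a nonzero finite number."""
    x = abs(float(x))
    if x == 0 or math.isnan(x) or math.isinf(x):
        raise ValueError("first digit is undefined for 0, NaN, or infinity")
    # Scientific notation places the first significant digit first.
    # 17 decimals avoids rounding a value such as 9.9999999 up to 1.0e+01.
    return int(f"{x:.17e}"[0])


def benford_chi_square(values):
    """Pearson chi-square statistic of first-digit counts against Benford.

    Zeros are skipped because they have no first significant digit.
    This is an illustrative choice of statistic, not necessarily the
    one used in the paper.
    """
    digits = [first_digit(v) for v in values if v != 0]
    n = len(digits)
    if n == 0:
        raise ValueError("need at least one nonzero value")
    counts = Counter(digits)
    return sum(
        (counts.get(d, 0) - n * p) ** 2 / (n * p)
        for d, p in BENFORD.items()
    )


if __name__ == "__main__":
    # Powers of 2 are a standard example of a Benford-like sequence.
    powers = [2.0 ** k for k in range(1, 1001)]
    # Equal counts of each digit deviate strongly from Benford's law.
    uniform = [d for d in range(1, 10) for _ in range(100)]
    for name, data in [("powers of 2", powers), ("uniform digits", uniform)]:
        stat = benford_chi_square(data)
        flag = "reject" if stat > CHI2_CRIT_5PCT_DF8 else "do not reject"
        print(f"{name}: chi-square = {stat:.2f} ({flag} at the 5% level)")
```

A statistic above the critical value suggests that the first digits depart from Benford's law. On its own this only flags a variable for closer inspection, since not every data set is expected to follow the law.
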
Suggested Citation

  • George Judge & Laura Schechter, 2009. "Detecting Problems in Survey Data Using Benford’s Law," Journal of Human Resources, University of Wisconsin Press, vol. 44(1).
  • Handle: RePEc:uwp:jhriss:v:44:y:2009:i1:p1-24

    Download full text from publisher

    File URL: http://jhr.uwpress.org/cgi/reprint/44/1/1
    Download Restriction: A subscription is required to access PDF files. Pay-per-article access is available.


    References listed on IDEAS

    1. Nye John & Moul Charles, 2007. "The Political Economy of Numbers: On the Application of Benford's Law to International Macroeconomic Statistics," The B.E. Journal of Macroeconomics, De Gruyter, vol. 7(1), pages 1-14, July.
    2. Paul Glewwe & Hai-Anh Hoang Dang, 2008. "The Impact of Decentralized Data Entry on the Quality of Household Survey Data in Developing Countries: Evidence from a Randomized Experiment in Vietnam," World Bank Economic Review, World Bank Group, vol. 22(1), pages 165-185, January.
    3. Philipson, Tomas & Malani, Anup, 1999. "Measurement errors: A principal investigator-agent approach," Journal of Econometrics, Elsevier, vol. 91(2), pages 273-298, August.
    4. Morrow, John, 2014. "Benford's Law, families of distributions and a test basis," LSE Research Online Documents on Economics 60364, London School of Economics and Political Science, LSE Library.
    5. Grendar, Marian & Judge, George & Schechter, Laura, 2007. "An empirical non-parametric likelihood family of data-based Benford-like distributions," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 380(C), pages 429-438.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Dang, Canh Thien & Owens, Trudy, 2020. "Does transparency come at the cost of charitable services? Evidence from investigating British charities," Journal of Economic Behavior & Organization, Elsevier, vol. 172(C), pages 314-343.
    2. Holz, Carsten A., 2014. "The quality of China's GDP statistics," China Economic Review, Elsevier, vol. 30(C), pages 309-338.
    3. Hansen, Bradley A. & Hansen, Mary Eschelbach, 2016. "The historian's craft and economics," Journal of Institutional Economics, Cambridge University Press, vol. 12(2), pages 349-370, June.
    4. Lee, Joanne & Judge, George G, 2008. "Identifying falsified clinical data," Department of Agricultural & Resource Economics, UC Berkeley, Working Paper Series qt8x00h1c1, Department of Agricultural & Resource Economics, UC Berkeley.
    5. Bernhard Rauch & Max Göttsche & Gernot Brähler & Stefan Engel, 2011. "Fact and Fiction in EU‐Governmental Economic Data," German Economic Review, Verein für Socialpolitik, vol. 12(3), pages 243-255, August.
    6. Ronelle Burger & Canh Thien Dang & Trudy Owens, 2017. "Better performing NGOs do report more accurately: Evidence from investigating Ugandan NGO financial accounts," Discussion Papers 2017-10, University of Nottingham, CREDIT.
    7. Lothar Essig & Joachim K. Winter, 2009. "Item Non-Response to Financial Questions in Household Surveys: An Experimental Study of Interviewer and Mode Effects," Fiscal Studies, Institute for Fiscal Studies, vol. 30(Special I), pages 367-390, December.
    8. Villas-Boas, Sofia B. & Fu, Qiuzi & Judge, George, 2017. "Benford’s law and the FSD distribution of economic behavioral micro data," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 486(C), pages 711-719.
    9. Daniel McFadden, 2009. "The human side of mechanism design: a tribute to Leo Hurwicz and Jean-Jacque Laffont," Review of Economic Design, Springer;Society for Economic Design, vol. 13(1), pages 77-100, April.
    10. Hürlimann, Werner, 2015. "On the uniform random upper bound family of first significant digit distributions," Journal of Informetrics, Elsevier, vol. 9(2), pages 349-358.
    11. Theoharry Grammatikos & Nikolaos I. Papanikolaou, 2021. "Applying Benford’s Law to Detect Accounting Data Manipulation in the Banking Industry," Journal of Financial Services Research, Springer;Western Finance Association, vol. 59(1), pages 115-142, April.
    12. Kalaichelvan, Mohandass & Lim Kai Jie, Shawn, 2012. "A Critical Evaluation of the Significance of Round Numbers in European Equity Markets in Light of the Predictions from Benford’s Law," MPRA Paper 40960, University Library of Munich, Germany.
    13. John Morrow, 2014. "Benford's Law, Families of Distributions and a Test Basis," CEP Discussion Papers dp1291, Centre for Economic Performance, LSE.
    14. Tariq Ahmad Mir, 2012. "The leading digit distribution of the worldwide Illicit Financial Flows," Papers 1201.3432, arXiv.org, revised Nov 2012.
    15. Thomas Stoerk, 2015. "Statistical corruption in Beijing’s air quality data has likely ended in 2012," GRI Working Papers 194, Grantham Research Institute on Climate Change and the Environment.
    16. Winter, Joachim, 0000. "Bracketing effects in categorized survey questions and the measurement of economic quantities," Sonderforschungsbereich 504 Publications 02-35, Sonderforschungsbereich 504, Universität Mannheim;Sonderforschungsbereich 504, University of Mannheim.
    17. Tomasz Kamil Michalski & Guillaume Stoltz, 2010. "Do countries falsify economic data strategically? Some evidence that they do," Working Papers hal-00540794, HAL.
    18. Clarke, Philip M. & Fiebig, Denzil G. & Gerdtham, Ulf-G., 2008. "Optimal recall length in survey design," Journal of Health Economics, Elsevier, vol. 27(5), pages 1275-1284, September.
    19. Lee, Joanne & Cho, Wendy K. Tam & Judge, George G., 2010. "Stigler's approach to recovering the distribution of first significant digits in natural data sets," Statistics & Probability Letters, Elsevier, vol. 80(2), pages 82-88, January.
    20. Montag, Josef, 2017. "Identifying odometer fraud in used car market data," Transport Policy, Elsevier, vol. 60(C), pages 10-23.
