Printed from https://ideas.repec.org/p/ehl/lserod/60364.html

Benford's Law, families of distributions and a test basis

Author

Listed:
  • Morrow, John

Abstract

Benford's Law is used to test for data irregularities. While the approach is novel, the current methodology has two weaknesses. First, the test values used in practice are too conservative; the test values derived in this paper are more powerful and hold for fairly small samples. Second, testing requires Benford's Law to hold, which it often does not. I present a simple method that transforms distributions to satisfy the Law with arbitrary precision and induces scale invariance, freeing tests from the choice of units. I additionally derive a rate of convergence to Benford's Law. Finally, the results are applied to common distributions.
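
As background for the kind of test the abstract refers to, the following is a minimal sketch of a generic first-digit check: it computes the Benford probabilities log10(1 + 1/d) for d = 1, ..., 9 and a Pearson chi-square statistic against them. The function names are hypothetical, and the statistic is the textbook chi-square form, not the test values or the transformation derived in the paper.

```python
import math
from collections import Counter
from decimal import Decimal


def benford_probs():
    """Benford first-digit probabilities P(d) = log10(1 + 1/d), d = 1..9."""
    return {d: math.log10(1 + 1 / d) for d in range(1, 10)}


def first_digit(x):
    """Leading nonzero decimal digit of a nonzero number."""
    digits = Decimal(str(abs(x))).as_tuple().digits
    if digits[0] == 0:
        raise ValueError("zero has no first significant digit")
    return digits[0]


def chi_square_stat(values):
    """Pearson chi-square statistic of observed first digits vs. Benford.

    Illustrative assumption: a plain goodness-of-fit statistic with
    8 degrees of freedom, not the paper's proposed test values.
    """
    probs = benford_probs()
    digits = [first_digit(v) for v in values if v != 0]
    n = len(digits)
    counts = Counter(digits)
    return sum((counts.get(d, 0) - n * p) ** 2 / (n * p)
               for d, p in probs.items())


# Powers of 2 are a classic sequence whose first digits track Benford's Law;
# the integers 100..999 have uniform first digits and do not.
powers_of_two = [2 ** k for k in range(1, 201)]
uniform_digits = list(range(100, 1000))

stat_benford_like = chi_square_stat(powers_of_two)
stat_uniform = chi_square_stat(uniform_digits)
print(stat_benford_like, stat_uniform)
```

A statistic computed this way is usually compared with a chi-square critical value at 8 degrees of freedom (about 15.51 at the 5% level). Rescaling the data, for example multiplying every value by a constant to change units, generally changes the first digits, which is why the abstract stresses scale invariance.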

Suggested Citation

  • Morrow, John, 2014. "Benford's Law, families of distributions and a test basis," LSE Research Online Documents on Economics 60364, London School of Economics and Political Science, LSE Library.
  • Handle: RePEc:ehl:lserod:60364

    Download full text from publisher

    File URL: http://eprints.lse.ac.uk/60364/
    File Function: Open access version.
    Download Restriction: no

    References listed on IDEAS

    1. Tam Cho, Wendy K. & Gaines, Brian J., 2007. "Breaking the (Benford) Law: Statistical Fraud Detection in Campaign Finance," The American Statistician, American Statistical Association, vol. 61, pages 218-223, August.
    2. G. Noether, 1963. "Note on the kolmogorov statistic in the discrete case," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 7(1), pages 115-116, December.
    3. Rodriguez R.J., 2004. "First Significant Digit Patterns From Mixtures of Uniform Distributions," The American Statistician, American Statistical Association, vol. 58, pages 64-71, February.
    4. Grendar, Marian & Judge, George & Schechter, Laura, 2007. "An empirical non-parametric likelihood family of data-based Benford-like distributions," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 380(C), pages 429-438.
    5. Scott Marchi & James Hamilton, 2006. "Assessing the Accuracy of Self-Reported Data: an Evaluation of the Toxics Release Inventory," Journal of Risk and Uncertainty, Springer, vol. 32(1), pages 57-76, January.

    Citations


    Cited by:

    1. Rabeea Sadaf, 2017. "Advanced Statistical Techniques For Testing Benford's Law," Annals of Faculty of Economics, University of Oradea, Faculty of Economics, vol. 1(2), pages 229-238, December.
    2. Bernhard Rauch & Max Göttsche & Gernot Brähler & Stefan Engel, 2011. "Fact and Fiction in EU‐Governmental Economic Data," German Economic Review, Verein für Socialpolitik, vol. 12(3), pages 243-255, August.
    3. Ronelle Burger & Canh Thien Dang & Trudy Owens, 2017. "Better performing NGOs do report more accurately: Evidence from investigating Ugandan NGO financial accounts," Discussion Papers 2017-10, University of Nottingham, CREDIT.
    4. Dang, Canh Thien & Owens, Trudy, 2020. "Does transparency come at the cost of charitable services? Evidence from investigating British charities," Journal of Economic Behavior & Organization, Elsevier, vol. 172(C), pages 314-343.
    5. Holz, Carsten A., 2014. "The quality of China's GDP statistics," China Economic Review, Elsevier, vol. 30(C), pages 309-338.
    6. Hürlimann, Werner, 2015. "On the uniform random upper bound family of first significant digit distributions," Journal of Informetrics, Elsevier, vol. 9(2), pages 349-358.
    7. Pankaj C. Patel & Mike G. Tsionas & Maria João Guedes, 2022. "Benford's law, small business financial reporting, and survival," Managerial and Decision Economics, John Wiley & Sons, Ltd., vol. 43(8), pages 3301-3315, December.
    8. Druică, Elena & Oancea, Bogdan & Vâlsan, Călin, 2018. "Benford's law and the limits of digit analysis," International Journal of Accounting Information Systems, Elsevier, vol. 31(C), pages 75-82.
    9. Thomas Stoerk, 2015. "Statistical corruption in Beijing’s air quality data has likely ended in 2012," GRI Working Papers 194, Grantham Research Institute on Climate Change and the Environment.
    10. Hao, Zhuang & Zhang, Xudong & Wang, Yuze, 2024. "Assessing the accuracy of self-reported health expenditure data: Evidence from two public surveys in China," Social Science & Medicine, Elsevier, vol. 356(C).
    11. Kalaichelvan, Mohandass & Lim Kai Jie, Shawn, 2012. "A Critical Evaluation of the Significance of Round Numbers in European Equity Markets in Light of the Predictions from Benford’s Law," MPRA Paper 40960, University Library of Munich, Germany.
    12. Eutsler, Jared & Kathleen Harris, M. & Tyler Williams, L. & Cornejo, Omar E., 2023. "Accounting for partisanship and politicization: Employing Benford's Law to examine misreporting of COVID-19 infection cases and deaths in the United States," Accounting, Organizations and Society, Elsevier, vol. 108(C).
    13. George Judge & Laura Schechter, 2009. "Detecting Problems in Survey Data Using Benford’s Law," Journal of Human Resources, University of Wisconsin Press, vol. 44(1).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lee, Joanne & Cho, Wendy K. Tam & Judge, George G., 2010. "Stigler's approach to recovering the distribution of first significant digits in natural data sets," Statistics & Probability Letters, Elsevier, vol. 80(2), pages 82-88, January.
    2. Vadim S. Balashov & Yuxing Yan & Xiaodi Zhu, 2020. "Who Manipulates Data During Pandemics? Evidence from Newcomb-Benford Law," Papers 2007.14841, arXiv.org, revised Jan 2021.
    3. Philip E Hulme & Danish A Ahmed & Phillip J Haubrock & Brooks A Kaiser & Melina Kourantidou & Boris Leroy & Shana M Mcdermott, 2024. "Widespread imprecision in estimates of the economic costs of invasive alien species worldwide," Post-Print hal-04633043, HAL.
    4. Matthew A. Cole & David J. Maddison & Liyun Zhang, 2020. "Testing the emission reduction claims of CDM projects using the Benford’s Law," Climatic Change, Springer, vol. 160(3), pages 407-426, June.
    5. Lee, Kang-Bok & Han, Sumin & Jeong, Yeasung, 2020. "COVID-19, flattening the curve, and Benford’s law," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 559(C).
    6. Kishore Singh & Peter Best, 2020. "Implementing Benford’s Law in Continuous Monitoring Applications," Journal of Accounting and Management Information Systems, Faculty of Accounting and Management Information Systems, The Bucharest University of Economic Studies, vol. 19(2), pages 379-404, June.
    7. Villas-Boas, Sofia B. & Fu, Qiuzi & Judge, George, 2017. "Benford’s law and the FSD distribution of economic behavioral micro data," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 486(C), pages 711-719.
    8. Fraas, Art & Egorenkov, Alex, 2015. "A Retrospective Study of EPA’s Air Toxics Program under the Revised Section 112 Requirements of the Clean Air Act," RFF Working Paper Series dp-15-23, Resources for the Future.
    9. Louie Rivers & Tamara Dempsey & Jade Mitchell & Carole Gibbs, 2015. "Environmental Regulation and Enforcement: Structures, Processes and the Use of Data for Fraud Detection," Journal of Environmental Assessment Policy and Management (JEAPM), World Scientific Publishing Co. Pte. Ltd., vol. 17(04), pages 1-29, December.
    10. Alex Hollingsworth & Ivan Rudik, 2021. "The Effect of Leaded Gasoline on Elderly Mortality: Evidence from Regulatory Exemptions," American Economic Journal: Economic Policy, American Economic Association, vol. 13(3), pages 345-373, August.
    11. Sitsofe Tsagbey & Miguel de Carvalho & Garritt L. Page, 2017. "All Data are Wrong, but Some are Useful? Advocating the Need for Data Auditing," The American Statistician, Taylor & Francis Journals, vol. 71(3), pages 231-235, July.
    12. Qiuzi Fu & Sofia B. Villas-Boas & George Judge, 2019. "Does china income FSDs follow Benford? A comparison between Chinese income first significant digit distribution with Benford distribution," China Economic Journal, Taylor & Francis Journals, vol. 12(1), pages 68-76, January.
    13. Dang, Canh Thien & Owens, Trudy, 2020. "Does transparency come at the cost of charitable services? Evidence from investigating British charities," Journal of Economic Behavior & Organization, Elsevier, vol. 172(C), pages 314-343.
    14. Shinsuke Tanaka & Kensuke Teshima & Eric Verhoogen, 2022. "North-South Displacement Effects of Environmental Regulation: The Case of Battery Recycling," American Economic Review: Insights, American Economic Association, vol. 4(3), pages 271-288, September.
    15. Dang, Tri Vi & Wang, Youan & Wang, Zigan, 2022. "The role of financial constraints in firm investment under pollution abatement regulation," Journal of Corporate Finance, Elsevier, vol. 76(C).
    16. Hannah Aoyagi & Oladele A. Ogunseitan, 2015. "Toxic Releases and Risk Disparity: A Spatiotemporal Model of Industrial Ecology and Social Empowerment," IJERPH, MDPI, vol. 12(6), pages 1-19, June.
    17. De Silva, Dakshina G. & McComb, Robert P. & Schiller, Anita R. & Slechten, Aurelie, 2021. "Firm behavior and pollution in small geographies," European Economic Review, Elsevier, vol. 136(C).
    18. Edward J. Lusk & Michael Halperin, 2014. "Detecting Newcomb-Benford Digital Frequency Anomalies in the Audit Context: Suggested Chi2 Test Possibilities," Accounting and Finance Research, Sciedu Press, vol. 3(2), pages 191-191, May.
    19. Thomas Stoerk, 2015. "Statistical corruption in Beijing’s air quality data has likely ended in 2012," GRI Working Papers 194, Grantham Research Institute on Climate Change and the Environment.
    20. John Xuefeng Jiang & Jing Kong, 2024. "Green dies in darkness? environmental externalities of newspaper closures," Review of Accounting Studies, Springer, vol. 29(4), pages 3564-3599, December.

    More about this item

    Keywords

    Benford's Law; data quality; fraud detection;

    JEL classification:

    • C10 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - General
    • C24 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Truncated and Censored Models; Switching Regression Models; Threshold Regression Models
    • C46 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics - - - Specific Distributions
