
All Data are Wrong, but Some are Useful? Advocating the Need for Data Auditing

Author

Listed:
  • Sitsofe Tsagbey
  • Miguel de Carvalho
  • Garritt L. Page

Abstract

In a recent article from the Annals of Applied Statistics, Cox discussed the main phases of applied statistical research ranging from clarifying study objectives to final data analysis and interpreting results. As an incidental remark to these main phases, we advocate that beyond cleaning and preprocessing the data, it is a good practice to audit the data to determine if they can be trusted at all. A case study based on Ghanaian Official Fishery Statistics is used to illustrate this need, with Benford's law being the tool used to carry out the data audit. Supplementary materials for this article are available online.
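
As a rough illustration of what a Benford-based data audit can look like, the sketch below compares observed first-digit counts with the Benford probabilities P(d) = log10(1 + 1/d) using a Pearson chi-square statistic. It is a minimal sketch under stated assumptions, not the authors' procedure: the function names and the `catches` values are hypothetical, and the article's own analysis of the fishery data may differ.

```python
import math
from collections import Counter


def benford_expected():
    """Benford first-digit probabilities P(d) = log10(1 + 1/d), d = 1..9."""
    return {d: math.log10(1 + 1 / d) for d in range(1, 10)}


def first_digit(x):
    """Leading nonzero digit of a finite, nonzero number."""
    # Strip leading zeros and the decimal point from the text form,
    # e.g. '0.0012' -> '12', so the first character is the leading digit.
    return int(str(abs(x)).lstrip("0.")[0])


def benford_chi_square(values):
    """Pearson chi-square statistic of observed first digits vs. Benford's law."""
    digits = [first_digit(v) for v in values if v != 0]  # zeros have no leading digit
    n = len(digits)
    counts = Counter(digits)
    stat = sum(
        (counts.get(d, 0) - n * p) ** 2 / (n * p)
        for d, p in benford_expected().items()
    )
    return stat, counts


# Hypothetical illustrative values, NOT the Ghanaian fishery data.
catches = [123.0, 1890.5, 27.4, 311.0, 1.2, 14.8, 96.3, 205.7, 1740.0, 38.9]
stat, counts = benford_chi_square(catches)
print(f"chi-square statistic: {stat:.3f} on 8 degrees of freedom")
```

With nine digit classes the statistic is usually referred to a chi-square distribution with 8 degrees of freedom, whose 5% critical value is about 15.51. A sample as small as the one above is far too short for the approximation to be reliable; it is only there to show the mechanics.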

Suggested Citation

  • Sitsofe Tsagbey & Miguel de Carvalho & Garritt L. Page, 2017. "All Data are Wrong, but Some are Useful? Advocating the Need for Data Auditing," The American Statistician, Taylor & Francis Journals, vol. 71(3), pages 231-235, July.
  • Handle: RePEc:taf:amstat:v:71:y:2017:i:3:p:231-235
    DOI: 10.1080/00031305.2017.1311282

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1080/00031305.2017.1311282
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1080/00031305.2017.1311282?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item


    References listed on IDEAS

    1. Christopher Pala, 2013. "Detective work uncovers under-reported overfishing," Nature, Nature, vol. 496(7443), pages 18-18, April.
    2. Reg Watson & Daniel Pauly, 2001. "Systematic distortions in world fisheries catch trends," Nature, Nature, vol. 414(6863), pages 534-536, November.
    3. Scott Marchi & James Hamilton, 2006. "Assessing the Accuracy of Self-Reported Data: an Evaluation of the Toxics Release Inventory," Journal of Risk and Uncertainty, Springer, vol. 32(1), pages 57-76, January.
    4. Mebane, Walter R., 2011. "Comment on “Benford's Law and the Detection of Election Fraud”," Political Analysis, Cambridge University Press, vol. 19(3), pages 269-272, July.
    5. Fewster, R. M., 2009. "A Simple Explanation of Benford's Law," The American Statistician, American Statistical Association, vol. 63(1), pages 26-32.
    6. Andreas Diekmann, 2007. "Not the First Digit! Using Benford's Law to Detect Fraudulent Scientific Data," Journal of Applied Statistics, Taylor & Francis Journals, vol. 34(3), pages 321-329.
    7. Anton K Formann, 2010. "The Newcomb-Benford Law in Its Relation to Some Common Distributions," PLOS ONE, Public Library of Science, vol. 5(5), pages 1-13, May.
    8. George Judge & Laura Schechter, 2009. "Detecting Problems in Survey Data Using Benford’s Law," Journal of Human Resources, University of Wisconsin Press, vol. 44(1).
    9. Nicholas J. Horton, 2015. "Challenges and Opportunities for Statistics and Statistical Education: Looking Back, Looking Forward," The American Statistician, Taylor & Francis Journals, vol. 69(2), pages 138-145, May.

    Citations



    Cited by:

    1. Roy Cerqueti & Claudio Lupi, 2023. "Severe testing of Benford’s law," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 32(2), pages 677-694, June.
    2. Junho Lee & Miguel de Carvalho, 2019. "Technological improvements or climate change? Bayesian modeling of time-varying conformance to Benford’s Law," PLOS ONE, Public Library of Science, vol. 14(4), pages 1-11, April.
    3. Sylwestrzak Marek, 2023. "Applying Benford’s law to detect earnings management," Journal of Economics and Management, Sciendo, vol. 45(1), pages 216-236, January.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Montag, Josef, 2017. "Identifying odometer fraud in used car market data," Transport Policy, Elsevier, vol. 60(C), pages 10-23.
    2. Vadim S. Balashov & Yuxing Yan & Xiaodi Zhu, 2020. "Who Manipulates Data During Pandemics? Evidence from Newcomb-Benford Law," Papers 2007.14841, arXiv.org, revised Jan 2021.
    3. Matthew A. Cole & David J. Maddison & Liyun Zhang, 2020. "Testing the emission reduction claims of CDM projects using the Benford’s Law," Climatic Change, Springer, vol. 160(3), pages 407-426, June.
    4. Holz, Carsten A., 2014. "The quality of China's GDP statistics," China Economic Review, Elsevier, vol. 30(C), pages 309-338.
    5. Willis A. Jones, 2020. "A Benford Analysis of National Collegiate Athletic Association Division I Finance Data," Journal of Sports Economics, vol. 21(3), pages 234-255, April.
    6. Louie Rivers & Tamara Dempsey & Jade Mitchell & Carole Gibbs, 2015. "Environmental Regulation and Enforcement: Structures, Processes and the Use of Data for Fraud Detection," Journal of Environmental Assessment Policy and Management (JEAPM), World Scientific Publishing Co. Pte. Ltd., vol. 17(04), pages 1-29, December.
    7. Theoharry Grammatikos & Nikolaos I. Papanikolaou, 2021. "Applying Benford’s Law to Detect Accounting Data Manipulation in the Banking Industry," Journal of Financial Services Research, Springer;Western Finance Association, vol. 59(1), pages 115-142, April.
    8. Sammy Zahran & Terrence Iverson & Stephan Weiler & Anthony Underwood, 2014. "Evidence that the accuracy of self-reported lead emissions data improved: A puzzle and discussion," Journal of Risk and Uncertainty, Springer, vol. 49(3), pages 235-257, December.
    9. Montag, Josef, 2015. "Identifying Odometer Fraud: Evidence from the Used Car Market in the Czech Republic," MPRA Paper 65182, University Library of Munich, Germany.
    10. Mr. Jesus R Gonzalez-Garcia & Mr. Gonzalo C Pastor Campos, 2009. "Benford’s Law and Macroeconomic Data Quality," IMF Working Papers 2009/010, International Monetary Fund.
    11. Ioana Sorina Deleanu, 2017. "Do Countries Consistently Engage in Misinforming the International Community about Their Efforts to Combat Money Laundering? Evidence Using Benford’s Law," PLOS ONE, Public Library of Science, vol. 12(1), pages 1-19, January.
    12. Wang, Delu & Chen, Fan & Mao, Jinqi & Liu, Nannan & Rong, Fangyu, 2022. "Are the official national data credible? Empirical evidence from statistics quality evaluation of China's coal and its downstream industries," Energy Economics, Elsevier, vol. 114(C).
    13. Auffhammer, Maximilian & Carson, Richard T., 2008. "Forecasting the path of China's CO2 emissions using province-level information," Journal of Environmental Economics and Management, Elsevier, vol. 55(3), pages 229-247, May.
    14. Bernhard Rauch & Max Göttsche & Stephan Langenegger, 2014. "Detecting Problems in Military Expenditure Data Using Digital Analysis," Defence and Peace Economics, Taylor & Francis Journals, vol. 25(2), pages 97-111, April.
    15. Lee, Kang-Bok & Han, Sumin & Jeong, Yeasung, 2020. "COVID-19, flattening the curve, and Benford’s law," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 559(C).
    16. Brähler, Gernot & Bensmann, Markus & Emke, Anna-Lena, 2010. "Der Einsatz mathematisch-statistischer Methoden in der digitalen Betriebsprüfung," Ilmenauer Schriften zur Betriebswirtschaftslehre, Technische Universität Ilmenau, Institut für Betriebswirtschaftslehre, volume 4, number 42010.
    17. Rachel Peletz & Emily Kumpel & Mateyo Bonham & Zarah Rahman & Ranjiv Khush, 2016. "To What Extent is Drinking Water Tested in Sub-Saharan Africa? A Comparative Analysis of Regulated Water Quality Monitoring," IJERPH, MDPI, vol. 13(3), pages 1-14, March.
    18. Venuka Aggarwal & Khushdeep Dharni, 2020. "Deshelling the Shell Companies Using Benford’s Law: An Emerging Market Study," Vikalpa: The Journal for Decision Makers, vol. 45(3), pages 160-169, September.
    19. Bernhard Rauch & Max Göttsche & Gernot Brähler & Stefan Engel, 2011. "Fact and Fiction in EU‐Governmental Economic Data," German Economic Review, Verein für Socialpolitik, vol. 12(3), pages 243-255, August.
    20. Shikano Susumu & Mack Verena, 2011. "When Does the Second-Digit Benford’s Law-Test Signal an Election Fraud?: Facts or Misleading Test Results," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 231(5-6), pages 719-732, October.
