IDEAS home Printed from https://ideas.repec.org/a/bla/jorssa/v184y2021i2p571-588.html
   My bibliography  Save this article

Proxy expenditure weights for Consumer Price Index: Audit sampling inference for big‐data statistics

Author

Listed:
  • Li‐Chun Zhang

Abstract

Purchase data from retail chains can provide proxy measures of private household expenditure on items that are the most troublesome to collect in the traditional expenditure survey. Due to the inevitable coverage and selection errors, bias must exist in these proxy measures. Moreover, given the sheer amount of data, the bias completely dominates the variance. To investigate the potential of replacing costly and burdensome surveys by non‐survey big‐data sources, we propose an audit sampling inference approach, which does not require linking the audit sample and the big‐data source at the individual level. It turns out that one is unable to reject a null hypothesis of unbiased big‐data estimation at the chosen size, because the audit sampling variance is too large compared to the bias of the big‐data estimate. For the same reason, audit sampling fails to yield a meaningful mean squared error estimate. We propose a novel accuracy measure that is generally applicable in such situations. This can provide a necessary part of the statistical argument for the uptake of non‐survey big‐data sources, in replacement of traditional survey sampling. An application to disaggregated food price indices is used to demonstrate the proposed approach.

Suggested Citation

  • Li‐Chun Zhang, 2021. "Proxy expenditure weights for Consumer Price Index: Audit sampling inference for big‐data statistics," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(2), pages 571-588, April.
  • Handle: RePEc:bla:jorssa:v:184:y:2021:i:2:p:571-588
    DOI: 10.1111/rssa.12632
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/rssa.12632
    Download Restriction: no

    File URL: https://libkey.io/10.1111/rssa.12632?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Li-Chun Zhang, 2019. "On valid descriptive inference from non-probability sample," Statistical Theory and Related Fields, Taylor & Francis Journals, vol. 3(2), pages 103-113, July.
    2. Erich Battistin & Mario Padula, 2016. "Survey instruments and the reports of consumption expenditures: evidence from the consumer expenditure surveys," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 179(2), pages 559-581, February.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Timiryanova, Venera, 2022. "Высокочастотные Данные, Характеризующие Розничную Торговлю: Интересы Государства, Предприятий И Научных Организаций [High-frequency retail data: the interests of the state, enterprises and scientif," MPRA Paper 115681, University Library of Munich, Germany.
    2. Fabrizio Solari & Antonella Bernardini & Nicoletta Cibella, 2023. "Statistical framework for fully register based population counts," METRON, Springer;Sapienza Università di Roma, vol. 81(1), pages 109-129, April.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Olivier Coibion & Yuriy Gorodnichenko & Dmitri Koustas, 2021. "Consumption Inequality and the Frequency of Purchases," American Economic Journal: Macroeconomics, American Economic Association, vol. 13(4), pages 449-482, October.
    2. Rodolfo G. Campos & Iliana Reggio & Dionisio Garc𫑐, 2013. "Micro versus macro consumption data: the cyclical properties of the consumer expenditure survey," Applied Economics, Taylor & Francis Journals, vol. 45(26), pages 3778-3785, September.
    3. Giacomo De Giorgi & Luca Gambetti, 2012. "Consumption Heterogeneity over the Business Cycle," Working Papers 646, Barcelona School of Economics.
    4. Olga Gorbachev, 2011. "Did Household Consumption Become More Volatile?," American Economic Review, American Economic Association, vol. 101(5), pages 2248-2270, August.
    5. Martina Patone & Li‐Chun Zhang, 2021. "On Two Existing Approaches to Statistical Analysis of Social Media Data," International Statistical Review, International Statistical Institute, vol. 89(1), pages 54-71, April.
    6. Campos, Rodolfo G. & Reggio, Iliana, 2013. "Measurement error and imputation of consumption in survey data," UC3M Working papers. Economics we1219, Universidad Carlos III de Madrid. Departamento de Economía.
    7. Giacomo De Giorgi & Luca Gambetti, 2012. "The Effects of Government Spending on the Distribution of Consumption," Working Papers 645, Barcelona School of Economics.
    8. Yingli Pan & Wen Cai & Zhan Liu, 2022. "Inference for non-probability samples under high-dimensional covariate-adjusted superpopulation model," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 31(4), pages 955-979, October.
    9. Giacomo De Giorgi & Luca Gambetti, 2017. "Business Cycle Fluctuations and the Distribution of Consumption," Review of Economic Dynamics, Elsevier for the Society for Economic Dynamics, vol. 23, pages 19-41, January.
    10. Scrimgeour, Dean & Gorry, James, 2015. "Using Engel Curves to Estimate CPI Bias for the Elderly," Working Papers 2015-03, Department of Economics, Colgate University, revised 08 Jun 2015.
    11. Campos, Rodolfo G. & Reggio, Iliana, 2014. "Measurement error in imputation procedures," Economics Letters, Elsevier, vol. 122(2), pages 197-202.
    12. Paul A. Smith, 2021. "Estimating Sampling Errors in Consumer Price Indices," International Statistical Review, International Statistical Institute, vol. 89(3), pages 481-504, December.
    13. Justine Hastings & Jesse M. Shapiro, 2018. "How Are SNAP Benefits Spent? Evidence from a Retail Panel," American Economic Review, American Economic Association, vol. 108(12), pages 3493-3540, December.
    14. Thomas F. Crossley & Joachim K. Winter, 2014. "Asking Households about Expenditures: What Have We Learned?," NBER Chapters, in: Improving the Measurement of Consumer Expenditures, pages 23-50, National Bureau of Economic Research, Inc.
    15. Li-Chun Zhang, 2019. "Proxy expenditure weights for Consumer Price Index: Audit sampling inference for big data statistics," Papers 1906.11208, arXiv.org.
    16. Campos, Rodolfo G. & Reggio, Iliana & García-Píriz, Dionisio, 2012. "Micro vs. macro consumption data : the cyclical properties of the consumer expenditure survey," UC3M Working papers. Economics we1220, Universidad Carlos III de Madrid. Departamento de Economía.
    17. Marcin Hitczenko, 2013. "Optimal recall period length in consumer payment surveys," Working Papers 13-16, Federal Reserve Bank of Boston.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jorssa:v:184:y:2021:i:2:p:571-588. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/rssssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.