IDEAS home Printed from https://ideas.repec.org/p/cte/wsrepe/37746.html
   My bibliography  Save this paper

Tall big data time series of high frequency: stylized facts and econometric modelling

Author

Listed:
  • Espasa, Antoni
  • Carlomagno Real, Guillermo

Abstract

The paper starts commenting on the hard tasks of data treatment -mainly, cleaning, classification, and aggregation- that are required at the beginning of any analysis with big data. Subsequently, it focuses on non-financial big data time series of high frequency that for many problems are aggregated at daily, hourly, or higher frequency levels of several minutes. Then, the paper discusses possible stylized facts present in these data. In this respect, it studies relevant seasonality: daily, weekly, monthly, and annually, and analyses how, for the data in question, these cycles could be affected by weather variables and by factors due to the annual composition of the calendar. Consequently, the paper investigates the possible main characteristics of the mentioned cycles and the types of responses to the exogenous weather and calendar factors that data could show. The shorter cycles could change along the annual cycle and interact with the exogenous variables. The modelling strategy could require regime-switching, dynamic, non-linear structures, and interactions between the factors considered. Then the paper analyses the construction of explanatory variables that could be useful for taking into account all the above peculiarities. We propose the use of the automated procedure, Autometrics, to discover -in words of Prof Hendry- a parsimonious model not dominated by any other, which is able to explain all the characteristics of the data. The model can be used for structural analysis, forecasting, and, when it is the case, to build real-time quantitative macroeconomic leading indicators. Finally, the paper includes an application to the daily series of jobless claims in Chile.

Suggested Citation

  • Espasa, Antoni & Carlomagno Real, Guillermo, 2023. "Tall big data time series of high frequency: stylized facts and econometric modelling," DES - Working Papers. Statistics and Econometrics. WS 37746, Universidad Carlos III de Madrid. Departamento de Estadística.
  • Handle: RePEc:cte:wsrepe:37746
    as

    Download full text from publisher

    File URL: https://e-archivo.uc3m.es/bitstream/handle/10016/37746/ws202306.pdf?sequence=1
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Antoni Espasa & Eva Senra, 2017. "Twenty-Two Years of Inflation Assessment and Forecasting Experience at the Bulletin of EU & US Inflation and Macroeconomic Analysis," Econometrics, MDPI, vol. 5(4), pages 1-28, October.
    2. Jose Ramon Cancelo & Antoni Espasa, 1996. "Modelling and forecastng daily series of electricity demand," Investigaciones Economicas, Fundación SEPI, vol. 20(3), pages 359-376, September.
    3. Alberto Cavallo & Roberto Rigobon, 2016. "The Billion Prices Project: Using Online Prices for Measurement and Research," Journal of Economic Perspectives, American Economic Association, vol. 30(2), pages 151-178, Spring.
    4. Cancelo, José Ramón & Espasa, Antoni, 1991. "Forecasting daily demand for electricity with multiple-input nonlinear transfer function models: a case study," UC3M Working papers. Economics 2808, Universidad Carlos III de Madrid. Departamento de Economía.
    5. Espasa, Antoni, 1993. "Modelling daily series of economic activity," DES - Working Papers. Statistics and Econometrics. WS 3682, Universidad Carlos III de Madrid. Departamento de Estadística.
    6. Espasa, Antoni & Cancelo, José Ramón & Revuelta, J. Manuel, 1996. "Automatic modelling of daily series of economic activity," DES - Working Papers. Statistics and Econometrics. WS 3356, Universidad Carlos III de Madrid. Departamento de Estadística.
    7. J. Ignacio Conde-Ruiz & Manu García & Luis A. Puch & Jesús Ruiz, 2019. "Calendar effects in daily aggregate employment creation and destruction in Spain," SERIEs: Journal of the Spanish Economic Association, Springer;Spanish Economic Association, vol. 10(1), pages 25-63, March.
    8. Diego Bodas & Juan Ramon Garcia & Juan Murillo & Matias Pacce & Tomasa Rodrigo & Juan de Dios Romero & Pep Ruiz & Camilo Ulloa & Heribert Valero, 2018. "Measuring Retail Trade Using Card Transactional Data," Working Papers 18/03, BBVA Bank, Economic Research Department.
    9. Beate Franke & Jean-FRANçois Plante & Ribana Roscher & En-shiun Annie Lee & Cathal Smyth & Armin Hatefi & Fuqi Chen & Einat Gil & Alexander Schwing & Alessandro Selvitella & Michael M. Hoffman & Roger, 2016. "Statistical Inference, Learning and Models in Big Data," International Statistical Review, International Statistical Institute, vol. 84(3), pages 371-389, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jose Ramon Cancelo & Antoni Espasa, 1996. "Modelling and forecastng daily series of electricity demand," Investigaciones Economicas, Fundación SEPI, vol. 20(3), pages 359-376, September.
    2. Cancelo, José Ramón & Espasa, Antoni & Grafe, Rosmarie, 2008. "Forecasting the electricity load from one day to one week ahead for the Spanish system operator," International Journal of Forecasting, Elsevier, vol. 24(4), pages 588-602.
    3. Cancelo, José Ramón & Espasa, Antoni & Grafe, Rosmarie, 2007. "Forecasting from one day to one week ahead for the Spanish system operator," DES - Working Papers. Statistics and Econometrics. WS ws078418, Universidad Carlos III de Madrid. Departamento de Estadística.
    4. Espasa, Antoni & Cancelo, José Ramón & Revuelta, J. Manuel, 1996. "Automatic modelling of daily series of economic activity," DES - Working Papers. Statistics and Econometrics. WS 3356, Universidad Carlos III de Madrid. Departamento de Estadística.
    5. Fernando Alvarez & Francesco Lippi & Juan Passadore, 2017. "Are State- and Time-Dependent Models Really Different?," NBER Macroeconomics Annual, University of Chicago Press, vol. 31(1), pages 379-457.
    6. Fernando Alvarez & Francesco Lippi, 2020. "Temporary Price Changes, Inflation Regimes, and the Propagation of Monetary Shocks," American Economic Journal: Macroeconomics, American Economic Association, vol. 12(1), pages 104-152, January.
    7. John M. Abowd & Ian M. Schmutte & William Sexton & Lars Vilhuber, 2019. "Suboptimal Provision of Privacy and Statistical Accuracy When They are Public Goods," Papers 1906.09353, arXiv.org.
    8. Juan de Dios Romero Palop & Juan Murillo Arias & Diego J. Bodas-Sagi & Heribert Valero Lapaz, 2019. "Determining the usual environment of cardholders as a key factor to measure the evolution of domestic tourism," Information Technology & Tourism, Springer, vol. 21(1), pages 23-43, March.
    9. Erica L. Groshen & Brian C. Moyer & Ana M. Aizcorbe & Ralph Bradley & David M. Friedman, 2017. "How Government Statistics Adjust for Potential Biases from Quality Change and New Goods in an Age of Digital Technologies: A View from the Trenches," Journal of Economic Perspectives, American Economic Association, vol. 31(2), pages 187-210, Spring.
    10. Garcia-Louzao, Jose & Hospido, Laura & Ruggieri, Alessandro, 2023. "Dual returns to experience," Labour Economics, Elsevier, vol. 80(C).
    11. Patrick Bajari & Zhihao Cen & Victor Chernozhukov & Manoj Manukonda & Jin Wang & Ramon Huerta & Junbo Li & Ling Leng & George Monokroussos & Suhas Vijaykunar & Shan Wan, 2023. "Hedonic prices and quality adjusted price indices powered by AI," CeMMAP working papers 08/23, Institute for Fiscal Studies.
    12. J. Ignacio Conde-Ruiz & Manu García & Luis A. Puch & Jesús Ruiz, 2019. "Calendar effects in daily aggregate employment creation and destruction in Spain," SERIEs: Journal of the Spanish Economic Association, Springer;Spanish Economic Association, vol. 10(1), pages 25-63, March.
    13. Jennifer Peña & Elvira Prades, 2021. "Price setting in Chile: Micro evidence from consumer on-line prices during the social outbreak and Covid-19," Working Papers 2112, Banco de España.
    14. Giuseppe Arbia & Vincenzo Nardelli, 2024. "Using Web-Data to Estimate Spatial Regression Models," International Regional Science Review, , vol. 47(2), pages 204-226, March.
    15. Cristina Miranda & Reinaldo Castro Souza & Mônica Barros & Cristina Vidigal Cabral de Miranda, 2007. "Short Term Demand Forecasting Using Double Exponential Smoothing and Interventions to Account for Holidays and Temperature Effects," EcoMod2007 23900058, EcoMod.
    16. Espasa, Antoni, 1993. "Modelling daily series of economic activity," DES - Working Papers. Statistics and Econometrics. WS 3682, Universidad Carlos III de Madrid. Departamento de Estadística.
    17. Hai Long Vo & Duc Hong Vo, 2023. "The purchasing power parity and exchange‐rate economics half a century on," Journal of Economic Surveys, Wiley Blackwell, vol. 37(2), pages 446-479, April.
    18. Diego Bodas & Juan Ramon Garcia & Juan Murillo & Matias Pacce & Tomasa Rodrigo & Juan de Dios Romero & Pep Ruiz & Camilo Ulloa & Heribert Valero, 2018. "Measuring Retail Trade Using Card Transactional Data," Working Papers 18/03, BBVA Bank, Economic Research Department.
    19. Cavallo, Alberto & Kryvtsov, Oleksiy, 2023. "What can stockouts tell us about inflation? Evidence from online micro data," Journal of International Economics, Elsevier, vol. 146(C).

    More about this item

    Keywords

    Aggregation;

    JEL classification:

    • C01 - Mathematical and Quantitative Methods - - General - - - Econometrics
    • C22 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Time-Series Models; Dynamic Quantile Regressions; Dynamic Treatment Effect Models; Diffusion Processes
    • C55 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Large Data Sets: Modeling and Analysis

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:cte:wsrepe:37746. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Ana Poveda (email available below). General contact details of provider: http://portal.uc3m.es/portal/page/portal/dpto_estadistica .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.