IDEAS home Printed from https://ideas.repec.org/p/cte/wsrepe/37746.html
   My bibliography  Save this paper

Tall big data time series of high frequency: stylized facts and econometric modelling

Author

Listed:
  • Espasa, Antoni
  • Carlomagno Real, Guillermo

Abstract

The paper starts commenting on the hard tasks of data treatment -mainly, cleaning, classification, and aggregation- that are required at the beginning of any analysis with big data. Subsequently, it focuses on non-financial big data time series of high frequency that for many problems are aggregated at daily, hourly, or higher frequency levels of several minutes. Then, the paper discusses possible stylized facts present in these data. In this respect, it studies relevant seasonality: daily, weekly, monthly, and annually, and analyses how, for the data in question, these cycles could be affected by weather variables and by factors due to the annual composition of the calendar. Consequently, the paper investigates the possible main characteristics of the mentioned cycles and the types of responses to the exogenous weather and calendar factors that data could show. The shorter cycles could change along the annual cycle and interact with the exogenous variables. The modelling strategy could require regime-switching, dynamic, non-linear structures, and interactions between the factors considered. Then the paper analyses the construction of explanatory variables that could be useful for taking into account all the above peculiarities. We propose the use of the automated procedure, Autometrics, to discover -in words of Prof Hendry- a parsimonious model not dominated by any other, which is able to explain all the characteristics of the data. The model can be used for structural analysis, forecasting, and, when it is the case, to build real-time quantitative macroeconomic leading indicators. Finally, the paper includes an application to the daily series of jobless claims in Chile.

Suggested Citation

  • Espasa, Antoni & Carlomagno Real, Guillermo, 2023. "Tall big data time series of high frequency: stylized facts and econometric modelling," DES - Working Papers. Statistics and Econometrics. WS 37746, Universidad Carlos III de Madrid. Departamento de Estadística.
  • Handle: RePEc:cte:wsrepe:37746
    as

    Download full text from publisher

    File URL: https://e-archivo.uc3m.es/rest/api/core/bitstreams/19729326-cfd7-4255-9d63-372fc54e2293/content
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. J. Ignacio Conde-Ruiz & Manu García & Luis A. Puch & Jesús Ruiz, 2019. "Calendar effects in daily aggregate employment creation and destruction in Spain," SERIEs: Journal of the Spanish Economic Association, Springer;Spanish Economic Association, vol. 10(1), pages 25-63, March.
    2. Beate Franke & Jean-FRANçois Plante & Ribana Roscher & En-shiun Annie Lee & Cathal Smyth & Armin Hatefi & Fuqi Chen & Einat Gil & Alexander Schwing & Alessandro Selvitella & Michael M. Hoffman & Roger, 2016. "Statistical Inference, Learning and Models in Big Data," International Statistical Review, International Statistical Institute, vol. 84(3), pages 371-389, December.
    3. Espasa, Antoni & Cancelo, José Ramón & Revuelta, J. Manuel, 1996. "Automatic modelling of daily series of economic activity," DES - Working Papers. Statistics and Econometrics. WS 3356, Universidad Carlos III de Madrid. Departamento de Estadística.
    4. Antoni Espasa & Eva Senra, 2017. "Twenty-Two Years of Inflation Assessment and Forecasting Experience at the Bulletin of EU & US Inflation and Macroeconomic Analysis," Econometrics, MDPI, vol. 5(4), pages 1-28, October.
    5. Jose Ramon Cancelo & Antoni Espasa, 1996. "Modelling and forecastng daily series of electricity demand," Investigaciones Economicas, Fundación SEPI, vol. 20(3), pages 359-376, September.
    6. Cancelo, José Ramón & Espasa, Antoni, 1991. "Forecasting daily demand for electricity with multiple-input nonlinear transfer function models: a case study," UC3M Working papers. Economics 2808, Universidad Carlos III de Madrid. Departamento de Economía.
    7. Espasa, Antoni, 1993. "Modelling daily series of economic activity," DES - Working Papers. Statistics and Econometrics. WS 3682, Universidad Carlos III de Madrid. Departamento de Estadística.
    8. Diego Bodas & Juan Ramon Garcia & Juan Murillo & Matias Pacce & Tomasa Rodrigo & Juan de Dios Romero & Pep Ruiz & Camilo Ulloa & Heribert Valero, 2018. "Measuring Retail Trade Using Card Transactional Data," Working Papers 18/03, BBVA Bank, Economic Research Department.
    9. Alberto Cavallo & Roberto Rigobon, 2016. "The Billion Prices Project: Using Online Prices for Measurement and Research," Journal of Economic Perspectives, American Economic Association, vol. 30(2), pages 151-178, Spring.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jose Ramon Cancelo & Antoni Espasa, 1996. "Modelling and forecastng daily series of electricity demand," Investigaciones Economicas, Fundación SEPI, vol. 20(3), pages 359-376, September.
    2. Cancelo, José Ramón & Espasa, Antoni & Grafe, Rosmarie, 2008. "Forecasting the electricity load from one day to one week ahead for the Spanish system operator," International Journal of Forecasting, Elsevier, vol. 24(4), pages 588-602.
    3. Cancelo, José Ramón & Espasa, Antoni & Grafe, Rosmarie, 2007. "Forecasting from one day to one week ahead for the Spanish system operator," DES - Working Papers. Statistics and Econometrics. WS ws078418, Universidad Carlos III de Madrid. Departamento de Estadística.
    4. Espasa, Antoni & Cancelo, José Ramón & Revuelta, J. Manuel, 1996. "Automatic modelling of daily series of economic activity," DES - Working Papers. Statistics and Econometrics. WS 3356, Universidad Carlos III de Madrid. Departamento de Estadística.
    5. Fernando Alvarez & Francesco Lippi & Juan Passadore, 2017. "Are State- and Time-Dependent Models Really Different?," NBER Macroeconomics Annual, University of Chicago Press, vol. 31(1), pages 379-457.
    6. John M. Abowd & Ian M. Schmutte & William Sexton & Lars Vilhuber, 2019. "Suboptimal Provision of Privacy and Statistical Accuracy When They are Public Goods," Papers 1906.09353, arXiv.org.
    7. Jennifer Peña & Elvira Prades, 2021. "Price setting in Chile: Micro evidence from consumer on-line prices during the social outbreak and Covid-19," Working Papers 2112, Banco de España.
    8. Cristina Miranda & Reinaldo Castro Souza & Mônica Barros & Cristina Vidigal Cabral de Miranda, 2007. "Short Term Demand Forecasting Using Double Exponential Smoothing and Interventions to Account for Holidays and Temperature Effects," EcoMod2007 23900058, EcoMod.
    9. Espasa, Antoni, 1993. "Modelling daily series of economic activity," DES - Working Papers. Statistics and Econometrics. WS 3682, Universidad Carlos III de Madrid. Departamento de Estadística.
    10. Markus Dertwinkel-Kalt & Mats Köster, 2017. "Salience and Online Sales: The Role of Brand Image Concerns," CESifo Working Paper Series 6787, CESifo.
    11. Fernando Alvarez & Francesco Lippi & Aleksei Oskolkov, 2022. "The Macroeconomics of Sticky Prices with Generalized Hazard Functions [“Optimal Inattention to the Stock Market With Information Costs and Transactions Costs,”]," The Quarterly Journal of Economics, Oxford University Press, vol. 137(2), pages 989-1038.
    12. Fernando Alvarez & Hervé Le Bihan & Francesco Lippi, 2016. "The Real Effects of Monetary Shocks in Sticky Price Models: A Sufficient Statistic Approach," American Economic Review, American Economic Association, vol. 106(10), pages 2817-2851, October.
    13. Amita Majumder & Ranjan Ray, 2020. "National and subnational purchasing power parity: a review," DECISION: Official Journal of the Indian Institute of Management Calcutta, Springer;Indian Institute of Management Calcutta, vol. 47(2), pages 103-124, June.
    14. J. Ignacio Conde-Ruiz & Manu García & Luis A. Puch & Jesús Ruiz, 2020. "Una metodología para el seguimiento de la afiliación a la Seguridad Social durante la crisis del Covid-19," Fedea Economy Notes 2020-06, FEDEA.
    15. Choi, Sangyup & Shin, Junhyeok, 2022. "Bitcoin: An inflation hedge but not a safe haven," Finance Research Letters, Elsevier, vol. 46(PB).
    16. Brian Fabo & Sharon Sarah Belli, 2017. "(Un)beliveable wages? An analysis of minimum wage policies in Europe from a living wage perspective," IZA Journal of Labor Policy, Springer;Forschungsinstitut zur Zukunft der Arbeit GmbH (IZA), vol. 6(1), pages 1-11, December.
    17. David Staines, 2023. "Stochastic Equilibrium the Lucas Critique and Keynesian Economics," Papers 2312.16214, arXiv.org, revised Jun 2024.
    18. Kanika Mahajan & Shekhar Tomar, 2021. "COVID‐19 and Supply Chain Disruption: Evidence from Food Markets in India†," American Journal of Agricultural Economics, John Wiley & Sons, vol. 103(1), pages 35-52, January.
    19. Justin Longo & Alan Rodney Dobell, 2018. "The Limits of Policy Analytics: Early Examples and the Emerging Boundary of Possibilities," Politics and Governance, Cogitatio Press, vol. 6(4), pages 5-17.
    20. Fernando Alvarez & Francesco Lippi, 2020. "Temporary Price Changes, Inflation Regimes, and the Propagation of Monetary Shocks," American Economic Journal: Macroeconomics, American Economic Association, vol. 12(1), pages 104-152, January.

    More about this item

    Keywords

    Aggregation;

    JEL classification:

    • C01 - Mathematical and Quantitative Methods - - General - - - Econometrics
    • C22 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Time-Series Models; Dynamic Quantile Regressions; Dynamic Treatment Effect Models; Diffusion Processes
    • C55 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Large Data Sets: Modeling and Analysis

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:cte:wsrepe:37746. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Ana Poveda (email available below). General contact details of provider: http://portal.uc3m.es/portal/page/portal/dpto_estadistica .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.