IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2511.05556.html

Daily Forecasting for Annual Time Series Datasets Using Similarity-Based Machine Learning Methods: A Case Study in the Energy Market

Author

Listed:
  • Mahdi Goldani

Abstract

The policy environment of countries changes rapidly, influencing macro-level indicators such as the Energy Security Index. However, this index is only reported annually, limiting its responsiveness to short-term fluctuations. To address this gap, the present study introduces a daily proxy for the Energy Security Index and applies it to forecast energy security at a daily frequency.The study employs a two stage approach first, a suitable daily proxy for the annual Energy Security Index is identified by applying six time series similarity measures to key energy related variables. Second, the selected proxy is modeled using the XGBoost algorithm to generate 15 day ahead forecasts, enabling high frequency monitoring of energy security dynamics.As the result of proxy choosing, Volume Brent consistently emerged as the most suitable proxy across the majority of methods. The model demonstrated strong performance, with an R squared of 0.981 on the training set and 0.945 on the test set, and acceptable error metrics . The 15 day forecast of Brent volume indicates short term fluctuations, with a peak around day 4, a decline until day 8, a rise near day 10, and a downward trend toward day 15, accompanied by prediction intervals.By integrating time series similarity measures with machine learning based forecasting, this study provides a novel framework for converting low frequency macroeconomic indicators into high frequency, actionable signals. The approach enables real time monitoring of the Energy Security Index, offering policymakers and analysts a scalable and practical tool to respond more rapidly to fast changing policy and market conditions, especially in data scarce environments.

Suggested Citation

  • Mahdi Goldani, 2025. "Daily Forecasting for Annual Time Series Datasets Using Similarity-Based Machine Learning Methods: A Case Study in the Energy Market," Papers 2511.05556, arXiv.org.
  • Handle: RePEc:arx:papers:2511.05556
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2511.05556
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Jaime Vera-Jaramillo, 2025. "tempdisagg: A Python Framework for Temporal Disaggregation of Time Series Data," Papers 2503.22054, arXiv.org.
    2. Giannone, Domenico & Reichlin, Lucrezia & Small, David, 2008. "Nowcasting: The real-time informational content of macroeconomic data," Journal of Monetary Economics, Elsevier, vol. 55(4), pages 665-676, May.
    3. Luke Mosley & Idris Eckley & Alex Gibberd, 2021. "Sparse Temporal Disaggregation," Papers 2108.05783, arXiv.org, revised Oct 2022.
    4. Luke Mosley & Idris A. Eckley & Alex Gibberd, 2022. "Sparse temporal disaggregation," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(4), pages 2203-2233, October.
    5. Zhang, Lan & Mykland, Per A. & Ait-Sahalia, Yacine, 2005. "A Tale of Two Time Scales: Determining Integrated Volatility With Noisy High-Frequency Data," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 1394-1411, December.
    6. Martin Feldkircher & Florian Huber & Michael Pfarrhofer, 2021. "Measuring the effectiveness of US monetary policy during the COVID‐19 recession," Scottish Journal of Political Economy, Scottish Economic Society, vol. 68(3), pages 287-297, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Atin Aboutorabi & Ga'etan de Rassenfosse, 2024. "Nowcasting R&D Expenditures: A Machine Learning Approach," Papers 2407.11765, arXiv.org.
    2. Klaus Abberger & Michael Graff & Oliver Müller & Boriss Siliverstovs, 2023. "Imputing Monthly Values for Quarterly Time Series: An Application Performed with Swiss Business Cycle Data," Journal of Business Cycle Research, Springer;Centre for International Research on Economic Tendency Surveys (CIRET), vol. 19(3), pages 241-273, November.
    3. Valter Di Giacinto & Vincenzo Mariani & Daniele Ruggeri & Giuseppe Saporito & Andrea Sechi & Giovanni Soggia & Andrea Venturini & Antonio Veronico, 2025. "Timely quarterly indicators of household consumption and disposable income for the Italian regions," Questioni di Economia e Finanza (Occasional Papers) 970, Bank of Italy, Economic Research and International Relations Area.
    4. Lahiri, Kajal & Monokroussos, George & Zhao, Yongchen, 2013. "The yield spread puzzle and the information content of SPF forecasts," Economics Letters, Elsevier, vol. 118(1), pages 219-221.
    5. Zhao, Zhibiao & Wu, Wei Biao, 2009. "Nonparametric inference of discretely sampled stable Lévy processes," Journal of Econometrics, Elsevier, vol. 153(1), pages 83-92, November.
    6. Winkelried, Diego, 2012. "Predicting quarterly aggregates with monthly indicators," Working Papers 2012-023, Banco Central de Reserva del Perú.
    7. Peter Yegon & W. Brent Lindquist & Svetlozar T. Rachev, 2025. "Asset Pricing in the Presence of Market Microstructure Noise," Papers 2511.00308, arXiv.org.
    8. Toshiaki Ogawa & Masato Ubukata & Toshiaki Watanabe, 2020. "Stock Return Predictability and Variance Risk Premia around the ZLB," IMES Discussion Paper Series 20-E-09, Institute for Monetary and Economic Studies, Bank of Japan.
    9. Takahashi, Makoto & Watanabe, Toshiaki & Omori, Yasuhiro, 2016. "Volatility and quantile forecasts by realized stochastic volatility models with generalized hyperbolic distribution," International Journal of Forecasting, Elsevier, vol. 32(2), pages 437-457.
    10. Chuong Luong & Nikolai Dokuchaev, 2016. "Modeling Dependency Of Volatility On Sampling Frequency Via Delay Equations," Annals of Financial Economics (AFE), World Scientific Publishing Co. Pte. Ltd., vol. 11(02), pages 1-21, June.
    11. Brave, Scott A. & Gascon, Charles & Kluender, William & Walstrum, Thomas, 2021. "Predicting benchmarked US state employment data in real time," International Journal of Forecasting, Elsevier, vol. 37(3), pages 1261-1275.
    12. Diakonova, Marina & Molina, Luis & Mueller, Hannes & Pérez, Javier J. & Rauh, Christopher, 2024. "The information content of conflict, social unrest and policy uncertainty measures for macroeconomic forecasting," Latin American Journal of Central Banking (previously Monetaria), Elsevier, vol. 5(4).
    13. Andrea Carriero & Todd E. Clark & Massimiliano Marcellino, 2015. "Realtime nowcasting with a Bayesian mixed frequency model with stochastic volatility," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 178(4), pages 837-862, October.
    14. Aastveit, Knut Are & Jore, Anne Sofie & Ravazzolo, Francesco, 2016. "Identification and real-time forecasting of Norwegian business cycles," International Journal of Forecasting, Elsevier, vol. 32(2), pages 283-292.
    15. Pedro Afonso Fernandes, 2024. "Forecasting with Neuro-Dynamic Programming," Papers 2404.03737, arXiv.org.
    16. Robert Ambrisko, 2022. "Nowcasting Macroeconomic Variables Using High-Frequency Fiscal Data," Working Papers 2022/5, Czech National Bank, Research and Statistics Department.
    17. Libero Monteforte & Valentina Raponi, 2019. "Short‐term forecasts of economic activity: Are fortnightly factors useful?," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 38(3), pages 207-221, April.
    18. Podolskij, Mark & Vetter, Mathias, 2009. "Bipower-type estimation in a noisy diffusion setting," Stochastic Processes and their Applications, Elsevier, vol. 119(9), pages 2803-2831, September.
    19. Richard K. Crump & Stefano Eusepi & Domenico Giannone & Eric Qian & Argia M. Sbordone, 2021. "A Large Bayesian VAR of the United States Economy," Staff Reports 976, Federal Reserve Bank of New York.
    20. Altmeyer, Randolf & Bibinger, Markus, 2015. "Functional stable limit theorems for quasi-efficient spectral covolatility estimators," Stochastic Processes and their Applications, Elsevier, vol. 125(12), pages 4556-4600.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2511.05556. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.