IDEAS home Printed from https://ideas.repec.org/a/spr/metcap/v27y2025i2d10.1007_s11009-025-10148-8.html
   My bibliography  Save this article

Generative-Discriminative Machine Learning Models for High-Frequency Financial Regime Classification

Author

Listed:
  • Andreas Koukorinis

    (University College London
    Trading)

  • Gareth W. Peters

    (University of California
    London School of Economics and Political Science
    University of Oxford)

  • Guido Germano

    (University College London
    London School of Economics and Political Science)

Abstract

We combine a hidden Markov model (HMM) and a kernel machine (SVM/MKL) into a hybrid HMM-SVM/MKL generative-discriminative learning approach to accurately classify high-frequency financial regimes and predict the direction of trades. We capture temporal dependencies and key stylized facts in high-frequency financial time series by integrating the HMM to produce model-based generative feature embeddings from microstructure time series data. These generative embeddings then serve as inputs to a SVM with single- and multi-kernel (MKL) formulations for predictive discrimination. Our methodology, which does not require manual feature engineering, improves classification accuracy compared to single-kernel SVMs and kernel target alignment methods. It also outperforms both logistic classifier and feed-forward networks. This hybrid HMM-SVM-MKL approach shows high-frequency time-series classification improvements that can significantly benefit applications in finance.

Suggested Citation

  • Andreas Koukorinis & Gareth W. Peters & Guido Germano, 2025. "Generative-Discriminative Machine Learning Models for High-Frequency Financial Regime Classification," Methodology and Computing in Applied Probability, Springer, vol. 27(2), pages 1-32, June.
  • Handle: RePEc:spr:metcap:v:27:y:2025:i:2:d:10.1007_s11009-025-10148-8
    DOI: 10.1007/s11009-025-10148-8
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11009-025-10148-8
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11009-025-10148-8?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. BAUWENS, Luc & VEREDAS, David, 1999. "The stochastic conditional duration model: a latent factor model for the analysis of financial durations," LIDAM Discussion Papers CORE 1999058, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
    2. Krauss, Christopher & Do, Xuan Anh & Huck, Nicolas, 2017. "Deep neural networks, gradient-boosted trees, random forests: Statistical arbitrage on the S&P 500," European Journal of Operational Research, Elsevier, vol. 259(2), pages 689-702.
    3. Peter C. B. Phillips & Zhijie Xiao, 1998. "A Primer on Unit Root Testing," Journal of Economic Surveys, Wiley Blackwell, vol. 12(5), pages 423-470, December.
    4. Matthew Dixon & Diego Klabjan & Jin Hoon Bang, 2017. "Classification-based financial markets prediction using deep neural networks," Algorithmic Finance, IOS Press, vol. 6(3-4), pages 67-77.
    5. Andrew Ang & Allan Timmermann, 2012. "Regime Changes and Financial Markets," Annual Review of Financial Economics, Annual Reviews, vol. 4(1), pages 313-337, October.
    6. Robert F. Engle & Jeffrey R. Russell, 1998. "Autoregressive Conditional Duration: A New Model for Irregularly Spaced Transaction Data," Econometrica, Econometric Society, vol. 66(5), pages 1127-1162, September.
    7. Bauwens, Luc & Veredas, David, 2004. "The stochastic conditional duration model: a latent variable model for the analysis of financial durations," Journal of Econometrics, Elsevier, vol. 119(2), pages 381-412, April.
    8. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," Review of Finance, European Finance Association, vol. 33(5), pages 2223-2273.
    9. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2223-2273.
    10. Tristan Fletcher & John Shawe-Taylor, 2013. "Multiple Kernel Learning with Fisher Kernels for High Frequency Currency Prediction," Computational Economics, Springer;Society for Computational Economics, vol. 42(2), pages 217-240, August.
    11. R. Cont, 2001. "Empirical properties of asset returns: stylized facts and statistical issues," Quantitative Finance, Taylor & Francis Journals, vol. 1(2), pages 223-236.
    12. Kozhan, Roman & Salmon, Mark, 2012. "The information content of a limit order book: The case of an FX market," Journal of Financial Markets, Elsevier, vol. 15(1), pages 1-28.
    13. Jennifer Pohle & Roland Langrock & Floris M. Beest & Niels Martin Schmidt, 2017. "Selecting the Number of States in Hidden Markov Models: Pragmatic Solutions Illustrated Using Animal Movement," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 22(3), pages 270-293, September.
    14. Alain P. Chaboud & Benjamin Chiquoine & Erik Hjalmarsson & Clara Vega, 2014. "Rise of the Machines: Algorithmic Trading in the Foreign Exchange Market," Journal of Finance, American Finance Association, vol. 69(5), pages 2045-2084, October.
    15. Hamilton, James D., 1990. "Analysis of time series subject to changes in regime," Journal of Econometrics, Elsevier, vol. 45(1-2), pages 39-70.
    16. Justin Sirignano & Rama Cont, 2019. "Universal features of price formation in financial markets: perspectives from deep learning," Quantitative Finance, Taylor & Francis Journals, vol. 19(9), pages 1449-1459, September.
    17. O. E. Barndorff-Nielsen & P. Reinhard Hansen & A. Lunde & N. Shephard, 2009. "Realized kernels in practice: trades and quotes," Econometrics Journal, Royal Economic Society, vol. 12(3), pages 1-32, November.
    18. Álvaro Cartea & Sebastian Jaimungal, 2015. "Risk Metrics And Fine Tuning Of High-Frequency Trading Strategies," Mathematical Finance, Wiley Blackwell, vol. 25(3), pages 576-611, July.
    19. repec:bla:jecsur:v:12:y:1998:i:5:p:423-69 is not listed on IDEAS
    20. V. Filimonov & D. Sornette, 2015. "Apparent criticality and calibration issues in the Hawkes self-excited point process model: application to high-frequency financial data," Quantitative Finance, Taylor & Francis Journals, vol. 15(8), pages 1293-1314, August.
    21. Kay H Brodersen & Thomas M Schofield & Alexander P Leff & Cheng Soon Ong & Ekaterina I Lomakina & Joachim M Buhmann & Klaas E Stephan, 2011. "Generative Embedding for Model-Based Classification of fMRI Data," PLOS Computational Biology, Public Library of Science, vol. 7(6), pages 1-19, June.
    22. �lvaro Cartea & Sebastian Jaimungal, 2013. "Modelling Asset Prices for Algorithmic and High-Frequency Trading," Applied Mathematical Finance, Taylor & Francis Journals, vol. 20(6), pages 512-547, December.
    23. Tay, Francis E. H. & Cao, Lijuan, 2001. "Application of support vector machines in financial time series forecasting," Omega, Elsevier, vol. 29(4), pages 309-317, August.
    24. M. Bigeco & E. Grosso & E. Otranto, 2008. "Recognizing and Forecasting the Sign of Financial Local Trends using Hidden Markov Models," Working Paper CRENoS 200803, Centre for North South Economic Research, University of Cagliari and Sassari, Sardinia.
    25. Wang, Na & Li, Dong & Wang, Qiwen, 2012. "Visibility graph analysis on quarterly macroeconomic series of China based on complex network theory," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 391(24), pages 6543-6555.
    26. Christopher Krauss & Anh Do & Nicolas Huck, 2017. "Deep neural networks, gradient-boosted trees, random forests: Statistical arbitrage on the S&P 500," Post-Print hal-01768895, HAL.
    27. Gençay, Ramazan & Dacorogna, Michel & Muller, Ulrich A. & Pictet, Olivier & Olsen, Richard, 2001. "An Introduction to High-Frequency Finance," Elsevier Monographs, Elsevier, edition 1, number 9780122796715.
    28. Andrei Kirilenko & Albert S. Kyle & Mehrdad Samadi & Tugkan Tuzun, 2017. "The Flash Crash: High-Frequency Trading in an Electronic Market," Journal of Finance, American Finance Association, vol. 72(3), pages 967-998, June.
    29. Vanden, Joel M., 2005. "Equilibrium analysis of volatility clustering," Journal of Empirical Finance, Elsevier, vol. 12(3), pages 374-417, June.
    30. Andersen, Torben G, 1996. "Return Volatility and Trading Volume: An Information Flow Interpretation of Stochastic Volatility," Journal of Finance, American Finance Association, vol. 51(1), pages 169-204, March.
    31. Aldrich, Eric M. & Heckenbach, Indra & Laughlin, Gregory, 2016. "A compound duration model for high-frequency asset returns," Journal of Empirical Finance, Elsevier, vol. 39(PA), pages 105-128.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xiufeng Yan, 2021. "Autoregressive conditional duration modelling of high frequency data," Papers 2111.02300, arXiv.org.
    2. Xiufeng Yan, 2021. "Multiplicative Component GARCH Model of Intraday Volatility," Papers 2111.02376, arXiv.org.
    3. Carè, Rosella & Cumming, Douglas, 2024. "Technology and automation in financial trading: A bibliometric review," Research in International Business and Finance, Elsevier, vol. 71(C).
    4. Detlef Seese & Christof Weinhardt & Frank Schlottmann (ed.), 2008. "Handbook on Information Technology in Finance," International Handbooks on Information Systems, Springer, number 978-3-540-49487-4, December.
    5. Qian, Yihe & Zhang, Yang, 2025. "Long-term forecasting in asset pricing: Machine learning models’ sensitivity to macroeconomic shifts and firm-specific factors," The North American Journal of Economics and Finance, Elsevier, vol. 78(C).
    6. Zhou, Hao & Kalev, Petko S., 2019. "Algorithmic and high frequency trading in Asia-Pacific, now and the future," Pacific-Basin Finance Journal, Elsevier, vol. 53(C), pages 186-207.
    7. Daniel Cunha Oliveira & Yutong Lu & Xi Lin & Mihai Cucuringu & Andre Fujita, 2024. "Causality-Inspired Models for Financial Time Series Forecasting," Papers 2408.09960, arXiv.org.
    8. Rubesam, Alexandre, 2022. "Machine learning portfolios with equal risk contributions: Evidence from the Brazilian market," Emerging Markets Review, Elsevier, vol. 51(PB).
    9. Guillaume Chevalier & Guillaume Coqueret & Thomas Raffinot, 2022. "Supervised portfolios," Post-Print hal-04144588, HAL.
    10. Jing Hao & Feng He & Feng Ma & Shibo Zhang & Xiaotao Zhang, 2025. "Machine learning vs deep learning in stock market investment: an international evidence," Annals of Operations Research, Springer, vol. 348(1), pages 93-115, May.
    11. Pedro M. Mirete-Ferrer & Alberto Garcia-Garcia & Juan Samuel Baixauli-Soler & Maria A. Prats, 2022. "A Review on Machine Learning for Asset Management," Risks, MDPI, vol. 10(4), pages 1-46, April.
    12. Chaeshick Chung & Sukjin Park, 2021. "Deep Learning Market Microstructure: Dual-Stage Attention-Based Recurrent Neural Networks," Working Papers 2108, Nam Duck-Woo Economic Research Institute, Sogang University (Former Research Institute for Market Economy).
    13. Philippe Goulet Coulombe & Maximilian Gobel, 2023. "Maximally Machine-Learnable Portfolios," Working Papers 23-01, Chair in macroeconomics and forecasting, University of Quebec in Montreal's School of Management, revised Apr 2023.
    14. Rama Cont & Mihai Cucuringu & Chao Zhang, 2021. "Cross-Impact of Order Flow Imbalance in Equity Markets," Papers 2112.13213, arXiv.org, revised Jun 2023.
    15. Katarzyna Bien-Barkowska, 2012. ""Does it take volume to move fx rates?" Evidence from quantile regressions," Dynamic Econometric Models, Uniwersytet Mikolaja Kopernika, vol. 12, pages 35-52.
    16. Theo Berger, 2025. "On the information content of explainable artificial intelligence for quantitative approaches in finance," OR Spectrum: Quantitative Approaches in Management, Springer;Gesellschaft für Operations Research e.V., vol. 47(1), pages 177-203, March.
    17. Carlo Campajola & Domenico Di Gangi & Fabrizio Lillo & Daniele Tantari, 2020. "Modelling time-varying interactions in complex systems: the Score Driven Kinetic Ising Model," Papers 2007.15545, arXiv.org, revised Aug 2021.
    18. Thierry Warin & Aleksandar Stojkov, 2021. "Machine Learning in Finance: A Metadata-Based Systematic Review of the Literature," JRFM, MDPI, vol. 14(7), pages 1-31, July.
    19. DOLADO , Juan J. & RODRIGUEZ-POO, Juan & VEREDAS, David, 2004. "Testing weak exogeneity in the exponential family : an application to financial point processes," LIDAM Discussion Papers CORE 2004049, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
    20. Petropoulos, Fotios & Apiletti, Daniele & Assimakopoulos, Vassilios & Babai, Mohamed Zied & Barrow, Devon K. & Ben Taieb, Souhaib & Bergmeir, Christoph & Bessa, Ricardo J. & Bijak, Jakub & Boylan, Joh, 2022. "Forecasting: theory and practice," International Journal of Forecasting, Elsevier, vol. 38(3), pages 705-871.
      • Fotios Petropoulos & Daniele Apiletti & Vassilios Assimakopoulos & Mohamed Zied Babai & Devon K. Barrow & Souhaib Ben Taieb & Christoph Bergmeir & Ricardo J. Bessa & Jakub Bijak & John E. Boylan & Jet, 2020. "Forecasting: theory and practice," Papers 2012.03854, arXiv.org, revised Jan 2022.

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:metcap:v:27:y:2025:i:2:d:10.1007_s11009-025-10148-8. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.