IDEAS home Printed from https://ideas.repec.org/p/zbw/iwqwdp/132017.html
   My bibliography  Save this paper

Exploiting social media with higher-order Factorization Machines: Statistical arbitrage on high-frequency data of the S&P 500

Author

Listed:
  • Knoll, Julian
  • Stübinger, Johannes
  • Grottke, Michael

Abstract

Over the past 15 years,there have been a number of studies using text mining for predicting stock market data. Two recent publications employed support vector machines and second-order Factorization Machines, respectively, to this end. However, these approaches either completely neglect interactions between the features extracted from the text, or they only account for second-order interactions. In thispaper, weapply higher-order Factorization Machines, for which efficient training algorithms have only been available since 2016. As Factorization Machines require hyperparameters to be specified, we also introduce the novel adaptive-order algorithm for automatically determining them. Our studyis the first one tomake use of social media data for predicting high-frequency stock returns, namely the ones of the S&P 500 stock constituents. We show that, unlike a trading strategy employing support vector machines, Factorization-Machine-based strategies attain positive returns after transactions costs for the years 2014 and 2015. Especially the approach applying thea daptive-order algorithm outperforms classical approaches with respect to a multitude of criteria, and it features very favorable characteristics.

Suggested Citation

  • Knoll, Julian & Stübinger, Johannes & Grottke, Michael, 2017. "Exploiting social media with higher-order Factorization Machines: Statistical arbitrage on high-frequency data of the S&P 500," FAU Discussion Papers in Economics 13/2017, Friedrich-Alexander University Erlangen-Nuremberg, Institute for Economics.
  • Handle: RePEc:zbw:iwqwdp:132017
    as

    Download full text from publisher

    File URL: https://www.econstor.eu/bitstream/10419/162392/1/889293619.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Krauss, Christopher & Do, Xuan Anh & Huck, Nicolas, 2017. "Deep neural networks, gradient-boosted trees, random forests: Statistical arbitrage on the S&P 500," European Journal of Operational Research, Elsevier, vol. 259(2), pages 689-702.
    2. Huck, Nicolas, 2009. "Pairs selection and outranking: An application to the S&P 100 index," European Journal of Operational Research, Elsevier, vol. 196(2), pages 819-825, July.
    3. Evan Gatev & William N. Goetzmann & K. Geert Rouwenhorst, 2006. "Pairs Trading: Performance of a Relative-Value Arbitrage Rule," The Review of Financial Studies, Society for Financial Studies, vol. 19(3), pages 797-827.
    4. Stübinger, Johannes & Mangold, Benedikt & Krauss, Christopher, 2016. "Statistical arbitrage with vine copulas," FAU Discussion Papers in Economics 11/2016, Friedrich-Alexander University Erlangen-Nuremberg, Institute for Economics.
    5. Stübinger, Johannes & Endres, Sylvia, 2017. "Pairs trading with a mean-reverting jump-diffusion model on high-frequency data," FAU Discussion Papers in Economics 10/2017, Friedrich-Alexander University Erlangen-Nuremberg, Institute for Economics.
    6. Christopher Krauss & Anh Do & Nicolas Huck, 2017. "Deep neural networks, gradient-boosted trees, random forests: Statistical arbitrage on the S&P 500," Post-Print hal-01768895, HAL.
    7. Fama, Eugene F. & French, Kenneth R., 2015. "A five-factor asset pricing model," Journal of Financial Economics, Elsevier, vol. 116(1), pages 1-22.
    8. David A. Bowen & Mark C. Hutchinson, 2016. "Pairs trading in the UK equity market: risk and return," The European Journal of Finance, Taylor & Francis Journals, vol. 22(14), pages 1363-1387, November.
    9. Werner Antweiler & Murray Z. Frank, 2004. "Is All That Talk Just Noise? The Information Content of Internet Stock Message Boards," Journal of Finance, American Finance Association, vol. 59(3), pages 1259-1294, June.
    10. Huck, Nicolas, 2010. "Pairs trading and outranking: The multi-step-ahead forecasting case," European Journal of Operational Research, Elsevier, vol. 207(3), pages 1702-1716, December.
    11. Marco Avellaneda & Jeong-Hyun Lee, 2010. "Statistical arbitrage in the US equities market," Quantitative Finance, Taylor & Francis Journals, vol. 10(7), pages 761-782.
    12. Fama, Eugene F & French, Kenneth R, 1996. "Multifactor Explanations of Asset Pricing Anomalies," Journal of Finance, American Finance Association, vol. 51(1), pages 55-84, March.
    13. Jegadeesh, Narasimhan & Titman, Sheridan, 1993. "Returns to Buying Winners and Selling Losers: Implications for Stock Market Efficiency," Journal of Finance, American Finance Association, vol. 48(1), pages 65-91, March.
    14. Hossein Rad & Rand Kwong Yew Low & Robert Faff, 2016. "The profitability of pairs trading strategies: distance, cointegration and copula methods," Quantitative Finance, Taylor & Francis Journals, vol. 16(10), pages 1541-1558, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Flori, Andrea & Regoli, Daniele, 2021. "Revealing Pairs-trading opportunities with long short-term memory networks," European Journal of Operational Research, Elsevier, vol. 295(2), pages 772-791.
    2. Stübinger, Johannes & Endres, Sylvia, 2017. "Pairs trading with a mean-reverting jump-diffusion model on high-frequency data," FAU Discussion Papers in Economics 10/2017, Friedrich-Alexander University Erlangen-Nuremberg, Institute for Economics.
    3. Matthew Clegg & Christopher Krauss, 2018. "Pairs trading with partial cointegration," Quantitative Finance, Taylor & Francis Journals, vol. 18(1), pages 121-138, January.
    4. Fischer, Thomas & Krauss, Christopher, 2018. "Deep learning with long short-term memory networks for financial market predictions," European Journal of Operational Research, Elsevier, vol. 270(2), pages 654-669.
    5. Han, Chulwoo & He, Zhaodong & Toh, Alenson Jun Wei, 2023. "Pairs trading via unsupervised learning," European Journal of Operational Research, Elsevier, vol. 307(2), pages 929-947.
    6. Johannes Stübinger & Sylvia Endres, 2018. "Pairs trading with a mean-reverting jump–diffusion model on high-frequency data," Quantitative Finance, Taylor & Francis Journals, vol. 18(10), pages 1735-1751, October.
    7. Endres, Sylvia & Stübinger, Johannes, 2017. "Optimal trading strategies for Lévy-driven Ornstein-Uhlenbeck processes," FAU Discussion Papers in Economics 17/2017, Friedrich-Alexander University Erlangen-Nuremberg, Institute for Economics.
    8. Johannes St binger & Jens Bredthauer, 2017. "Statistical Arbitrage Pairs Trading with High-frequency Data," International Journal of Economics and Financial Issues, Econjournals, vol. 7(4), pages 650-662.
    9. Fischer, Thomas & Krauss, Christopher, 2017. "Deep learning with long short-term memory networks for financial market predictions," FAU Discussion Papers in Economics 11/2017, Friedrich-Alexander University Erlangen-Nuremberg, Institute for Economics.
    10. Stübinger, Johannes, 2018. "Statistical arbitrage with optimal causal paths on high-frequencydata of the S&P 500," FAU Discussion Papers in Economics 01/2018, Friedrich-Alexander University Erlangen-Nuremberg, Institute for Economics.
    11. Rubesam, Alexandre, 2022. "Machine learning portfolios with equal risk contributions: Evidence from the Brazilian market," Emerging Markets Review, Elsevier, vol. 51(PB).
    12. Krauss, Christopher & Do, Xuan Anh & Huck, Nicolas, 2017. "Deep neural networks, gradient-boosted trees, random forests: Statistical arbitrage on the S&P 500," European Journal of Operational Research, Elsevier, vol. 259(2), pages 689-702.
    13. Stübinger, Johannes & Walter, Dominik & Knoll, Julian, 2017. "Financial market predictions with Factorization Machines: Trading the opening hour based on overnight social media data," FAU Discussion Papers in Economics 19/2017, Friedrich-Alexander University Erlangen-Nuremberg, Institute for Economics.
    14. Thomas Günter Fischer & Christopher Krauss & Alexander Deinert, 2019. "Statistical Arbitrage in Cryptocurrency Markets," JRFM, MDPI, vol. 12(1), pages 1-15, February.
    15. Kasper Johansson & Thomas Schmelzer & Stephen Boyd, 2024. "Finding Moving-Band Statistical Arbitrages via Convex-Concave Optimization," Papers 2402.08108, arXiv.org.
    16. Krauss, Christopher, 2015. "Statistical arbitrage pairs trading strategies: Review and outlook," FAU Discussion Papers in Economics 09/2015, Friedrich-Alexander University Erlangen-Nuremberg, Institute for Economics.
    17. Pushpendu Ghosh & Ariel Neufeld & Jajati Keshari Sahoo, 2020. "Forecasting directional movements of stock prices for intraday trading using LSTM and random forests," Papers 2004.10178, arXiv.org, revised Jun 2021.
    18. Endres, Sylvia & Stübinger, Johannes, 2018. "A flexible regime switching model with pairs trading application to the S&P 500 high-frequency stock returns," FAU Discussion Papers in Economics 07/2018, Friedrich-Alexander University Erlangen-Nuremberg, Institute for Economics.
    19. Clegg, Matthew & Krauss, Christopher, 2016. "Pairs trading with partial cointegration," FAU Discussion Papers in Economics 05/2016, Friedrich-Alexander University Erlangen-Nuremberg, Institute for Economics.
    20. Law, K.F. & Li, W.K. & Yu, Philip L.H., 2018. "A single-stage approach for cointegration-based pairs trading," Finance Research Letters, Elsevier, vol. 26(C), pages 177-184.

    More about this item

    Keywords

    finance; factorization machine; social media; statistical arbitrage; high-frequency data;
    All these keywords.

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:zbw:iwqwdp:132017. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ZBW - Leibniz Information Centre for Economics (email available below). General contact details of provider: https://edirc.repec.org/data/vierlde.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.