IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2305.00245.html
   My bibliography  Save this paper

Industry Classification Using a Novel Financial Time-Series Case Representation

Author

Listed:
  • Rian Dolphin
  • Barry Smyth
  • Ruihai Dong

Abstract

The financial domain has proven to be a fertile source of challenging machine learning problems across a variety of tasks including prediction, clustering, and classification. Researchers can access an abundance of time-series data and even modest performance improvements can be translated into significant additional value. In this work, we consider the use of case-based reasoning for an important task in this domain, by using historical stock returns time-series data for industry sector classification. We discuss why time-series data can present some significant representational challenges for conventional case-based reasoning approaches, and in response, we propose a novel representation based on stock returns embeddings, which can be readily calculated from raw stock returns data. We argue that this representation is well suited to case-based reasoning and evaluate our approach using a large-scale public dataset for the industry sector classification task, demonstrating substantial performance improvements over several baselines using more conventional representations.

Suggested Citation

  • Rian Dolphin & Barry Smyth & Ruihai Dong, 2023. "Industry Classification Using a Novel Financial Time-Series Case Representation," Papers 2305.00245, arXiv.org.
  • Handle: RePEc:arx:papers:2305.00245
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2305.00245
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Rian Dolphin & Barry Smyth & Yang Xu & Ruihai Dong, 2021. "Measuring Financial Time Series Similarity With a View to Identifying Profitable Stock Market Opportunities," Papers 2107.03926, arXiv.org.
    2. Rian Dolphin & Barry Smyth & Ruihai Dong, 2022. "Stock Embeddings: Learning Distributed Representations for Financial Assets," Papers 2202.08968, arXiv.org.
    3. Christian Weiner, 2005. "The Impact of Industry Classification Schemes on Financial Research," SFB 649 Discussion Papers SFB649DP2005-062, Sonderforschungsbereich 649, Humboldt University, Berlin, Germany.
    4. Bhaskarjit Sarmah & Nayana Nair & Dhagash Mehta & Stefano Pasquali, 2022. "Learning Embedded Representation of the Stock Correlation Matrix using Graph Machine Learning," Papers 2207.07183, arXiv.org.
    5. Se-Hak Chun & Young-Woong Ko, 2020. "Geometric Case Based Reasoning for Stock Market Prediction," Sustainability, MDPI, vol. 12(17), pages 1-11, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Dimitrios Vamvourellis & M'at'e Toth & Snigdha Bhagat & Dhruv Desai & Dhagash Mehta & Stefano Pasquali, 2023. "Company Similarity using Large Language Models," Papers 2308.08031, arXiv.org.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Rian Dolphin & Barry Smyth & Ruihai Dong, 2022. "A Multimodal Embedding-Based Approach to Industry Classification in Financial Markets," Papers 2211.06378, arXiv.org.
    2. Dimitrios Vamvourellis & M'at'e Toth & Snigdha Bhagat & Dhruv Desai & Dhagash Mehta & Stefano Pasquali, 2023. "Company Similarity using Large Language Models," Papers 2308.08031, arXiv.org.
    3. Rian Dolphin & Barry Smyth & Ruihai Dong, 2022. "Stock Embeddings: Learning Distributed Representations for Financial Assets," Papers 2202.08968, arXiv.org.
    4. Zhou Fang, 2023. "Continuous-Time Path-Dependent Exploratory Mean-Variance Portfolio Construction," Papers 2303.02298, arXiv.org.
    5. Liping Wang & Jiawei Li & Lifan Zhao & Zhizhuo Kou & Xiaohan Wang & Xinyi Zhu & Hao Wang & Yanyan Shen & Lei Chen, 2023. "Methods for Acquiring and Incorporating Knowledge into Stock Price Prediction: A Survey," Papers 2308.04947, arXiv.org.
    6. Tobias Keller & Martin Glaum & Andreas Bausch & Thorsten Bunz, 2023. "The “CEO in context” technique revisited: A replication and extension of Hambrick and Quigley (2014)," Strategic Management Journal, Wiley Blackwell, vol. 44(4), pages 1111-1138, April.
    7. Abdul-Rahman Khokhar, 2019. "Working Capital Investment: A Comparative Study - Canada Versus the United States," Multinational Finance Journal, Multinational Finance Journal, vol. 23(1-2), pages 65-102, March - J.
    8. Qi Tang & Tongmei Fan & Ruchen Shi & Jingyan Huang & Yidan Ma, 2021. "Prediction of financial time series using LSTM and data denoising methods," Papers 2103.03505, arXiv.org.
    9. Se-Hak Chun & Jae-Won Jang, 2022. "A New Trend Pattern-Matching Method of Interactive Case-Based Reasoning for Stock Price Predictions," Sustainability, MDPI, vol. 14(3), pages 1-15, January.
    10. Yuanyang Liu & Gautam Pant & Olivia R. L. Sheng, 2020. "Predicting Labor Market Competition: Leveraging Interfirm Network and Employee Skills," Information Systems Research, INFORMS, vol. 31(4), pages 1443-1466, December.
    11. Rudolph, Christin & Schwetzler, Bernhard, 2014. "Mountain or molehill? Downward biases in the conglomerate discount measure," Journal of Banking & Finance, Elsevier, vol. 40(C), pages 420-431.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2305.00245. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.