IDEAS home Printed from https://ideas.repec.org/p/arx/papers/1912.06236.html
   My bibliography  Save this paper

Automatic Financial Feature Construction

Author

Listed:
  • Jie Fang
  • Shutao Xia
  • Jianwu Lin
  • Yong Jiang

Abstract

In automatic financial feature construction task, the state-of-the-art technic leverages reverse polish expression to represent the features, then use genetic programming (GP) to conduct its evolution process. In this paper, we propose a new framework based on neural network, alpha discovery neural network (ADNN). In this work, we made several contributions. Firstly, in this task, we make full use of neural network overwhelming advantage in feature extraction to construct highly informative features. Secondly, we use domain knowledge to design the object function, batch size, and sampling rules. Thirdly, we use pre-training to replace the GP evolution process. According to neural network universal approximation theorem, pre-training can conduct a more effective and explainable evolution process. Experiment shows that ADNN can remarkably produce more diversified and higher informative features than GP. Besides, ADNN can serve as a data augmentation algorithm. It further improves the the performance of financial features constructed by GP.

Suggested Citation

  • Jie Fang & Shutao Xia & Jianwu Lin & Yong Jiang, 2019. "Automatic Financial Feature Construction," Papers 1912.06236, arXiv.org, revised Oct 2020.
  • Handle: RePEc:arx:papers:1912.06236
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/1912.06236
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Zura Kakushadze, 2016. "101 Formulaic Alphas," Papers 1601.00991, arXiv.org, revised Mar 2016.
    2. Changqing Cheng & Akkarapol Sa-Ngasoongsong & Omer Beyca & Trung Le & Hui Yang & Zhenyu (James) Kong & Satish T.S. Bukkapatnam, 2015. "Time series forecasting for nonlinear and non-stationary processes: a review and comparative study," IISE Transactions, Taylor & Francis Journals, vol. 47(10), pages 1053-1071, October.
    3. Gan, Lirong & Wang, Huamao & Yang, Zhaojun, 2020. "Machine learning solutions to challenges in finance: An application to the pricing of financial products," Technological Forecasting and Social Change, Elsevier, vol. 153(C).
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Gürdal Ertek & Lakshmi Kailas, 2021. "Analyzing a Decade of Wind Turbine Accident News with Topic Modeling," Sustainability, MDPI, vol. 13(22), pages 1-34, November.
    2. Jie Fang & Shutao Xia & Jianwu Lin & Zhikang Xia & Xiang Liu & Yong Jiang, 2019. "Alpha Discovery Neural Network based on Prior Knowledge," Papers 1912.11761, arXiv.org, revised Nov 2020.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ozdemir, Ali Can & Buluş, Kurtuluş & Zor, Kasım, 2022. "Medium- to long-term nickel price forecasting using LSTM and GRU networks," Resources Policy, Elsevier, vol. 78(C).
    2. Jie Fang & Jianwu Lin & Shutao Xia & Yong Jiang & Zhikang Xia & Xiang Liu, 2020. "Neural Network-based Automatic Factor Construction," Papers 2008.06225, arXiv.org, revised Oct 2020.
    3. Caterina De Lucia & Pasquale Pazienza & Mark Bartlett, 2020. "Does Good ESG Lead to Better Financial Performances by Firms? Machine Learning and Logistic Regression Models of Public Enterprises in Europe," Sustainability, MDPI, vol. 12(13), pages 1-29, July.
    4. Zura Kakushadze & Willie Yu, 2017. "Dead Alphas as Risk Factors," Papers 1709.06641, arXiv.org.
    5. Chi Chen & Li Zhao & Wei Cao & Jiang Bian & Chunxiao Xing, 2020. "Trimming the Sail: A Second-order Learning Paradigm for Stock Prediction," Papers 2002.06878, arXiv.org.
    6. Vaia I. Kontopoulou & Athanasios D. Panagopoulos & Ioannis Kakkos & George K. Matsopoulos, 2023. "A Review of ARIMA vs. Machine Learning Approaches for Time Series Forecasting in Data Driven Networks," Future Internet, MDPI, vol. 15(8), pages 1-31, July.
    7. Iraj Daizadeh, 2021. "Leveraging latent persistency in United States patent and trademark applications to gain insight into the evolution of an innovation-driven economy," Papers 2101.02588, arXiv.org, revised May 2021.
    8. Jun Lu & Joerg Osterrieder, 2022. "Feature Selection via the Intervened Interpolative Decomposition and its Application in Diversifying Quantitative Strategies," Papers 2209.14532, arXiv.org.
    9. Xiao Yang & Weiqing Liu & Dong Zhou & Jiang Bian & Tie-Yan Liu, 2020. "Qlib: An AI-oriented Quantitative Investment Platform," Papers 2009.11189, arXiv.org.
    10. Shuo Yu & Hongyan Xue & Xiang Ao & Feiyang Pan & Jia He & Dandan Tu & Qing He, 2023. "Generating Synergistic Formulaic Alpha Collections via Reinforcement Learning," Papers 2306.12964, arXiv.org.
    11. Huo, Da & Chaudhry, Hassan Rauf, 2021. "Using machine learning for evaluating global expansion location decisions: An analysis of Chinese manufacturing sector," Technological Forecasting and Social Change, Elsevier, vol. 163(C).
    12. Yongli Li & Tianchen Wang & Baiqing Sun & Chao Liu, 2022. "Detecting the lead–lag effect in stock markets: definition, patterns, and investment strategies," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 8(1), pages 1-36, December.
    13. Shuo Sun & Rundong Wang & Bo An, 2021. "Reinforcement Learning for Quantitative Trading," Papers 2109.13851, arXiv.org.
    14. Kristof Lommers & Ouns El Harzli & Jack Kim, 2021. "Confronting Machine Learning With Financial Research," Papers 2103.00366, arXiv.org, revised Mar 2021.
    15. George Kosgei Kiptum, 2022. "Relationship between Kenya’s economic growth and inflation," SN Business & Economics, Springer, vol. 2(12), pages 1-16, December.
    16. Hyunsoo Lee, 2019. "Development of Sustainable Recycling Investment Framework Considering Uncertain Demand and Nonlinear Recycling Cost," Sustainability, MDPI, vol. 11(14), pages 1-18, July.
    17. Hector O. Zapata & Supratik Mukhopadhyay, 2022. "A Bibliometric Analysis of Machine Learning Econometrics in Asset Pricing," JRFM, MDPI, vol. 15(11), pages 1-17, November.
    18. Zura Kakushadze & Willie Yu, 2017. "Notes on Fano Ratio and Portfolio Optimization," Papers 1711.10640, arXiv.org, revised Apr 2018.
    19. Jia, Xiongjie & Sang, Yichen & Li, Yanjun & Du, Wei & Zhang, Guolei, 2022. "Short-term forecasting for supercharged boiler safety performance based on advanced data-driven modelling framework," Energy, Elsevier, vol. 239(PE).
    20. Ahmed M. Khedr & Ifra Arif & Pravija Raj P V & Magdi El‐Bannany & Saadat M. Alhashmi & Meenu Sreedharan, 2021. "Cryptocurrency price prediction using traditional statistical and machine‐learning techniques: A survey," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 28(1), pages 3-34, January.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:1912.06236. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.