IDEAS home Printed from https://ideas.repec.org/a/spr/snbeco/v1y2021i8d10.1007_s43546-021-00106-0.html
   My bibliography  Save this article

Construction of a news article evaluation model utilizing high-frequency data and a large-scale language generation model

Author

Listed:
  • Yoshihiro Nishi

    (Keio University)

  • Aiko Suge

    (Keio University)

  • Hiroshi Takahashi

    (Keio University)

Abstract

News articles have significant impacts on asset prices in financial markets. A great number of attempts have been conducted to ascertain how news articles influence stock prices. News articles have been reported to contain sentimental and fundamental information that affects stock price fluctuations, and many studies have been conducted to evaluate stock price fluctuations using them as analytical data. However, the limitations in the number of available datasets usually become the hurdle for the model accuracy. This study aims to improve the analytical model’s accuracy by generating news articles using language generation technology. We tested whether the model that used the generated data was better than the trained model with real-world data. The model constructed in this research is a model that evaluates news articles distributed to financial markets based on the price fluctuation rate of stock prices and predicts and evaluates stock price fluctuations. This study labeled based on high-frequency trading data and generated news articles using a large-scale language generation model (GPT-2). Also, we analyzed and verified the effect. In this study, we succeeded in generating news articles using the large-scale language generation model and improving the classification accuracy. Our method proposed in this paper has great potential to improve text analysis accuracy in various areas.

Suggested Citation

  • Yoshihiro Nishi & Aiko Suge & Hiroshi Takahashi, 2021. "Construction of a news article evaluation model utilizing high-frequency data and a large-scale language generation model," SN Business & Economics, Springer, vol. 1(8), pages 1-18, August.
  • Handle: RePEc:spr:snbeco:v:1:y:2021:i:8:d:10.1007_s43546-021-00106-0
    DOI: 10.1007/s43546-021-00106-0
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s43546-021-00106-0
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s43546-021-00106-0?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Bitty Balducci & Detelina Marinova, 2018. "Unstructured data in marketing," Journal of the Academy of Marketing Science, Springer, vol. 46(4), pages 557-590, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Shrestha, Yash Raj & Krishna, Vaibhav & von Krogh, Georg, 2021. "Augmenting organizational decision-making with deep learning algorithms: Principles, promises, and challenges," Journal of Business Research, Elsevier, vol. 123(C), pages 588-603.
    2. Surajit Bag & Shivam Gupta & Ajay Kumar & Uthayasankar Sivarajah, 2021. "An integrated artificial intelligence framework for knowledge creation and B2B marketing rational decision making for improving firm performance," Post-Print hal-03188195, HAL.
    3. Li, Xi & Shi, Mengze & Wang, Xin (Shane), 2019. "Video mining: Measuring visual information using automatic methods," International Journal of Research in Marketing, Elsevier, vol. 36(2), pages 216-231.
    4. Oliver Schaer & Nikolaos Kourentzes & Robert Fildes, 2022. "Predictive competitive intelligence with prerelease online search traffic," Production and Operations Management, Production and Operations Management Society, vol. 31(10), pages 3823-3839, October.
    5. Gandhi, Mohina & Kar, Arpan Kumar, 2022. "How do Fortune firms build a social presence on social media platforms? Insights from multi-modal analytics," Technological Forecasting and Social Change, Elsevier, vol. 182(C).
    6. Miikka Blomster & Timo Koivumäki, 2022. "Exploring the resources, competencies, and capabilities needed for successful machine learning projects in digital marketing," Information Systems and e-Business Management, Springer, vol. 20(1), pages 123-169, March.
    7. Blasco-Arcas, Lorena & Kastanakis, Minas N. & Alcañiz, Mariano & Reyes-Menendez, Ana, 2023. "Leveraging user behavior and data science technologies for management: An overview," Journal of Business Research, Elsevier, vol. 154(C).
    8. Neil A. Morgan & Kimberly A. Whitler & Hui Feng & Simos Chari, 2019. "Research in marketing strategy," Journal of the Academy of Marketing Science, Springer, vol. 47(1), pages 4-29, January.
    9. Abhishek Borah & Francesca Bonetti & Angelito Calma & José Martí-Parreño, 2023. "The Journal of the Academy of Marketing Science at 50: A historical analysis," Journal of the Academy of Marketing Science, Springer, vol. 51(1), pages 222-243, January.
    10. Milne, George R. & Villarroel Ordenes, Francisco & Kaplan, Begum, 2020. "Mindful consumption: Three consumer segment views," Australasian marketing journal, Elsevier, vol. 28(1), pages 3-10.
    11. Mariani, Marcello M. & Borghi, Matteo & Laker, Benjamin, 2023. "Do submission devices influence online review ratings differently across different types of platforms? A big data analysis," Technological Forecasting and Social Change, Elsevier, vol. 189(C).
    12. Shrihari Sridhar & Eric Fang, 2019. "New vistas for marketing strategy: digital, data-rich, and developing market (D3) environments," Journal of the Academy of Marketing Science, Springer, vol. 47(6), pages 977-985, November.
    13. Venkatesh Shankar & Sohil Parsana, 2022. "An overview and empirical comparison of natural language processing (NLP) models and an introduction to and empirical application of autoencoder models in marketing," Journal of the Academy of Marketing Science, Springer, vol. 50(6), pages 1324-1350, November.
    14. Akter, Shahriar & Dwivedi, Yogesh K. & Sajib, Shahriar & Biswas, Kumar & Bandara, Ruwan J. & Michael, Katina, 2022. "Algorithmic bias in machine learning-based marketing models," Journal of Business Research, Elsevier, vol. 144(C), pages 201-216.
    15. Pantano, Eleonora & Dennis, Charles, 2019. "Store buildings as tourist attractions: Mining retail meaning of store building pictures through a machine learning approach," Journal of Retailing and Consumer Services, Elsevier, vol. 51(C), pages 304-310.
    16. Holmlund, Maria & Van Vaerenbergh, Yves & Ciuchita, Robert & Ravald, Annika & Sarantopoulos, Panagiotis & Ordenes, Francisco Villarroel & Zaki, Mohamed, 2020. "Customer experience management in the age of big data analytics: A strategic framework," Journal of Business Research, Elsevier, vol. 116(C), pages 356-365.
    17. Sotaro Katsumata & Seungjin Kim, 2020. "The Text-Score Allocation Model: Finding Latent Topics of Online Review Documents and Multi-Item Ratings," Discussion Papers in Economics and Business 20-01, Osaka University, Graduate School of Economics.
    18. Kuchta Martin, 2020. "Extent of data utilization within digital marketing processes," Studia Commercialia Bratislavensia, Sciendo, vol. 13(43), pages 35-43, March.
    19. Ming-Hui Huang & Roland T. Rust, 2021. "A strategic framework for artificial intelligence in marketing," Journal of the Academy of Marketing Science, Springer, vol. 49(1), pages 30-50, January.
    20. Luigi M. De Luca & Dennis Herhausen & Gabriele Troilo & Andrea Rossi, 2021. "How and when do big data investments pay off? The role of marketing affordances and service innovation," Journal of the Academy of Marketing Science, Springer, vol. 49(4), pages 790-810, July.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:snbeco:v:1:y:2021:i:8:d:10.1007_s43546-021-00106-0. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.