IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2505.14420.html
   My bibliography  Save this paper

SAE-FiRE: Enhancing Earnings Surprise Predictions Through Sparse Autoencoder Feature Selection

Author

Listed:
  • Huopu Zhang
  • Yanguang Liu
  • Mengnan Du

Abstract

Predicting earnings surprises through the analysis of earnings conference call transcripts has attracted increasing attention from the financial research community. Conference calls serve as critical communication channels between company executives, analysts, and shareholders, offering valuable forward-looking information. However, these transcripts present significant analytical challenges, typically containing over 5,000 words with substantial redundancy and industry-specific terminology that creates obstacles for language models. In this work, we propose the Sparse Autoencoder for Financial Representation Enhancement (SAE-FiRE) framework to address these limitations by extracting key information while eliminating redundancy. SAE-FiRE employs Sparse Autoencoders (SAEs) to efficiently identify patterns and filter out noises, and focusing specifically on capturing nuanced financial signals that have predictive power for earnings surprises. Experimental results indicate that the proposed method can significantly outperform comparing baselines.

Suggested Citation

  • Huopu Zhang & Yanguang Liu & Mengnan Du, 2025. "SAE-FiRE: Enhancing Earnings Surprise Predictions Through Sparse Autoencoder Feature Selection," Papers 2505.14420, arXiv.org.
  • Handle: RePEc:arx:papers:2505.14420
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2505.14420
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. David F. Larcker & Anastasia A. Zakolyukina, 2012. "Detecting Deceptive Discussions in Conference Calls," Journal of Accounting Research, Wiley Blackwell, vol. 50(2), pages 495-540, May.
    2. Brown, Stephen & Hillegeist, Stephen A. & Lo, Kin, 2004. "Conference calls and information asymmetry," Journal of Accounting and Economics, Elsevier, vol. 37(3), pages 343-366, September.
    3. Brian J. Bushee & Ian D. Gow & Daniel J. Taylor, 2018. "Linguistic Complexity in Firm Disclosures: Obfuscation or Information?," Journal of Accounting Research, Wiley Blackwell, vol. 56(1), pages 85-121, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Devrimi Kaya & Christian Maier & Tobias Böhmer, 2020. "Empirische Kapitalmarktforschung zu Conference Calls: Eine Literaturanalyse [Empirical Capital Market Research on Conference Calls: A Literature Review]," Schmalenbach Journal of Business Research, Springer, vol. 72(2), pages 183-212, June.
    2. Chourou, Lamia & Hossain, Ashrafee T. & Jha, Anand, 2024. "US political corruption and quarterly conference calls," Journal of Banking & Finance, Elsevier, vol. 161(C).
    3. DeLisle, R. Jared & Grant, Andrew & Mao, Ruiqi, 2024. "Does environmental and social performance affect pricing efficiency? Evidence from earnings conference call tones," Journal of Corporate Finance, Elsevier, vol. 86(C).
    4. Yangyang Chen & Jeffrey Ng & Xin Yang, 2021. "Talk Less, Learn More: Strategic Disclosure in Response to Managerial Learning from the Options Market," Journal of Accounting Research, Wiley Blackwell, vol. 59(5), pages 1609-1649, December.
    5. Xi Fu & Xiaoxi Wu & Zhifang Zhang, 2021. "The Information Role of Earnings Conference Call Tone: Evidence from Stock Price Crash Risk," Journal of Business Ethics, Springer, vol. 173(3), pages 643-660, October.
    6. Lu, Hsueh-Tien, 2018. "Voluntary disclosure and the pricing of earnings components," Journal of Behavioral and Experimental Finance, Elsevier, vol. 20(C), pages 64-73.
    7. Nerissa C. Brown & Richard M. Crowley & W. Brooke Elliott, 2020. "What Are You Saying? Using topic to Detect Financial Misreporting," Journal of Accounting Research, Wiley Blackwell, vol. 58(1), pages 237-291, March.
    8. Blankespoor, Elizabeth & deHaan, Ed & Marinovic, Iván, 2020. "Disclosure processing costs, investors’ information choice, and equity market outcomes: A review," Journal of Accounting and Economics, Elsevier, vol. 70(2).
    9. Mansouri, Sasan, 2021. "Does firm's silence drive media's attention away?," VfS Annual Conference 2021 (Virtual Conference): Climate Economics 242433, Verein für Socialpolitik / German Economic Association.
    10. Allee, Kristian D. & Do, Chuong & Sterin, Mikhail, 2021. "Product market competition, disclosure framing, and casting in earnings conference calls," Journal of Accounting and Economics, Elsevier, vol. 72(1).
    11. Trinh, Vu Quang & Trinh, Hai Hong & Li, Teng & Vo, Xuan Vinh, 2024. "Climate change exposure, financial development, and the cost of debt: Evidence from EU countries," Journal of Financial Stability, Elsevier, vol. 74(C).
    12. Borochin, Paul A. & Cicon, James E. & DeLisle, R. Jared & Price, S. McKay, 2018. "The effects of conference call tones on market perceptions of value uncertainty," Journal of Financial Markets, Elsevier, vol. 40(C), pages 75-91.
    13. Blau, Benjamin M. & DeLisle, Jared R. & Price, S. McKay, 2015. "Do sophisticated investors interpret earnings conference call tone differently than investors at large? Evidence from short sales," Journal of Corporate Finance, Elsevier, vol. 31(C), pages 203-219.
    14. Belinda Crawford Camiciottoli, 2012. "Oral financial reporting: A rhetorical analysis of earnings calls," FINANCIAL REPORTING, FrancoAngeli Editore, vol. 2012(4), pages 15-34.
    15. Chen, Yuan & Han, Dongmei & Zhou, Xiaofeng, 2023. "Mining the emotional information in the audio of earnings conference calls : A deep learning approach for sentiment analysis of securities analysts' follow-up behavior," International Review of Financial Analysis, Elsevier, vol. 88(C).
    16. Bhattacharya, Indranil & Mickovic, Ana, 2024. "Accounting fraud detection using contextual language learning," International Journal of Accounting Information Systems, Elsevier, vol. 53(C).
    17. Hope, Ole-Kristian & Wang, Jingjing, 2018. "Management deception, big-bath accounting, and information asymmetry: Evidence from linguistic analysis," Accounting, Organizations and Society, Elsevier, vol. 70(C), pages 33-51.
    18. Stephen A. Hillegeist & James P. Kavourakis & Matthew Pinnuck, 2023. "The association between quarter length, forecast errors, and firms’ voluntary disclosures," Accounting and Finance, Accounting and Finance Association of Australia and New Zealand, vol. 63(2), pages 1885-1918, June.
    19. Umar, Tarik, 2022. "Complexity aversion when SeekingAlpha," Journal of Accounting and Economics, Elsevier, vol. 73(2).
    20. Evangeline O. Elijido-Ten & Peter Clarkson, 2019. "Going Beyond Climate Change Risk Management: Insights from the World’s Largest Most Sustainable Corporations," Journal of Business Ethics, Springer, vol. 157(4), pages 1067-1089, July.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2505.14420. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.