IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2604.02921.html

Debiasing LLMs by Fine-tuning

Author

Listed:
  • Zhenyu Gao
  • Wenxi Jiang
  • Yutong Yan

Abstract

Prior research shows that large language models (LLMs) exhibit systematic extrapolation bias when forming predictions from both experimental and real-world data, and that prompt-based approaches appear limited in alleviating this bias. We propose a supervised fine-tuning (SFT) approach that uses Low-Rank Adaptation (LoRA) to train off-the-shelf LLMs on instruction datasets constructed from rational benchmark forecasts. By intervening at the parameter level, SFT changes how LLMs map observed information into forecasts and thereby mitigates extrapolation bias. We evaluate the fine-tuned model in two settings: controlled forecasting experiments and cross-sectional stock return prediction. In both settings, fine-tuning corrects the extrapolative bias out-of-sample, establishing a low-cost and generalizable method for debiasing LLMs.

Suggested Citation

  • Zhenyu Gao & Wenxi Jiang & Yutong Yan, 2026. "Debiasing LLMs by Fine-tuning," Papers 2604.02921, arXiv.org, revised May 2026.
  • Handle: RePEc:arx:papers:2604.02921
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2604.02921
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Sean Cao Robert H. Smith & Wei Jiang & Baozhong Yang J. Mack Robinson & Alan L Zhang & Tarun Ramadorai, 2023. "How to Talk When a Machine Is Listening: Corporate Disclosure in the Age of AI," The Review of Financial Studies, Society for Financial Studies, vol. 36(9), pages 3603-3642.
    2. Alejandro Lopez-Lira & Yuehua Tang & Mingyin Zhu, 2025. "The Memorization Problem: Can We Trust LLMs' Economic Forecasts?," Papers 2504.14765, arXiv.org, revised Dec 2025.
    3. Alejandro Lopez-Lira, 2025. "Can Large Language Models Trade? Testing Financial Theories with LLM Agents in Market Simulations," Papers 2504.10789, arXiv.org.
    4. Hassan Afrouzi & Spencer Y Kwon & Augustin Landier & Yueran Ma & David Thesmar, 2023. "Overreaction in Expectations: Evidence and Theory," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 138(3), pages 1713-1764.
    5. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," Review of Finance, European Finance Association, vol. 33(5), pages 2223-2273.
    6. Shuaiyu Chen & T. Clifton Green & Huseyin Gulen & Dexin Zhou, 2024. "What Does ChatGPT Make of Historical Stock Returns? Extrapolation and Miscalibration in LLM Stock Return Forecasts," Papers 2409.11540, arXiv.org.
    7. Yutong Yan & Raphael Tang & Zhenyu Gao & Wenxi Jiang & Yao Lu, 2026. "DatedGPT: Preventing Lookahead Bias in Large Language Models with Time-Aware Pretraining," Papers 2603.11838, arXiv.org.
    8. Zefeng Chen & Darcy Pu, 2026. "Autonomous Market Intelligence: Agentic AI Nowcasting Predicts Stock Returns," Papers 2601.11958, arXiv.org.
    9. Da, Zhi & Huang, Xing & Jin, Lawrence J., 2021. "Extrapolative beliefs in the cross-section: What can we learn from the crowds?," Journal of Financial Economics, Elsevier, vol. 140(1), pages 175-196.
    10. Francesco D’Acunto & Nagpurnanand Prabhala & Alberto G Rossi, 2019. "The Promises and Pitfalls of Robo-Advising," The Review of Financial Studies, Society for Financial Studies, vol. 32(5), pages 1983-2020.
    11. Paul Glasserman & Caden Lin, 2023. "Assessing Look-Ahead Bias in Stock Return Predictions Generated By GPT Sentiment Analysis," Papers 2309.17322, arXiv.org.
    12. John J. Horton & Apostolos Filippas & Benjamin S. Manning, 2023. "Large Language Models as Simulated Economic Agents: What Can We Learn from Homo Silicus?," NBER Working Papers 31122, National Bureau of Economic Research, Inc.
    13. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2223-2273.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Cao, Sean & Jiang, Wei & Wang, Junbo & Yang, Baozhong, 2024. "From Man vs. Machine to Man + Machine: The art and AI of stock analyses," Journal of Financial Economics, Elsevier, vol. 160(C).
    2. Hui Chen & Antoine Didisheim & Mohammad & Pourmohammadi & Luciano Somoza & Hanqing Tian, 2025. "A Financial Brain Scan of the LLM," Papers 2508.21285, arXiv.org, revised Feb 2026.
    3. Chen, Rui & Jiang, Haiqi & Guo, Tingyu & Fan, Chenyou, 2025. "Can Large Language Models forecast carbon price movements? Evidence from Chinese carbon markets," Research in International Business and Finance, Elsevier, vol. 77(PB).
    4. repec:cam:camjip:2506 is not listed on IDEAS
    5. Allen Yikuan Huang & Zheqi Fan, 2026. "Beyond Prompting: An Autonomous Framework for Systematic Factor Investing via Agentic AI," Papers 2603.14288, arXiv.org, revised Apr 2026.
    6. Maher Hamid, 2026. "Implementing domain-specific LLMs for strategic investment decisions: a retrospective case study comparing AI and human expertise," Digital Finance, Springer, vol. 8(1), pages 1-134, March.
    7. Wolfgang Breuer & Andreas Knetsch, 2023. "Recent trends in the digitalization of finance and accounting," Journal of Business Economics, Springer, vol. 93(9), pages 1451-1461, November.
    8. Tom L. Dudda & Lars Hornuf, 2025. "The Perks and Perils of Machine Learning in Business and Economic Research," CESifo Working Paper Series 11721, CESifo.
    9. Didisheim, Antoine & Fraschini, Martina & Somoza, Luciano, 2025. "AI’s predictable memory in financial analysis," Economics Letters, Elsevier, vol. 256(C).
    10. Lansing, Kevin J. & LeRoy, Stephen F. & Ma, Jun, 2022. "Examining the sources of excess return predictability: Stochastic volatility or market inefficiency?," Journal of Economic Behavior & Organization, Elsevier, vol. 197(C), pages 50-72.
    11. Chen, Rui & Ren, Jinjuan, 2022. "Do AI-powered mutual funds perform better?," Finance Research Letters, Elsevier, vol. 47(PA).
    12. Avramov, D. & Ge, S. & Li, S. & Linton, O. B., 2025. "Dual Industry Effects and Cross-Stock Predictability," Cambridge Working Papers in Economics 2512, Faculty of Economics, University of Cambridge.
    13. Zhaobo Zhu & Licheng Sun, 2024. "When Buffett Meets Bollinger: An Integrated Approach to Fundamental and Technical Analysis," Post-Print hal-04703041, HAL.
    14. Li, Yihan, 2024. "Trading on trends: How the ordering of historical volume predicts Chinese stock returns?," International Review of Financial Analysis, Elsevier, vol. 95(PC).
    15. Rossi, Alberto G. & Utkus, Stephen, 2024. "The diversification and welfare effects of robo-advising," Journal of Financial Economics, Elsevier, vol. 157(C).
    16. Yang ZHANG & Ziang QIU Ziang & Donghyun PARK & Shu TIAN, 2026. "Role of Artificial Intelligence in Finance: Selective Literature Review and Implications for Asia's Financial Stability," Working Papers wp61, South East Asian Central Banks (SEACEN) Research and Training Centre, revised Feb 2026.
    17. Chen, Jian & Tang, Guohao & Yao, Jiaquan & Zhou, Guofu, 2023. "Employee sentiment and stock returns," Journal of Economic Dynamics and Control, Elsevier, vol. 149(C).
    18. Michalski, Lachlan & Low, Rand Kwong Yew, 2024. "Determinants of corporate credit ratings: Does ESG matter?," International Review of Financial Analysis, Elsevier, vol. 94(C).
    19. Ylinen, Mika & Ranta, Mikko, 2025. "Predicting corporate innovation using machine learning and social media data," Technovation, Elsevier, vol. 148(C).
    20. Shuangshuang Fan & Yichao Li & William Mbanyele & Xiufeng Lai, 2025. "Determinants and Pathways for Inclusive Growth in China: Investigation Based on Artificial Intelligence (AI) Algorithm," Computational Economics, Springer;Society for Computational Economics, vol. 65(3), pages 1231-1264, March.
    21. Abraham Itzhak Weinberg, 2025. "Hybrid Quantum-Classical Ensemble Learning for S\&P 500 Directional Prediction," Papers 2512.15738, arXiv.org.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2604.02921. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.