IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v16y2025i1d10.1038_s41467-025-60801-6.html
   My bibliography  Save this article

Mitigating data bias and ensuring reliable evaluation of AI models with shortcut hull learning

Author

Listed:
  • Wenhao Zhou

    (Tsinghua University
    Tsinghua University
    Tsinghua University
    Tsinghua University)

  • Faqiang Liu

    (Tsinghua University
    Tsinghua University
    Tsinghua University
    Tsinghua University)

  • Hao Zheng

    (Tsinghua University
    Tsinghua University
    Tsinghua University
    Tsinghua University)

  • Rong Zhao

    (Tsinghua University
    Tsinghua University
    Tsinghua University
    Tsinghua University)

Abstract

Shortcut learning poses a significant challenge to both the interpretability and robustness of artificial intelligence, arising from dataset biases that lead models to exploit unintended correlations, or shortcuts, which undermine performance evaluations. Addressing these inherent biases is particularly difficult due to the complex, high-dimensional nature of data. Here, we introduce shortcut hull learning, a diagnostic paradigm that unifies shortcut representations in probability space and utilizes diverse models with different inductive biases to efficiently learn and identify shortcuts. This paradigm establishes a comprehensive, shortcut-free evaluation framework, validated by developing a shortcut-free topological dataset to assess deep neural networks’ global capabilities, enabling a shift from Minsky and Papert’s representational analysis to an empirical investigation of learning capacity. Unexpectedly, our experimental results suggest that under this framework, convolutional models—typically considered weak in global capabilities—outperform transformer-based models, challenging prevailing beliefs. By enabling robust and bias-free evaluation, our framework uncovers the true model capabilities beyond architectural preferences, offering a foundation for advancing AI interpretability and reliability.

Suggested Citation

  • Wenhao Zhou & Faqiang Liu & Hao Zheng & Rong Zhao, 2025. "Mitigating data bias and ensuring reliable evaluation of AI models with shortcut hull learning," Nature Communications, Nature, vol. 16(1), pages 1-15, December.
  • Handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-60801-6
    DOI: 10.1038/s41467-025-60801-6
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-025-60801-6
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-025-60801-6?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Alhussein Fawzi & Matej Balog & Aja Huang & Thomas Hubert & Bernardino Romera-Paredes & Mohammadamin Barekatain & Alexander Novikov & Francisco J. R. Ruiz & Julian Schrittwieser & Grzegorz Swirszcz & , 2022. "Discovering faster matrix multiplication algorithms with reinforcement learning," Nature, Nature, vol. 610(7930), pages 47-53, October.
    2. John Jumper & Richard Evans & Alexander Pritzel & Tim Green & Michael Figurnov & Olaf Ronneberger & Kathryn Tunyasuvunakool & Russ Bates & Augustin Žídek & Anna Potapenko & Alex Bridgland & Clemens Me, 2021. "Highly accurate protein structure prediction with AlphaFold," Nature, Nature, vol. 596(7873), pages 583-589, August.
    3. David Silver & Aja Huang & Chris J. Maddison & Arthur Guez & Laurent Sifre & George van den Driessche & Julian Schrittwieser & Ioannis Antonoglou & Veda Panneershelvam & Marc Lanctot & Sander Dieleman, 2016. "Mastering the game of Go with deep neural networks and tree search," Nature, Nature, vol. 529(7587), pages 484-489, January.
    4. Trieu H. Trinh & Yuhuai Wu & Quoc V. Le & He He & Thang Luong, 2024. "Author Correction: Solving olympiad geometry without human demonstrations," Nature, Nature, vol. 627(8004), pages 8-8, March.
    5. Alex Davies & Petar Veličković & Lars Buesing & Sam Blackwell & Daniel Zheng & Nenad Tomašev & Richard Tanburn & Peter Battaglia & Charles Blundell & András Juhász & Marc Lackenby & Geordie Williamson, 2021. "Advancing mathematics by guiding human intuition with AI," Nature, Nature, vol. 600(7887), pages 70-74, December.
    6. Trieu H. Trinh & Yuhuai Wu & Quoc V. Le & He He & Thang Luong, 2024. "Solving olympiad geometry without human demonstrations," Nature, Nature, vol. 625(7995), pages 476-482, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jesús Fernández-Villaverde & Galo Nuño & Jesse Perla, 2024. "Taming the Curse of Dimensionality: Quantitative Economics with Deep Learning," NBER Working Papers 33117, National Bureau of Economic Research, Inc.
    2. Boštjan Gec & Sašo Džeroski & Ljupčo Todorovski, 2024. "Discovery of Exact Equations for Integer Sequences," Mathematics, MDPI, vol. 12(23), pages 1-22, November.
    3. Evangelos Katsamakas & Oleg V. Pavlov & Ryan Saklad, 2024. "Artificial intelligence and the transformation of higher education institutions," Papers 2402.08143, arXiv.org.
    4. Yang Ye & Abhishek Pandey & Carolyn Bawden & Dewan Md. Sumsuzzman & Rimpi Rajput & Affan Shoukat & Burton H. Singer & Seyed M. Moghadas & Alison P. Galvani, 2025. "Integrating artificial intelligence with mechanistic epidemiological modeling: a scoping review of opportunities and challenges," Nature Communications, Nature, vol. 16(1), pages 1-18, December.
    5. Patrick Bryant & Gabriele Pozzati & Wensi Zhu & Aditi Shenoy & Petras Kundrotas & Arne Elofsson, 2022. "Predicting the structure of large protein complexes using AlphaFold and Monte Carlo tree search," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    6. Hajkowicz, Stefan & Naughtin, Claire & Sanderson, Conrad & Schleiger, Emma & Karimi, Sarvnaz & Bratanova, Alexandra & Bednarz, Tomasz, 2022. "Artificial intelligence for science – adoption trends and future development pathways," MPRA Paper 115464, University Library of Munich, Germany.
    7. Xin Li & Qunxi Zhu & Chengli Zhao & Xiaojun Duan & Bolin Zhao & Xue Zhang & Huanfei Ma & Jie Sun & Wei Lin, 2024. "Higher-order Granger reservoir computing: simultaneously achieving scalable complex structures inference and accurate dynamics prediction," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    8. Zeqing Jin & Dahyun Daniel Lim & Xueying Zhao & Meenakshi Mamunuru & Sassan Roham & Grace X. Gu, 2024. "Machine learning enabled optimization of showerhead design for semiconductor deposition process," Journal of Intelligent Manufacturing, Springer, vol. 35(2), pages 925-935, February.
    9. Runyu Zhang & Yingjian Liu & Thomas Zheng & Sarah Eddin & Steven Nolet & Yi-Ling Liang & Shaghayegh Rezazadeh & Joseph Wilson & Hongbing Lu & Dong Qian, 2024. "A fast spatio-temporal temperature predictor for vacuum assisted resin infusion molding process based on deep machine learning modeling," Journal of Intelligent Manufacturing, Springer, vol. 35(4), pages 1737-1764, April.
    10. Tian Zhu & Merry H. Ma, 2022. "Deriving the Optimal Strategy for the Two Dice Pig Game via Reinforcement Learning," Stats, MDPI, vol. 5(3), pages 1-14, August.
    11. Weifan Long & Taixian Hou & Xiaoyi Wei & Shichao Yan & Peng Zhai & Lihua Zhang, 2023. "A Survey on Population-Based Deep Reinforcement Learning," Mathematics, MDPI, vol. 11(10), pages 1-17, May.
    12. Zhenchong Mo & Lin Gong & Mingren Zhu & Junde Lan, 2024. "The Generative Generic-Field Design Method Based on Design Cognition and Knowledge Reasoning," Sustainability, MDPI, vol. 16(22), pages 1-34, November.
    13. Sun-Ting Tsai & Eric Fields & Yijia Xu & En-Jui Kuo & Pratyush Tiwary, 2022. "Path sampling of recurrent neural networks by incorporating known physics," Nature Communications, Nature, vol. 13(1), pages 1-10, December.
    14. Evangelos Katsamakas & Oleg V. Pavlov & Ryan Saklad, 2024. "Artificial Intelligence and the Transformation of Higher Education Institutions: A Systems Approach," Sustainability, MDPI, vol. 16(14), pages 1-21, July.
    15. Luozhijie Jin & Zijian Du & Le Shu & Yan Cen & Yuanfeng Xu & Yongfeng Mei & Hao Zhang, 2025. "Transformer-generated atomic embeddings to enhance prediction accuracy of crystal properties with machine learning," Nature Communications, Nature, vol. 16(1), pages 1-11, December.
    16. Cui, Tianxiang & Du, Nanjiang & Yang, Xiaoying & Ding, Shusheng, 2024. "Multi-period portfolio optimization using a deep reinforcement learning hyper-heuristic approach," Technological Forecasting and Social Change, Elsevier, vol. 198(C).
    17. Simon D Angus, 2024. "Tracking Policy-relevant Narratives of Democratic Resilience at Scale: from experts and machines, to AI & the transformer revolution," SoDa Laboratories Working Paper Series 2024-07, Monash University, SoDa Laboratories.
    18. Min Yan & Can Huang & Peter Bienstman & Peter Tino & Wei Lin & Jie Sun, 2024. "Emerging opportunities and challenges for the future of reservoir computing," Nature Communications, Nature, vol. 15(1), pages 1-18, December.
    19. Pantelis Livanos & Choy Kriechbaum & Sophia Remers & Arvid Herrmann & Sabine Müller, 2025. "Kinesin-12 POK2 polarization is a prerequisite for a fully functional division site and aids cell plate positioning," Nature Communications, Nature, vol. 16(1), pages 1-17, December.
    20. Xiaoyue Li & John M. Mulvey, 2023. "Optimal Portfolio Execution in a Regime-switching Market with Non-linear Impact Costs: Combining Dynamic Program and Neural Network," Papers 2306.08809, arXiv.org.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-60801-6. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.