IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2512.01354.html

The Necessity of Imperfection:Reversing Model Collapse via Simulating Cognitive Boundedness

Author

Listed:
  • Zhongjie Jiang

Abstract

Although synthetic data is widely promoted as a remedy, its prevailing production paradigm -- one optimizing for statistical smoothness -- systematically removes the long-tail, cognitively grounded irregularities that characterize human text. Prolonged training on such statistically optimal but cognitively impoverished data accelerates model collapse. This paper proposes a paradigm shift: instead of imitating the surface properties of data, we simulate the cognitive processes that generate human text. We introduce the Prompt-driven Cognitive Computing Framework (PMCSF), whose core consists of a Cognitive State Decoder (CSD) that reverse-engineers unstructured text into structured cognitive vectors, and a Cognitive Text Encoder (CTE) that re-materializes these states into text enriched with human-typical imperfections via mathematically defined Cognitive Perturbation Operators. The framework is validated through a two-stage objective evaluation pipeline. First, in cognitive codec verification, CTE text yields a Jensen-Shannon divergence of 0.0614 from human text (vs. 0.4431 for standard LLM output), passes double-blind professional media review, and achieves an intraclass correlation coefficient ICC > 0.9 for cognitive profile alignment across heterogeneous models. Second, in functional gain evaluation, isomorphic stress tests in the A-share market show that strategies incorporating CTE-generated data reduce maximum drawdown by 47.4% during the 2015 crash and deliver 8.6% Defensive Alpha, exceeding transaction costs by a factor of 33. Our findings demonstrate that modelling human cognitive limitations -- not copying surface data -- enables synthetic data with genuine functional gain, offering a viable technical pathway toward resolving the AI data-collapse crisis.

Suggested Citation

  • Zhongjie Jiang, 2025. "The Necessity of Imperfection:Reversing Model Collapse via Simulating Cognitive Boundedness," Papers 2512.01354, arXiv.org, revised Dec 2025.
  • Handle: RePEc:arx:papers:2512.01354
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2512.01354
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Tversky, Amos & Kahneman, Daniel, 1992. "Advances in Prospect Theory: Cumulative Representation of Uncertainty," Journal of Risk and Uncertainty, Springer, vol. 5(4), pages 297-323, October.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Zhongjie Jiang, 2025. "Workflow is All You Need: Escaping the "Statistical Smoothing Trap" via High-Entropy Information Foraging and Adversarial Pacing," Papers 2512.10121, arXiv.org.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. van den Bergh, J.C.J.M. & Botzen, W.J.W., 2015. "Monetary valuation of the social cost of CO2 emissions: A critical survey," Ecological Economics, Elsevier, vol. 114(C), pages 33-46.
    2. Heiko Karle & Georg Kirchsteiger & Martin Peitz, 2015. "Loss Aversion and Consumption Choice: Theory and Experimental Evidence," American Economic Journal: Microeconomics, American Economic Association, vol. 7(2), pages 101-120, May.
    3. Shoji, Isao & Kanehiro, Sumei, 2016. "Disposition effect as a behavioral trading activity elicited by investors' different risk preferences," International Review of Financial Analysis, Elsevier, vol. 46(C), pages 104-112.
    4. Muhammad Kashif & Thomas Leirvik, 2022. "The MAX Effect in an Oil Exporting Country: The Case of Norway," JRFM, MDPI, vol. 15(4), pages 1-16, March.
    5. Jonathan Meng & Feng Fu, 2020. "Understanding Gambling Behavior and Risk Attitudes Using Cryptocurrency-based Casino Blockchain Data," Papers 2008.05653, arXiv.org, revised Aug 2020.
    6. Daniel Fonseca Costa & Francisval Carvalho & Bruno César Moreira & José Willer Prado, 2017. "Bibliometric analysis on the association between behavioral finance and decision making with cognitive biases such as overconfidence, anchoring effect and confirmation bias," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(3), pages 1775-1799, June.
    7. Cappelen, Alexander W. & Sørensen, Erik Ø. & Tungodden, Bertil & Xu, Xiaogeng, 2025. "Risk taking on behalf of others: Does the timing of uncertainty revelation matter?," Discussion Paper Series in Economics 13/2025, Norwegian School of Economics, Department of Economics.
    8. Jos'e Cl'audio do Nascimento, 2019. "Behavioral Biases and Nonadditive Dynamics in Risk Taking: An Experimental Investigation," Papers 1908.01709, arXiv.org, revised Apr 2023.
    9. Breaban, Adriana & van de Kuilen, Gijs & Noussair, Charles, 2016. "Prudence, Personality, Cognitive Ability and Emotional State," Other publications TiSEM 9a01a5ab-e03d-49eb-9cd7-4, Tilburg University, School of Economics and Management.
    10. Martín Egozcue & Sébastien Massoni & Wing-Keung Wong & Ričardas Zitikis, 2012. "Integration-segregation decisions under general value functions : "Create your own bundle -- choose 1, 2, or all 3 !"," Post-Print halshs-00747008, HAL.
    11. Howard Kunreuther & Erwann Michel-Kerjan, 2015. "Demand for fixed-price multi-year contracts: Experimental evidence from insurance decisions," Journal of Risk and Uncertainty, Springer, vol. 51(2), pages 171-194, October.
    12. Choo, Weihao & de Jong, Piet, 2015. "The tradeoff insurance premium as a two-sided generalisation of the distortion premium," Insurance: Mathematics and Economics, Elsevier, vol. 65(C), pages 238-246.
    13. Francesco GUALA, 2017. "Preferences: Neither Behavioural nor Mental," Departmental Working Papers 2017-05, Department of Economics, Management and Quantitative Methods at Università degli Studi di Milano.
    14. Fabian Herweg & Svenja Hippel & Daniel Müller & Fabio Römeis, 2024. "Axiom Preferences and Choice Mistakes under Risk," ECONtribute Discussion Papers Series 326, University of Bonn and University of Cologne, Germany.
    15. Shi, Yun & Cui, Xiangyu & Zhou, Xunyu, 2020. "Beta and Coskewness Pricing: Perspective from Probability Weighting," SocArXiv 5rqhv, Center for Open Science.
    16. Simplice Asongu & Nicholas M. Odhiambo, 2020. "Financial access, governance and insurance sector development in sub-Saharan Africa," Journal of Economic Studies, Emerald Group Publishing Limited, vol. 47(4), pages 849-875, February.
    17. Bin Zou, 2017. "Optimal Investment In Hedge Funds Under Loss Aversion," International Journal of Theoretical and Applied Finance (IJTAF), World Scientific Publishing Co. Pte. Ltd., vol. 20(03), pages 1-32, May.
    18. Alex Stomper & Marie-Louise Vierø, 2015. "Iterated Expectations Under Rank-dependent Expected Utility And Model Consistency," Working Paper 1228, Economics Department, Queen's University.
    19. Liu, Zhiqiang & Yan, Miao & Fan, Youqing & Chen, Liling, 2021. "Ascribed or achieved? The role of birth order on innovative behaviour in the workplace," Journal of Business Research, Elsevier, vol. 134(C), pages 480-492.
    20. Dertwinkel-Kalt, Markus & Köster, Mats, 2017. "Local thinking and skewness preferences," DICE Discussion Papers 248, Heinrich Heine University Düsseldorf, Düsseldorf Institute for Competition Economics (DICE).

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2512.01354. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.