IDEAS home Printed from https://ideas.repec.org/a/eee/infome/v19y2025i2s1751157725000252.html

Annotating scientific uncertainty: A comprehensive model using linguistic patterns and comparison with existing approaches

Author

Listed:
  • Ningrum, Panggih Kusuma
  • Mayr, Philipp
  • Smirnova, Nina
  • Atanassova, Iana

Abstract

We present UnScientify,1 a system designed to detect scientific uncertainty in scholarly full text. The system utilizes a weakly supervised technique to identify verbally expressed uncertainty in scientific texts and their authorial references. The core methodology of UnScientify is based on a multi-faceted pipeline that integrates span pattern matching, complex sentence analysis and author reference checking. This approach streamlines the labeling and annotation processes essential for identifying scientific uncertainty, covering a variety of uncertainty expression types to support diverse applications including information retrieval, text mining and scientific document processing. The evaluation results highlight the trade-offs between modern large language models (LLMs) and the UnScientify system. UnScientify, which employs more traditional techniques, achieved superior performance in the scientific uncertainty detection task, attaining an accuracy score of 0.808. This finding underscores the continued relevance and efficiency of UnScientify's simple rule-based and pattern matching strategy for this specific application. The results demonstrate that in scenarios where resource efficiency, interpretability, and domain-specific adaptability are critical, traditional methods can still offer significant advantages.

Suggested Citation

  • Ningrum, Panggih Kusuma & Mayr, Philipp & Smirnova, Nina & Atanassova, Iana, 2025. "Annotating scientific uncertainty: A comprehensive model using linguistic patterns and comparison with existing approaches," Journal of Informetrics, Elsevier, vol. 19(2).
  • Handle: RePEc:eee:infome:v:19:y:2025:i:2:s1751157725000252
    DOI: 10.1016/j.joi.2025.101661
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S1751157725000252
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.joi.2025.101661?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Panggih Kusuma Ningrum & Iana Atanassova, 2024. "Annotation of scientific uncertainty using linguistic patterns," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(10), pages 6261-6285, October.
    2. Johannes Lehmann & Matthias Rillig, 2014. "Distinguishing variability from uncertainty," Nature Climate Change, Nature, vol. 4(3), pages 153-153, March.
    3. Ramona Bongelli & Ilaria Riccioni & Roberto Burro & Andrzej Zuczkowski, 2019. "Writers’ uncertainty in scientific and popular biomedical articles. A comparative analysis of the British Medical Journal and Discover Magazine," PLOS ONE, Public Library of Science, vol. 14(9), pages 1-26, September.
    4. Mingxin Yao & Ying Wei & Huiyu Wang, 2023. "Promoting research by reducing uncertainty in academic writing: a large-scale diachronic case study on hedging in Science research articles across 25 years," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(8), pages 4541-4558, August.
    5. Si Shen & Jiangfeng Liu & Litao Lin & Ying Huang & Lin Zhang & Chang Liu & Yutong Feng & Dongbo Wang, 2023. "SsciBERT: a pre-trained language model for social science texts," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(2), pages 1241-1263, February.
    6. Janusz A. Hołyst & Philipp Mayr & Michael Thelwall & Ingo Frommholz & Shlomo Havlin & Alon Sela & Yoed N. Kenett & Denis Helic & Aljoša Rehar & Sebastijan R. Maček & Przemysław Kazienko & Tomasz Kajda, 2024. "Protect our environment from information overload," Nature Human Behaviour, Nature, vol. 8(3), pages 402-403, March.
    7. Chen, Chaomei & Song, Min & Heo, Go Eun, 2018. "A scalable and adaptive method for finding semantically equivalent cue words of uncertainty," Journal of Informetrics, Elsevier, vol. 12(1), pages 158-180.
    8. Sigel, Katja & Klauer, Bernd & Pahl-Wostl, Claudia, 2010. "Conceptualising uncertainty in environmental decision-making: The example of the EU water framework directive," Ecological Economics, Elsevier, vol. 69(3), pages 502-510, January.
    9. Ascough, J.C. & Maier, H.R. & Ravalico, J.K. & Strudley, M.W., 2008. "Future research challenges for incorporation of uncertainty in environmental and ecological decision-making," Ecological Modelling, Elsevier, vol. 219(3), pages 383-399.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Panggih Kusuma Ningrum & Iana Atanassova, 2024. "Annotation of scientific uncertainty using linguistic patterns," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(10), pages 6261-6285, October.
    2. Xavier Romão & Esmeralda Paupério, 2016. "A framework to assess quality and uncertainty in disaster loss data," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 83(2), pages 1077-1102, September.
    3. Marcelo Coelho & Cristovão Vicente Scapulatempo Fernandes & Daniel Henrique Marco Detzel, 2019. "Uncertainty analysis in the detection of trends, cycles, and shifts in water resources time series," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 33(8), pages 2629-2644, June.
    4. Maartje Bodde & Karin Van der Wel & Peter Driessen & Arjan Wardekker & Hens Runhaar, 2018. "Strategies for Dealing with Uncertainties in Strategic Environmental Assessment: An Analytical Framework Illustrated with Case Studies from The Netherlands," Sustainability, MDPI, vol. 10(7), pages 1-24, July.
    5. Ana Paula Coelho Clauberg & Renato de Mello & Flávio José Simioni & Simone Sehnem, 2021. "System for assessing the sustainability conditions of small hydro plants by fuzzy logic," Sustainable Development, John Wiley & Sons, Ltd., vol. 29(2), pages 300-317, March.
    6. Seidl, Rupert & Fernandes, Paulo M. & Fonseca, Teresa F. & Gillet, François & Jönsson, Anna Maria & Merganičová, Katarína & Netherer, Sigrid & Arpaci, Alexander & Bontemps, Jean-Daniel & Bugmann, Hara, 2011. "Modelling natural disturbances in forest ecosystems: a review," Ecological Modelling, Elsevier, vol. 222(4), pages 903-924.
    7. Kanapaux, William & Kiker, Gregory A., 2013. "Development and testing of an object-oriented model for adaptively managing human disturbance of least tern (Sternula antillarum) nesting habitat," Ecological Modelling, Elsevier, vol. 268(C), pages 64-77.
    8. Wu, Lingfei & Kittur, Aniket & Youn, Hyejin & Milojević, Staša & Leahey, Erin & Fiore, Stephen M. & Ahn, Yong-Yeol, 2022. "Metrics and mechanisms: Measuring the unmeasurable in the science of science," Journal of Informetrics, Elsevier, vol. 16(2).
    9. repec:plo:pone00:0080137 is not listed on IDEAS
    10. Oluwaseye Samson Adedoja & Emmanuel Rotimi Sadiku & Yskandar Hamam, 2025. "Multicriteria Decision-Making for Sustainable Mining: Evaluating the Transition to Net-Zero-Carbon Energy Systems," Sustainability, MDPI, vol. 17(10), pages 1-32, May.
    11. J. J. Warmink & M. Brugnach & J. Vinke-de Kruijf & R. M. J. Schielen & D. C. M. Augustijn, 2017. "Coping with Uncertainty in River Management: Challenges and Ways Forward," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 31(14), pages 4587-4600, November.
    12. Yoon, Jeeyoung & Syafiandini, Arida Ferti & Song, Min, 2023. "Exploring the knowledge certainty shift: Metaknowledge analysis on drugs via assertion uncertainty burstiness," Journal of Informetrics, Elsevier, vol. 17(2).
    13. Sellak, Hamza & Ouhbi, Brahim & Frikh, Bouchra & Palomares, Iván, 2017. "Towards next-generation energy planning decision-making: An expert-based framework for intelligent decision support," Renewable and Sustainable Energy Reviews, Elsevier, vol. 80(C), pages 1544-1577.
    14. Elena Cervelli & Stefania Pindozzi & Emilia Allevato & Luigi Saulino & Roberto Silvestro & Ester Scotto di Perta & Antonio Saracino, 2022. "Landscape Planning Integrated Approaches to Support Post-Wildfire Restoration in Natural Protected Areas: The Vesuvius National Park Case Study," Land, MDPI, vol. 11(7), pages 1-25, July.
    15. Vanwindekens, Frédéric M. & Stilmant, Didier & Baret, Philippe V., 2013. "Development of a broadened cognitive mapping approach for analysing systems of practices in social–ecological systems," Ecological Modelling, Elsevier, vol. 250(C), pages 352-362.
    16. Reyes, René & Nelson, Harry & Zerriffi, Hisham, 2021. "How do decision makers´ ethnicity and religion influence the use of forests? Evidence from Chile," Forest Policy and Economics, Elsevier, vol. 128(C).
    17. Fischer, Beate & Klauer, Bernd & Schiller, Johannes, 2013. "Prospects for sustainable land-use policy in Germany: Experimenting with a sustainability heuristic," Ecological Economics, Elsevier, vol. 95(C), pages 213-220.
    18. Sacchelli, S. & Fabbrizzi, S., 2015. "Minimisation of uncertainty in decision-making processes using optimised probabilistic Fuzzy Cognitive Maps: A case study for a rural sector," Socio-Economic Planning Sciences, Elsevier, vol. 52(C), pages 31-40.
    19. Gourlis, Georgios & Kovacic, Iva, 2017. "Building Information Modelling for analysis of energy efficient industrial buildings – A case study," Renewable and Sustainable Energy Reviews, Elsevier, vol. 68(P2), pages 953-963.
    20. Hidayatno, Akhmad & Jafino, Bramka Arga & Setiawan, Andri D. & Purwanto, Widodo Wahyu, 2020. "When and why does transition fail? A model-based identification of adoption barriers and policy vulnerabilities for transition to natural gas vehicles," Energy Policy, Elsevier, vol. 138(C).
    21. S. Camici & L. Brocca & T. Moramarco, 2017. "Accuracy versus variability of climate projections for flood assessment in central Italy," Climatic Change, Springer, vol. 141(2), pages 273-286, March.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:infome:v:19:y:2025:i:2:s1751157725000252. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/joi .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.