IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v128y2023i5d10.1007_s11192-023-04636-2.html
   My bibliography  Save this article

Biases in scholarly recommender systems: impact, prevalence, and mitigation

Author

Listed:
  • Michael Färber

    (Institute AIFB, Karlsruhe Institute of Technology (KIT))

  • Melissa Coutinho

    (Institute AIFB, Karlsruhe Institute of Technology (KIT))

  • Shuzhou Yuan

    (Institute AIFB, Karlsruhe Institute of Technology (KIT))

Abstract

With the remarkable increase in the number of scientific entities such as publications, researchers, and scientific topics, and the associated information overload in science, academic recommender systems have become increasingly important for millions of researchers and science enthusiasts. However, it is often overlooked that these systems are subject to various biases. In this article, we first break down the biases of academic recommender systems and characterize them according to their impact and prevalence. In doing so, we distinguish between biases originally caused by humans and biases induced by the recommender system. Second, we provide an overview of methods that have been used to mitigate these biases in the scholarly domain. Based on this, third, we present a framework that can be used by researchers and developers to mitigate biases in scholarly recommender systems and to evaluate recommender systems fairly. Finally, we discuss open challenges and possible research directions related to scholarly biases.

Suggested Citation

  • Michael Färber & Melissa Coutinho & Shuzhou Yuan, 2023. "Biases in scholarly recommender systems: impact, prevalence, and mitigation," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(5), pages 2703-2736, May.
  • Handle: RePEc:spr:scient:v:128:y:2023:i:5:d:10.1007_s11192-023-04636-2
    DOI: 10.1007/s11192-023-04636-2
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-023-04636-2
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-023-04636-2?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Joshua Ettinger & Friederike E. L. Otto & E. Lisa F. Schipper, 2021. "Storytelling can be a powerful tool for science," Nature, Nature, vol. 589(7842), pages 352-352, January.
    2. Sebastian Berger & Christoph Feldhaus & Axel Ockenfels, 2018. "A shared identity promotes herding in an information cascade game," Journal of the Economic Science Association, Springer;Economic Science Association, vol. 4(1), pages 63-72, July.
    3. Rachel Courtland, 2018. "Bias detectives: the researchers striving to make algorithms fair," Nature, Nature, vol. 558(7710), pages 357-360, June.
    4. Johan S. G. Chu & James A. Evans, 2021. "Slowed canonical progress in large fields of science," Proceedings of the National Academy of Sciences, Proceedings of the National Academy of Sciences, vol. 118(41), pages 2021636118-, October.
    5. Richard Van Noorden & Dalmeet Singh Chawla, 2019. "Hundreds of extreme self-citing scientists revealed in new database," Nature, Nature, vol. 572(7771), pages 578-579, August.
    6. Heather Sarsons, 2017. "Recognition for Group Work: Gender Differences in Academia," American Economic Review, American Economic Association, vol. 107(5), pages 141-145, May.
    7. Luke Holman & Devi Stuart-Fox & Cindy E Hauser, 2018. "The gender gap in science: How long until women are equally represented?," PLOS Biology, Public Library of Science, vol. 16(4), pages 1-20, April.
    8. Xianwen Wang & Chen Liu & Wenli Mao & Zhichao Fang, 2015. "Erratum to: The open access advantage considering citation, article usage and social media attention," Scientometrics, Springer;Akadémiai Kiadó, vol. 103(3), pages 1149-1149, June.
    9. Mitcham, Carl & Emeritus,, 2021. "Science policy and democracy," Technology in Society, Elsevier, vol. 67(C).
    10. Andersen, Jens Peter & Nielsen, Mathias Wullum, 2018. "Google Scholar and Web of Science: Examining gender differences in citation coverage across five scientific disciplines," Journal of Informetrics, Elsevier, vol. 12(3), pages 950-959.
    11. Shouhuai Xu & Moti Yung & Jingguo Wang, 2021. "Seeking Foundations for the Science of Cyber Security," Information Systems Frontiers, Springer, vol. 23(2), pages 263-267, April.
    12. Xianwen Wang & Chen Liu & Wenli Mao & Zhichao Fang, 2015. "The open access advantage considering citation, article usage and social media attention," Scientometrics, Springer;Akadémiai Kiadó, vol. 103(2), pages 555-564, May.
    13. Alberto Martín-Martín & Mike Thelwall & Enrique Orduna-Malea & Emilio Delgado López-Cózar, 2021. "Correction to: Google Scholar, Microsoft Academic, Scopus, Dimensions, Web of Science, and OpenCitations’ COCI: a multidisciplinary comparison of coverage via citations," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(1), pages 907-908, January.
    14. Jevin D West & Jennifer Jacquet & Molly M King & Shelley J Correll & Carl T Bergstrom, 2013. "The Role of Gender in Scholarly Authorship," PLOS ONE, Public Library of Science, vol. 8(7), pages 1-6, July.
    15. Jory Lerback & Brooks Hanson, 2017. "Journals invite too few women to referee," Nature, Nature, vol. 541(7638), pages 455-457, January.
    16. Shalabh, 2021. "Statistical inference via data science," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(3), pages 1155-1155, July.
    17. Mathias W. Nielsen, 2016. "Limits to meritocracy? Gender in academic recruitment and promotion processes," Science and Public Policy, Oxford University Press, vol. 43(3), pages 386-399.
    18. Abhay S. D. Rajput & Sangeeta Sharma, 2021. "India: draft science policy calls for public engagement," Nature, Nature, vol. 592(7852), pages 26-26, April.
    19. Jerome K. Vanclay, 2009. "Bias in the journal impact factor," Scientometrics, Springer;Akadémiai Kiadó, vol. 78(1), pages 3-12, January.
    20. Lisa Mandle & Analisa Shields-Estrada & Rebecca Chaplin-Kramer & Matthew G. E. Mitchell & Leah L. Bremer & Jesse D. Gourevitch & Peter Hawthorne & Justin A. Johnson & Brian E. Robinson & Jeffrey R. Sm, 2021. "Increasing decision relevance of ecosystem service science," Nature Sustainability, Nature, vol. 4(2), pages 161-169, February.
    21. Dag W. Aksnes, 2003. "A macro study of self-citation," Scientometrics, Springer;Akadémiai Kiadó, vol. 56(2), pages 235-246, February.
    22. Alberto Martín-Martín & Mike Thelwall & Enrique Orduna-Malea & Emilio Delgado López-Cózar, 2021. "Google Scholar, Microsoft Academic, Scopus, Dimensions, Web of Science, and OpenCitations’ COCI: a multidisciplinary comparison of coverage via citations," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(1), pages 871-906, January.
    23. Jean J. Wang & Sarah X. Shao & Fred Y. Ye, 2021. "Identifying 'seed' papers in sciences," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(7), pages 6001-6011, July.
    24. T. Ojasoo & J. C. Doré, 1999. "Citation bias in medical journals," Scientometrics, Springer;Akadémiai Kiadó, vol. 45(1), pages 81-94, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Liyin Zhang & Yuchen Qian & Chao Ma & Jiang Li, 2023. "Continued collaboration shortens the transition period of scientists who move to another institution," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(3), pages 1765-1784, March.
    2. Malte Hückstädt, 2023. "Ten reasons why research collaborations succeed—a random forest approach," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(3), pages 1923-1950, March.
    3. Cinzia Daraio & Simone Di Leo & Loet Leydesdorff, 2022. "Using the Leiden Rankings as a Heuristics: Evidence from Italian universities in the European landscape," LEM Papers Series 2022/08, Laboratory of Economics and Management (LEM), Sant'Anna School of Advanced Studies, Pisa, Italy.
    4. Zhuanlan Sun & C. Clark Cao & Sheng Liu & Yiwei Li & Chao Ma, 2024. "Behavioral consequences of second-person pronouns in written communications between authors and reviewers of scientific papers," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    5. Thomas, Duncan Andrew & Ramos-Vielba, Irene, 2022. "Reframing study of research(er) funding towards configurations and trails," SocArXiv uty2v, Center for Open Science.
    6. Weihua Li & Sam Zhang & Zhiming Zheng & Skyler J. Cranmer & Aaron Clauset, 2022. "Untangling the network effects of productivity and prominence among scientists," Nature Communications, Nature, vol. 13(1), pages 1-11, December.
    7. Cinzia Daraio & Simone Di Leo & Loet Leydesdorff, 2023. "A heuristic approach based on Leiden rankings to identify outliers: evidence from Italian universities in the European landscape," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(1), pages 483-510, January.
    8. Eitan Frachtenberg, 2022. "Multifactor Citation Analysis over Five Years: A Case Study of SIGMETRICS Papers," Publications, MDPI, vol. 10(4), pages 1-16, December.
    9. Marek Kwiek & Wojciech Roszka, 2022. "Academic vs. biological age in research on academic careers: a large-scale study with implications for scientifically developing systems," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(6), pages 3543-3575, June.
    10. Katchanov, Yurij L. & Markova, Yulia V. & Shmatko, Natalia A., 2023. "Uncited papers in the structure of scientific communication," Journal of Informetrics, Elsevier, vol. 17(2).
    11. Jiang, Zhuoren & Lin, Tianqianjin & Huang, Cui, 2023. "Deep representation learning of scientific paper reveals its potential scholarly impact," Journal of Informetrics, Elsevier, vol. 17(1).
    12. Liang, Zhentao & Ba, Zhichao & Mao, Jin & Li, Gang, 2023. "Research complexity increases with scientists’ academic age: Evidence from library and information science," Journal of Informetrics, Elsevier, vol. 17(1).
    13. Manuel Goyanes & Márton Demeter & Aurea Grané & Tamás Tóth & Homero Gil Zúñiga, 2023. "Research patterns in communication (2009–2019): testing female representation and productivity differences, within the most cited authors and the field," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(1), pages 137-156, January.
    14. Pierre Pelletier & Kevin Wirtz, 2023. "Sails and Anchors: The Complementarity of Exploratory and Exploitative Scientists in Knowledge Creation," Papers 2312.10476, arXiv.org.
    15. Wu, Lingfei & Kittur, Aniket & Youn, Hyejin & Milojević, Staša & Leahey, Erin & Fiore, Stephen M. & Ahn, Yong-Yeol, 2022. "Metrics and mechanisms: Measuring the unmeasurable in the science of science," Journal of Informetrics, Elsevier, vol. 16(2).
    16. Raminta Pranckutė, 2021. "Web of Science (WoS) and Scopus: The Titans of Bibliographic Information in Today’s Academic World," Publications, MDPI, vol. 9(1), pages 1-59, March.
    17. Chompunuch Saravudecha & Duangruthai Na Thungfai & Chananthida Phasom & Sodsri Gunta-in & Aorrakanya Metha & Peangkobfah Punyaphet & Tippawan Sookruay & Wannachai Sakuludomkan & Nut Koonrungsesomboon, 2023. "Hybrid Gold Open Access Citation Advantage in Clinical Medicine: Analysis of Hybrid Journals in the Web of Science," Publications, MDPI, vol. 11(2), pages 1-9, March.
    18. Lin Zhang & Yuanyuan Shang & Ying Huang & Gunnar Sivertsen, 2022. "Gender differences among active reviewers: an investigation based on publons," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(1), pages 145-179, January.
    19. Josh Yamamoto & Eitan Frachtenberg, 2022. "Gender Differences in Collaboration Patterns in Computer Science," Publications, MDPI, vol. 10(1), pages 1-21, February.
    20. Lutz Bornmann & Robin Haunschild & Rüdiger Mutz, 2021. "Growth rates of modern science: a latent piecewise growth curve approach to model publication numbers from established and new literature databases," Palgrave Communications, Palgrave Macmillan, vol. 8(1), pages 1-15, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:128:y:2023:i:5:d:10.1007_s11192-023-04636-2. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.