IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0235502.html
   My bibliography  Save this article

Evolution and impact of bias in human and machine learning algorithm interaction

Author

Listed:
  • Wenlong Sun
  • Olfa Nasraoui
  • Patrick Shafto

Abstract

Traditionally, machine learning algorithms relied on reliable labels from experts to build predictions. More recently however, algorithms have been receiving data from the general population in the form of labeling, annotations, etc. The result is that algorithms are subject to bias that is born from ingesting unchecked information, such as biased samples and biased labels. Furthermore, people and algorithms are increasingly engaged in interactive processes wherein neither the human nor the algorithms receive unbiased data. Algorithms can also make biased predictions, leading to what is now known as algorithmic bias. On the other hand, human’s reaction to the output of machine learning methods with algorithmic bias worsen the situations by making decision based on biased information, which will probably be consumed by algorithms later. Some recent research has focused on the ethical and moral implication of machine learning algorithmic bias on society. However, most research has so far treated algorithmic bias as a static factor, which fails to capture the dynamic and iterative properties of bias. We argue that algorithmic bias interacts with humans in an iterative manner, which has a long-term effect on algorithms’ performance. For this purpose, we present an iterated-learning framework that is inspired from human language evolution to study the interaction between machine learning algorithms and humans. Our goal is to study two sources of bias that interact: the process by which people select information to label (human action); and the process by which an algorithm selects the subset of information to present to people (iterated algorithmic bias mode). We investigate three forms of iterated algorithmic bias (personalization filter, active learning, and random) and how they affect the performance of machine learning algorithms by formulating research questions about the impact of each type of bias. Based on statistical analyses of the results of several controlled experiments, we found that the three different iterated bias modes, as well as initial training data class imbalance and human action, do affect the models learned by machine learning algorithms. We also found that iterated filter bias, which is prominent in personalized user interfaces, can lead to more inequality in estimated relevance and to a limited human ability to discover relevant data. Our findings indicate that the relevance blind spot (items from the testing set whose predicted relevance probability is less than 0.5 and who thus risk being hidden from humans) amounted to 4% of all relevant items when using a content-based filter that predicts relevant items. A similar simulation using a real-life rating data set found that the same filter resulted in a blind spot size of 75% of the relevant testing set.

Suggested Citation

  • Wenlong Sun & Olfa Nasraoui & Patrick Shafto, 2020. "Evolution and impact of bias in human and machine learning algorithm interaction," PLOS ONE, Public Library of Science, vol. 15(8), pages 1-39, August.
  • Handle: RePEc:plo:pone00:0235502
    DOI: 10.1371/journal.pone.0235502
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0235502
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0235502&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0235502?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Jörg Rieskamp & Jerome R. Busemeyer & Barbara A. Mellers, 2006. "Extending the Bounds of Rationality: Evidence and Theories of Preferential Choice," Journal of Economic Literature, American Economic Association, vol. 44(3), pages 631-661, September.
    2. Daniel D. Lee & H. Sebastian Seung, 1999. "Learning the parts of objects by non-negative matrix factorization," Nature, Nature, vol. 401(6755), pages 788-791, October.
    3. James J. Heckman, 1977. "Sample Selection Bias As a Specification Error (with an Application to the Estimation of Labor Supply Functions)," NBER Working Papers 0172, National Bureau of Economic Research, Inc.
    4. Michael E. Tipping & Christopher M. Bishop, 1999. "Probabilistic Principal Component Analysis," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 61(3), pages 611-622.
    5. Jerome D. Williams & David Lopez & Patrick Shafto & Kyungwon Lee, 2019. "Technological Workforce and Its Impact on Algorithmic Justice in Politics," Customer Needs and Solutions, Springer;Institute for Sustainable Innovation and Growth (iSIG), vol. 6(3), pages 84-91, December.
    6. Chong Ju Choi & Carla C. J. M. Millar & Caroline Y. L. Wong, 2005. "Knowledge and the State," Palgrave Macmillan Books, in: Knowledge Entanglements, chapter 0, pages 19-38, Palgrave Macmillan.
    7. Yakun Li & Jiaomin Liu & Jiadong Ren, 2019. "Social recommendation model based on user interaction in complex social networks," PLOS ONE, Public Library of Science, vol. 14(7), pages 1-17, July.
    8. Alina Sîrbu & Dino Pedreschi & Fosca Giannotti & János Kertész, 2019. "Algorithmic bias amplifies opinion fragmentation and polarization: A bounded confidence model," PLOS ONE, Public Library of Science, vol. 14(3), pages 1-20, March.
    9. Mubbashir Ayub & Mustansar Ali Ghazanfar & Zahid Mehmood & Tanzila Saba & Riad Alharbey & Asmaa Mahdi Munshi & Mayda Abdullateef Alrige, 2019. "Modeling user rating preference behavior to improve the performance of the collaborative filtering based recommender systems," PLOS ONE, Public Library of Science, vol. 14(8), pages 1-29, August.
    10. Jon Kleinberg & Jens Ludwig & Sendhil Mullainathan & Ashesh Rambachan, 2018. "Algorithmic Fairness," AEA Papers and Proceedings, American Economic Association, vol. 108, pages 22-27, May.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Akter, Shahriar & Dwivedi, Yogesh K. & Sajib, Shahriar & Biswas, Kumar & Bandara, Ruwan J. & Michael, Katina, 2022. "Algorithmic bias in machine learning-based marketing models," Journal of Business Research, Elsevier, vol. 144(C), pages 201-216.
    2. Akter, Shahriar & Hossain, Md Afnan & Sajib, Shahriar & Sultana, Saida & Rahman, Mahfuzur & Vrontis, Demetris & McCarthy, Grace, 2023. "A framework for AI-powered service innovation capability: Review and agenda for future research," Technovation, Elsevier, vol. 125(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xiao-Bai Li & Jialun Qin, 2017. "Anonymizing and Sharing Medical Text Records," Information Systems Research, INFORMS, vol. 28(2), pages 332-352, June.
    2. Wentao Qu & Xianchao Xiu & Huangyue Chen & Lingchen Kong, 2023. "A Survey on High-Dimensional Subspace Clustering," Mathematics, MDPI, vol. 11(2), pages 1-39, January.
    3. Waleed Reafee & Naomie Salim & Atif Khan, 2016. "The Power of Implicit Social Relation in Rating Prediction of Social Recommender Systems," PLOS ONE, Public Library of Science, vol. 11(5), pages 1-20, May.
    4. Del Corso, Gianna M. & Romani, Francesco, 2019. "Adaptive nonnegative matrix factorization and measure comparisons for recommender systems," Applied Mathematics and Computation, Elsevier, vol. 354(C), pages 164-179.
    5. Oliver Hinz & Jochen Eckert, 2010. "The Impact of Search and Recommendation Systems on Sales in Electronic Commerce," Business & Information Systems Engineering: The International Journal of WIRTSCHAFTSINFORMATIK, Springer;Gesellschaft für Informatik e.V. (GI), vol. 2(2), pages 67-77, April.
    6. Shweta Bahl & Ajay Sharma, 2021. "Education–Occupation Mismatch and Dispersion in Returns to Education: Evidence from India," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 153(1), pages 251-298, January.
    7. P Fogel & C Geissler & P Cotte & G Luta, 2022. "Applying separative non-negative matrix factorization to extra-financial data," Working Papers hal-03689774, HAL.
    8. Vitaliy Roud & Thomas Wolfgang Thurner, 2018. "The Influence of State‐Ownership on Eco‐Innovations in Russian Manufacturing Firms," Journal of Industrial Ecology, Yale University, vol. 22(5), pages 1213-1227, October.
    9. Wang, Zihan & Daeipour, Mohamad & Xu, Hongyi, 2023. "Quantification and propagation of Aleatoric uncertainties in topological structures," Reliability Engineering and System Safety, Elsevier, vol. 233(C).
    10. Lawrence Bunnell & Kweku-Muata Osei-Bryson & Victoria Y. Yoon, 0. "RecSys Issues Ontology: A Knowledge Classification of Issues for Recommender Systems Researchers," Information Systems Frontiers, Springer, vol. 0, pages 1-42.
    11. Martinovici, A., 2019. "Revealing attention - how eye movements predict brand choice and moment of choice," Other publications TiSEM 7dca38a5-9f78-4aee-bd81-c, Tilburg University, School of Economics and Management.
    12. Joanna Sokolowska & Patrycja Sleboda, 2015. "The Inverse Relation Between Risks and Benefits: The Role of Affect and Expertise," Risk Analysis, John Wiley & Sons, vol. 35(7), pages 1252-1267, July.
    13. Donald R. Haurin & Stuart S. Rosenthal, 2009. "Language, Agglomeration and Hispanic Homeownership," Real Estate Economics, American Real Estate and Urban Economics Association, vol. 37(2), pages 155-183, June.
    14. Jong Won Min, 2019. "The Influence of Stigma and Views on Mental Health Treatment Effectiveness on Service Use by Age and Ethnicity: Evidence From the CDC BRFSS 2007, 2009, and 2012," SAGE Open, , vol. 9(3), pages 21582440198, September.
    15. Khalil, Elias L., 2010. "The Bayesian fallacy: Distinguishing internal motivations and religious beliefs from other beliefs," Journal of Economic Behavior & Organization, Elsevier, vol. 75(2), pages 268-280, August.
    16. Zhan (Michael) Shi & T. S. Raghu, 2020. "An Economic Analysis of Product Recommendation in the Presence of Quality and Taste-Match Heterogeneity," Information Systems Research, INFORMS, vol. 31(2), pages 399-411, June.
    17. Xin Xu & Yang Lu & Yupeng Zhou & Zhiguo Fu & Yanjie Fu & Minghao Yin, 2021. "An Information-Explainable Random Walk Based Unsupervised Network Representation Learning Framework on Node Classification Tasks," Mathematics, MDPI, vol. 9(15), pages 1-14, July.
    18. Borenstein, Severin & Netz, Janet, 1999. "Why do all the flights leave at 8 am?: Competition and departure-time differentiation in airline markets," International Journal of Industrial Organization, Elsevier, vol. 17(5), pages 611-640, July.
    19. Voxi Amavilah & Antonio R. Andrés, 2014. "Globalization, Peace & Stability, Governance, and Knowledge Economy," Research Africa Network Working Papers 14/012, Research Africa Network (RAN).
    20. Naiyang Guan & Lei Wei & Zhigang Luo & Dacheng Tao, 2013. "Limited-Memory Fast Gradient Descent Method for Graph Regularized Nonnegative Matrix Factorization," PLOS ONE, Public Library of Science, vol. 8(10), pages 1-10, October.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0235502. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.