IDEAS home Printed from https://ideas.repec.org/p/osf/osfxxx/jvpbw.html
   My bibliography  Save this paper

Ethnography and Machine Learning: Synergies and New Directions

Author

Listed:
  • Abramson, Corey
  • Li, Zhuofan

Abstract

Ethnography (social scientific methods that illuminate how people understand, navigate and shape the real world contexts in which they live their lives) and machine learning (computational techniques that use big data and statistical learning models to perform quantifiable tasks) are each core to contemporary social science. Yet these tools have remained largely separate in practice. This chapter draws on a growing body of scholarship that argues that ethnography and machine learning can be usefully combined, particularly for large comparative studies. Specifically, this paper (a) explains the value (and challenges) of using machine learning alongside qualitative field research for certain types of projects, (b) discusses recent methodological trends to this effect, (c) provides examples that illustrate workflow drawn from several large projects, and (d) concludes with a roadmap for enabling productive coevolution of field methods and machine learning. Keywords ethnography, computational social science, qualitative methods, machine learning, natural language processing, large language models, computational ethnography, digital ethnography, big data, research methods, mixed-methods

Suggested Citation

  • Abramson, Corey & Li, Zhuofan, 2024. "Ethnography and Machine Learning: Synergies and New Directions," OSF Preprints jvpbw, Center for Open Science.
  • Handle: RePEc:osf:osfxxx:jvpbw
    DOI: 10.31219/osf.io/jvpbw
    as

    Download full text from publisher

    File URL: https://osf.io/download/675616d36b40ab0a3f661619/
    Download Restriction: no

    File URL: https://libkey.io/10.31219/osf.io/jvpbw?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Freese, Jeremy & Peterson, David, 2017. "Replication in Social Science," SocArXiv 5bck9, Center for Open Science.
    2. Richard Van Noorden, 2022. "How language-generation AIs could transform science," Nature, Nature, vol. 605(7908), pages 21-21, May.
    3. Bart Bonikowski & Laura K. Nelson, 2022. "From Ends to Means: The Promise of Computational Text Analysis for Theoretically Driven Sociological Research," Sociological Methods & Research, , vol. 51(4), pages 1469-1483, November.
    4. Laura K. Nelson & Derek Burk & Marcel Knudsen & Leslie McCall, 2021. "The Future of Coding: A Comparison of Hand-Coding and Three Types of Computer-Assisted Text Analysis Methods," Sociological Methods & Research, , vol. 50(1), pages 202-237, February.
    5. Laura K. Nelson, 2020. "Computational Grounded Theory: A Methodological Framework," Sociological Methods & Research, , vol. 49(1), pages 3-42, February.
    6. Nikolitsa Grigoropoulou & Mario L. Small, 2022. "The data revolution in social science needs qualitative research," Nature Human Behaviour, Nature, vol. 6(7), pages 904-906, July.
    7. repec:osf:socarx:5bck9_v1 is not listed on IDEAS
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. repec:osf:osfxxx:jvpbw_v1 is not listed on IDEAS
    2. Alex Luscombe & Kevin Dick & Kevin Walby, 2022. "Algorithmic thinking in the public interest: navigating technical, legal, and ethical hurdles to web scraping in the social sciences," Quality & Quantity: International Journal of Methodology, Springer, vol. 56(3), pages 1023-1044, June.
    3. AJ Alvero & Jasmine Pal & Katelyn M. Moussavian, 2022. "Linguistic, cultural, and narrative capital: computational and human readings of transfer admissions essays," Journal of Computational Social Science, Springer, vol. 5(2), pages 1709-1734, November.
    4. Bernhardt, Lea & Dewenter, Ralf & Thomas, Tobias, 2023. "Measuring partisan media bias in US newscasts from 2001 to 2012," European Journal of Political Economy, Elsevier, vol. 78(C).
    5. Daniel Caballero-Julia & Philippe Campillo, 2021. "Epistemological Considerations of Text Mining: Implications for Systematic Literature Review," Mathematics, MDPI, vol. 9(16), pages 1-26, August.
    6. Mats Alvesson & Jörgen Sandberg, 2020. "The Problematizing Review: A Counterpoint to Elsbach and Van Knippenberg’s Argument for Integrative Reviews," Journal of Management Studies, Wiley Blackwell, vol. 57(6), pages 1290-1304, September.
    7. Anna Dreber & Magnus Johannesson, 2025. "A framework for evaluating reproducibility and replicability in economics," Economic Inquiry, Western Economic Association International, vol. 63(2), pages 338-356, April.
    8. Ho-Chun Herbert Chang & Brooke Harrington & Feng Fu & Daniel Rockmore, 2023. "Complex Systems of Secrecy: The Offshore Networks of Oligarchs," Papers 2303.03371, arXiv.org.
    9. Franz Neuberger & Martin Bujard & Tobias Rüttenauer, 2022. "Where does public childcare boost female labor force participation? Exploring geographical heterogeneity across Germany 2007–2017," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 46(24), pages 693-722.
    10. Fanelli, Daniele, 2020. "Metascientific reproducibility patterns revealed by informatic measure of knowledge," MetaArXiv 5vnhj, Center for Open Science.
    11. repec:osf:osfxxx:mydzv_v1 is not listed on IDEAS
    12. Stijn Daenekindt & Julian Schaap, 2022. "Using word embedding models to capture changing media discourses: a study on the role of legitimacy, gender and genre in 24,000 music reviews, 1999–2021," Journal of Computational Social Science, Springer, vol. 5(2), pages 1615-1636, November.
    13. Ashwin,Julian & Chhabra,Aditya & Rao,Vijayendra, 2023. "Using Large Language Models for Qualitative Analysis can Introduce Serious Bias," Policy Research Working Paper Series 10597, The World Bank.
    14. Özgür Özvatan & Bastian Neuhauser & Gökçe Yurdakul, 2023. "The ‘Arab Clans’ Discourse: Narrating Racialization, Kinship, and Crime in the German Media," Social Sciences, MDPI, vol. 12(2), pages 1-18, February.
    15. Martin Kreidl & Zuzana Žilinčíková, 2023. "Adult children’s union type and contact with mothers: A replication," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 48(23), pages 641-680.
    16. Luis Alfonso Dau & Grazia D. Santangelo & Arjen Witteloostuijn, 2022. "Replication studies in international business," Journal of International Business Studies, Palgrave Macmillan;Academy of International Business, vol. 53(2), pages 215-230, March.
    17. Mangiò, Federico & Mismetti, Marco & Lissana, Elena & Andreini, Daniela, 2023. "That's the Press, Baby! How journalists co-create family business brands meanings: A mixed method analysis," Journal of Business Research, Elsevier, vol. 161(C).
    18. repec:osf:metaar:67sak_v1 is not listed on IDEAS
    19. Boris Forthmann & Mark A. Runco, 2020. "An Empirical Test of the Inter-Relationships between Various Bibliometric Creative Scholarship Indicators," Publications, MDPI, vol. 8(2), pages 1-16, June.
    20. Yuyan Jiang & Xueli Liu, 2023. "A construction and empirical research of the journal disruption index based on open citation data," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(7), pages 3935-3958, July.
    21. Scholdra, Thomas P. & Wichmann, Julian R.K. & Reinartz, Werner J., 2023. "Reimagining personalization in the physical store," Journal of Retailing, Elsevier, vol. 99(4), pages 563-579.
    22. Christof Brandtner & Parham Ashur & Bhargav Srinivasa-Desikan, 2025. "Dynamic persistence of institutions : Modeling the historical endurance of Red Vienna’s public housing utopia," Post-Print hal-04907529, HAL.
    23. Andrew G.H. Thompson & Oliver Escobar & Jennifer J. Roberts & Stephen Elstub & Niccole M. Pamphilis, 2021. "The Importance of Context and the Effect of Information and Deliberation on Opinion Change Regarding Environmental Issues in Citizens’ Juries," Sustainability, MDPI, vol. 13(17), pages 1-21, September.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:osf:osfxxx:jvpbw. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: OSF (email available below). General contact details of provider: https://osf.io/preprints/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.