IDEAS home Printed from https://ideas.repec.org/a/spr/qualqt/v57y2023i6d10.1007_s11135-023-01615-w.html
   My bibliography  Save this article

Sampling Twitter users for social science research: evidence from a systematic review of the literature

Author

Listed:
  • Paula Vicente

    (ISCTE-Instituto Universitário de Lisboa)

Abstract

All social media platforms can be used to conduct social science research, but Twitter is the most popular as it provides its data via several Application Programming Interfaces, which allows qualitative and quantitative research to be conducted with its members. As Twitter is a huge universe, both in number of users and amount of data, sampling is generally required when using it for research purposes. Researchers only recently began to question whether tweet-level sampling—in which the tweet is the sampling unit—should be replaced by user-level sampling—in which the user is the sampling unit. The major rationale for this shift is that tweet-level sampling does not consider the fact that some core discussants on Twitter are much more active tweeters than other less active users, thus causing a sample biased towards the more active users. The knowledge on how to select representative samples of users in the Twitterverse is still insufficient despite its relevance for reliable and valid research outcomes. This paper contributes to this topic by presenting a systematic quantitative literature review of sampling plans designed and executed in the context of social science research in Twitter, including: (1) the definition of the target populations, (2) the sampling frames used to support sample selection, (3) the sampling methods used to obtain samples of Twitter users, (4) how data is collected from Twitter users, (5) the size of the samples, and (6) how research validity is addressed. This review can be a methodological guide for professionals and academics who want to conduct social science research involving Twitter users and the Twitterverse.

Suggested Citation

  • Paula Vicente, 2023. "Sampling Twitter users for social science research: evidence from a systematic review of the literature," Quality & Quantity: International Journal of Methodology, Springer, vol. 57(6), pages 5449-5489, December.
  • Handle: RePEc:spr:qualqt:v:57:y:2023:i:6:d:10.1007_s11135-023-01615-w
    DOI: 10.1007/s11135-023-01615-w
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11135-023-01615-w
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11135-023-01615-w?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Yu, Houqiang & Xiao, Tingting & Xu, Shenmeng & Wang, Yuefen, 2019. "Who posts scientific tweets? An investigation into the productivity, locations, and identities of scientific tweeters," Journal of Informetrics, Elsevier, vol. 13(3), pages 841-855.
    2. Darja Reuschke & Jed Long & Nick Bennett, 2021. "Locating creativity in the city using Twitter data," Environment and Planning B, , vol. 48(9), pages 2607-2622, November.
    3. Rachel Sharples, 2021. "Disrupting State Spaces: Asylum Seekers in Australia’s Offshore Detention Centres," Social Sciences, MDPI, vol. 10(3), pages 1-16, March.
    4. Kristen Olson, 2013. "Paradata for Nonresponse Adjustment," The ANNALS of the American Academy of Political and Social Science, , vol. 645(1), pages 142-170, January.
    5. Tomu Tominaga & Yoshinori Hijikata & Joseph A. Konstan, 2018. "How self-disclosure in Twitter profiles relate to anonymity consciousness and usage objectives: a cross-cultural study," Journal of Computational Social Science, Springer, vol. 1(2), pages 391-435, September.
    6. Hino, Airo & Fahey, Robert A., 2019. "Representing the Twittersphere: Archiving a representative sample of Twitter data under resource constraints," International Journal of Information Management, Elsevier, vol. 48(C), pages 175-184.
    7. Eszter Hargittai, 2015. "Is Bigger Always Better? Potential Biases of Big Data Derived from Social Network Sites," The ANNALS of the American Academy of Political and Social Science, , vol. 659(1), pages 63-76, May.
    8. Samuel-Azran, Tal & Hayat, Tsahi (Zack), 2020. "The geography of the Arab public sphere on Twitter," Technology in Society, Elsevier, vol. 62(C).
    9. Fischer, Eileen & Reuber, A. Rebecca, 2011. "Social interaction via new social media: (How) can interactions on Twitter affect effectual thinking and behavior?," Journal of Business Venturing, Elsevier, vol. 26(1), pages 1-18, January.
    10. Emilio Ferrara & Zeyao Yang, 2015. "Measuring Emotional Contagion in Social Media," PLOS ONE, Public Library of Science, vol. 10(11), pages 1-14, November.
    11. Han, Sehee & Min, Jinyoung & Lee, Heeseok, 2015. "Antecedents of social presence and gratification of social connection needs in SNS: A study of Twitter users and their mobile and non-mobile usage," International Journal of Information Management, Elsevier, vol. 35(4), pages 459-471.
    12. Kristina Lerman & Luciano G. Marin & Megha Arora & Lucas H. Costa Lima & Emilio Ferrara & David Garcia, 2018. "Language, demographics, emotions, and the structure of online social networks," Journal of Computational Social Science, Springer, vol. 1(1), pages 209-225, January.
    13. Gregory Eady & Jonathan Nagler & Andy Guess & Jan Zilinsky & Joshua A. Tucker, 2019. "How Many People Live in Political Bubbles on Social Media? Evidence From Linked Survey and Twitter Data," SAGE Open, , vol. 9(1), pages 21582440198, February.
    14. Mohammed, Abdulalem & Ferraris, Alberto, 2021. "Factors influencing user participation in social media: Evidence from twitter usage during COVID-19 pandemic in Saudi Arabia," Technology in Society, Elsevier, vol. 66(C).
    15. Moshkovitz, Karin & Hayat, Tsahi, 2021. "The rich get richer: Extroverts' social capital on twitter," Technology in Society, Elsevier, vol. 65(C).
    16. Schaarschmidt, Mario & Könsgen, Raoul, 2020. "Good citizen, good ambassador? Linking employees' reputation perceptions with supportive behavior on Twitter," Journal of Business Research, Elsevier, vol. 117(C), pages 754-763.
    17. Tao Lu & Aimee L. Franklin, 2018. "A Protocol for Identifying and Sampling From Proxy Populations," Social Science Quarterly, Southwestern Social Science Association, vol. 99(4), pages 1535-1546, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Thomas Fujiwara & Karsten Müller & Carlo Schwarz, 2024. "The Effect of Social Media on Elections: Evidence from The United States," Journal of the European Economic Association, European Economic Association, vol. 22(3), pages 1495-1539.
    2. Thomas Fujiwara & Karsten Müller & Carlo Schwarz, 2021. "The Effect of Social Media on Elections: Evidence from the United States," NBER Working Papers 28849, National Bureau of Economic Research, Inc.
    3. Karunakaran, Arvind & Orlikowski, Wanda J. & Scott, Susan V., 2022. "Crowd-based accountability: examining how social media commentary reconfigures organizational accountability," LSE Research Online Documents on Economics 114401, London School of Economics and Political Science, LSE Library.
    4. repec:osf:osfxxx:x6vda_v1 is not listed on IDEAS
    5. Mochon, Daniel & Schwartz, Janet, 2024. "The confrontation effect: When users engage more with ideology-inconsistent content online," Organizational Behavior and Human Decision Processes, Elsevier, vol. 185(C).
    6. Guohui Song & Yongbin Wang, 2021. "Mainstream Value Information Push Strategy on Chinese Aggregation News Platform: Evolution, Modelling and Analysis," Sustainability, MDPI, vol. 13(19), pages 1-17, October.
    7. Rydén, Pernille & Ringberg, Torsten & Wilke, Ricky, 2015. "How Managers' Shared Mental Models of Business–Customer Interactions Create Different Sensemaking of Social Media," Journal of Interactive Marketing, Elsevier, vol. 31(C), pages 1-16.
    8. Langley, David J. & Hoeve, Maarten C. & Ortt, J. Roland & Pals, Nico & van der Vecht, Bob, 2014. "Patterns of Herding and their Occurrence in an Online Setting," Journal of Interactive Marketing, Elsevier, vol. 28(1), pages 16-25.
    9. Rippa, Pierluigi & Secundo, Giustina, 2019. "Digital academic entrepreneurship: The potential of digital technologies on academic entrepreneurship," Technological Forecasting and Social Change, Elsevier, vol. 146(C), pages 900-911.
    10. Plé, Loïc & Demangeot, Catherine, 2020. "Social contagion of online and offline deviant behaviors and its value outcomes: The case of tourism ecosystems," Journal of Business Research, Elsevier, vol. 117(C), pages 886-896.
    11. Yandong Wang & Teng Wang & Xinyue Ye & Jianqi Zhu & Jay Lee, 2015. "Using Social Media for Emergency Response and Urban Sustainability: A Case Study of the 2012 Beijing Rainstorm," Sustainability, MDPI, vol. 8(1), pages 1-17, December.
    12. Hainan Huang & Weifan Chen & Tian Xie & Yaoyao Wei & Ziqing Feng & Weijiong Wu, 2021. "The Impact of Individual Behaviors and Governmental Guidance Measures on Pandemic-Triggered Public Sentiment Based on System Dynamics and Cross-Validation," IJERPH, MDPI, vol. 18(8), pages 1-25, April.
    13. Cao, Xiongfei & Yu, Lingling, 2019. "Exploring the influence of excessive social media use at work: A three-dimension usage perspective," International Journal of Information Management, Elsevier, vol. 46(C), pages 83-92.
    14. Werner Liebregts & Pourya Darnihamedani & Eric Postma & Martin Atzmueller, 2020. "The promise of social signal processing for research on decision-making in entrepreneurial contexts," Small Business Economics, Springer, vol. 55(3), pages 589-605, October.
    15. Marko Matalamäki & Tero Vuorinen & Elina Varamäki & Kirsti Sorama, 2017. "Business Growth in Established Companies; Roles of Effectuation and Causation," Journal of Enterprising Culture (JEC), World Scientific Publishing Co. Pte. Ltd., vol. 25(02), pages 123-148, June.
    16. Yifan Yu & Shan Huang & Yuchen Liu & Yong Tan, 2020. "Emotions in Online Content Diffusion," Papers 2011.09003, arXiv.org, revised Mar 2022.
    17. Lisa Oswald, 2024. "More than news! Mapping the deliberative potential of a political online ecosystem with digital trace data," Palgrave Communications, Palgrave Macmillan, vol. 11(1), pages 1-16, December.
    18. Leonidou, Erasmia & Christofi, Michael & Vrontis, Demetris & Thrassou, Alkis, 2020. "An integrative framework of stakeholder engagement for innovation management and entrepreneurship development," Journal of Business Research, Elsevier, vol. 119(C), pages 245-258.
    19. Jianshan Sun & Jian Song & Yuanchun Jiang & Yezheng Liu & Jun Li, 2022. "Prick the filter bubble: A novel cross domain recommendation model with adaptive diversity regularization," Electronic Markets, Springer;IIM University of St. Gallen, vol. 32(1), pages 101-121, March.
    20. Fraccastoro, Sara & Gabrielsson, Mika & Chetty, Sylvie, 2021. "Social Media Firm Specific Advantages as Enablers of Network Embeddedness of International Entrepreneurial Ventures," Journal of World Business, Elsevier, vol. 56(3).
    21. Pierluigi Conzo & Andrea Gallice & Juan S. Morales & Margaret Samahita & Laura K. Taylor, 2021. "Can Hearts Change Minds? Social media Endorsements and Policy Preferences," Carlo Alberto Notebooks 641, Collegio Carlo Alberto.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:qualqt:v:57:y:2023:i:6:d:10.1007_s11135-023-01615-w. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.