IDEAS home Printed from https://ideas.repec.org/a/sae/anname/v659y2015i1p63-76.html
   My bibliography  Save this article

Is Bigger Always Better? Potential Biases of Big Data Derived from Social Network Sites

Author

Listed:
  • Eszter Hargittai

Abstract

This article discusses methodological challenges of using big data that rely on specific sites and services as their sampling frames, focusing on social network sites in particular. It draws on survey data to show that people do not select into the use of such sites randomly. Instead, use is biased in certain ways yielding samples that limit the generalizability of findings. Results show that age, gender, race/ethnicity, socioeconomic status, online experiences, and Internet skills all influence the social network sites people use and thus where traces of their behavior show up. This has implications for the types of conclusions one can draw from data derived from users of specific sites. The article ends by noting how big data studies can address the shortcomings that result from biased sampling frames.

Suggested Citation

  • Eszter Hargittai, 2015. "Is Bigger Always Better? Potential Biases of Big Data Derived from Social Network Sites," The ANNALS of the American Academy of Political and Social Science, , vol. 659(1), pages 63-76, May.
  • Handle: RePEc:sae:anname:v:659:y:2015:i:1:p:63-76
    DOI: 10.1177/0002716215570866
    as

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/0002716215570866
    Download Restriction: no

    File URL: https://libkey.io/10.1177/0002716215570866?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Rajagopal, 2013. "Social Media Metrics," Palgrave Macmillan Books, in: Managing Social Media and Consumerism, chapter 7, pages 132-151, Palgrave Macmillan.
    2. Ira M. Wasserman & Marie Richmond‐Abbott, 2005. "Gender and the Internet: Causes of Variation in Access, Level, and Scope of Use," Social Science Quarterly, Southwestern Social Science Association, vol. 86(1), pages 252-270, March.
    3. H Andrew Schwartz & Johannes C Eichstaedt & Margaret L Kern & Lukasz Dziurzynski & Stephanie M Ramones & Megha Agrawal & Achal Shah & Michal Kosinski & David Stillwell & Martin E P Seligman & Lyle H U, 2013. "Personality, Gender, and Age in the Language of Social Media: The Open-Vocabulary Approach," PLOS ONE, Public Library of Science, vol. 8(9), pages 1-16, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jisoo Sim & Patrick Miller, 2019. "Understanding an Urban Park through Big Data," IJERPH, MDPI, vol. 16(20), pages 1-16, October.
    2. Thomas Fujiwara & Karsten Müller & Carlo Schwarz, 2021. "The Effect of Social Media on Elections: Evidence from the United States," NBER Working Papers 28849, National Bureau of Economic Research, Inc.
    3. Thomas Fujiwara & Karsten Müller & Carlo Schwarz, 2021. "The Effect of Social Media on Elections: Evidence from the United States," NBER Working Papers 28849, National Bureau of Economic Research, Inc.
    4. Mortati, Marzia & Magistretti, Stefano & Cautela, Cabirio & Dell’Era, Claudio, 2023. "Data in design: How big data and thick data inform design thinking projects," Technovation, Elsevier, vol. 122(C).
    5. Stieglitz, Stefan & Mirbabaie, Milad & Ross, Björn & Neuberger, Christoph, 2018. "Social media analytics – Challenges in topic discovery, data collection, and data preparation," International Journal of Information Management, Elsevier, vol. 39(C), pages 156-168.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Stefano Di Lauro & Aizhan Tursunbayeva & Gilda Antonelli & Marcello Martinez, 2021. "Organizational and Corporate Identity on Social Media: A Literature Review," International Journal of Business and Management, Canadian Center of Science and Education, vol. 15(4), pages 1-53, July.
    2. Yadav, Manjit S. & de Valck, Kristine & Hennig-Thurau, Thorsten & Hoffman, Donna L. & Spann, Martin, 2013. "Social Commerce: A Contingency Framework for Assessing Marketing Potential," Journal of Interactive Marketing, Elsevier, vol. 27(4), pages 311-323.
    3. Galperin, Hernan & Arcidiacono, Malena, 2021. "Employment and the gender digital divide in Latin America: A decomposition analysis," Telecommunications Policy, Elsevier, vol. 45(7).
    4. Ivanete Schneider Hahn & Flavia Luciane Scherer & Kenny Basso & Marindia Brachak dos Santos, 2016. "Consumer Trust in and Emotional Response to Advertisements on Social Media and their Influence on Brand Evaluation," Brazilian Business Review, Fucape Business School, vol. 13(4), pages 49-71, July.
    5. Chiang, Eric P. & Assane, Djeto, 2008. "Music piracy among students on the university campus: Do males and females react differently?," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 37(4), pages 1371-1380, August.
    6. Matej Kovač & Vesna Žabkar, 2020. "Do Social Media and E-Mail Engagement Impact Reputation and Trust-Driven Behavior?," Tržište/Market, Faculty of Economics and Business, University of Zagreb, vol. 32(1), pages 9-25.
    7. Jiayin Pei & Guang Yu & Xianyun Tian & Maureen Renee Donnelley, 2017. "A new method for early detection of mass concern about public health issues," Journal of Risk Research, Taylor & Francis Journals, vol. 20(4), pages 516-532, April.
    8. Onețiu Darius Daniel, 2020. "The Impact of Social Media Adoption by Companies. Digital Transformation," Studia Universitatis „Vasile Goldis” Arad – Economics Series, Sciendo, vol. 30(2), pages 83-96, June.
    9. Karel Hrazdil & Jiri Novak & Rafael Rogo & Christine Wiedman & Ray Zhang, 2020. "Measuring executive personality using machine‐learning algorithms: A new approach and audit fee‐based validation tests," Journal of Business Finance & Accounting, Wiley Blackwell, vol. 47(3-4), pages 519-544, March.
    10. Luo, Shuli & He, Sylvia Y., 2021. "Understanding gender difference in perceptions toward transit services across space and time: A social media mining approach," Transport Policy, Elsevier, vol. 111(C), pages 63-73.
    11. Gow, Ian D. & Kaplan, Steven N. & Larcker, David F. & Zakolyukina, Anastasia A., 2016. "CEO Personality and Firm Policies," Research Papers 3444, Stanford University, Graduate School of Business.
    12. Rosenthal, Benjamin & Brito, Eliane P.Z., 2017. "How virtual brand community traces may increase fan engagement in brand pages," Business Horizons, Elsevier, vol. 60(3), pages 375-384.
    13. Mikkel Wallentin, 2018. "Sex differences in post-stroke aphasia rates are caused by age. A meta-analysis and database query," PLOS ONE, Public Library of Science, vol. 13(12), pages 1-18, December.
    14. Hannes Rosenbusch & Maya Aghaei & Anthony M. Evans & Marcel Zeelenberg, 2021. "Psychological trait inferences from women’s clothing: human and machine prediction," Journal of Computational Social Science, Springer, vol. 4(2), pages 479-501, November.
    15. Shahbaznezhad, Hamidreza & Dolan, Rebecca & Rashidirad, Mona, 2021. "The Role of Social Media Content Format and Platform in Users' Engagement Behavior," Journal of Interactive Marketing, Elsevier, vol. 53(C), pages 47-65.
    16. Ahlam Al-Muwil & Vishanth Weerakkody & Ramzi El-haddadeh & Yogesh Dwivedi, 2019. "Balancing Digital-By-Default with Inclusion: A Study of the Factors Influencing E-Inclusion in the UK," Information Systems Frontiers, Springer, vol. 21(3), pages 635-659, June.
    17. Wrzesinski, Marcel & Riechert, Patrick Urs & Dubois, Frédéric & Katzenbach, Christian, 2021. "Working with publication technology to make open access journals sustainable," EconStor Preprints 231355, ZBW - Leibniz Information Centre for Economics.
    18. Jeretta Horn Nord, 2013. "Managing the Business of Social Technologies," International Journal of Management, Knowledge and Learning, International School for Social and Business Studies, Celje, Slovenia, vol. 2(2), pages 255-267.
    19. Rachel Winter & Anna Lavis, 2021. "The Impact of COVID-19 on Young People’s Mental Health in the UK: Key Insights from Social Media Using Online Ethnography," IJERPH, MDPI, vol. 19(1), pages 1-13, December.
    20. Lushi Chen & Tao Gong & Michal Kosinski & David Stillwell & Robert L Davidson, 2017. "Building a profile of subjective well-being for social media users," PLOS ONE, Public Library of Science, vol. 12(11), pages 1-15, November.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sae:anname:v:659:y:2015:i:1:p:63-76. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: SAGE Publications (email available below). General contact details of provider: .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.