IDEAS home Printed from https://ideas.repec.org/a/nos/voprob/2016i4p106-122.html

In Search of Lost Profiles: The Reliability of VKontakte Data and Its Importance for Educational Research

Author

Abstract

Ivan Smirnov - ResearchAssistant, Institute of Education, National Research University Higher School of Economics. E-mail: ibsmirnov@hse.ruElizaveta Sivak - Research Fellow, Institute of Education, National Research University Higher School of Economics. E-mail: esivak@hse.ruYana Kozmina - Junior Research Fellow, Institute of Education, National Research University Higher School of Economics. E-mail: ikozmina@hse.ruAddress: 20 Myasnitskaya str., 101000 Moscow, Russian Federation.The potential of VKontakte as a data source is now acknowledged in educational research, but little is known about the reliability of data obtained from this social network and about its sampling bias. Our article investigates the reliability of VK data, using the examples of a secondary school (766 students) and a university (15,757 students). We describe the procedure of matching V K profiles to real students. A direct comparison permitted us to identify profiles of around 18% of students. A special technique introduced in the article increased this number up to 88% for school students and up to 93% for university students. We compare age, gender and GPA of identified students and those whomwe did not find on V K. We also compare the structure of social relationships, retrieved from VK data, to the expected structure of students' social ties. We found that the structure of virtual' social relationships reproduces both the socio-demographic division of students into classes or majors andthe spatial division into different school buildings or university campuses. To our knowledge, it is the first study of this kind and scale based on VK data. It contributes to the understanding of how reliable data from this SNS is, how its accuracy can be improved, and how it can be used in educational research.

Suggested Citation

  • Ivan Smirnov & Elizaveta Sivak & Yana Kozmina, 2016. "In Search of Lost Profiles: The Reliability of VKontakte Data and Its Importance for Educational Research," Voprosy obrazovaniya / Educational Studies Moscow, National Research University Higher School of Economics, issue 4, pages 106-122.
  • Handle: RePEc:nos:voprob:2016:i:4:p:106-122
    as

    Download full text from publisher

    File URL: https://vo.hse.ru/data/2016/12/29/1114896849/Smirnov%282%29.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Valeria Ivaniushina & Daniil Alexandrov, 2013. "Anti-School Culture and Social Networks in Schools," Voprosy obrazovaniya / Educational Studies Moscow, National Research University Higher School of Economics, issue 2, pages 233-251.
    2. Alexander Krasilnikov & Maria Semenova, 2014. "Do Social Networks Help to Improve Student Academic Performance? The Case of Vk.com and Russian Students," Economics Bulletin, AccessEcon, vol. 34(2), pages 718-733.
    3. Mathieu Jacomy & Tommaso Venturini & Sebastien Heymann & Mathieu Bastian, 2014. "ForceAtlas2, a Continuous Graph Layout Algorithm for Handy Network Visualization Designed for the Gephi Software," PLOS ONE, Public Library of Science, vol. 9(6), pages 1-12, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Elena Kotyrlo, 2017. "Social network sites: What users post and to whom they address. Some approaches to the study," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 47, pages 74-99.
    2. Katerina Polivanova & Ivan Smirnov, 2017. "What's in My Profile: VKontakte Data as a Tool for Studying the Interests of Modern Teenagers," Voprosy obrazovaniya / Educational Studies Moscow, National Research University Higher School of Economics, issue 2, pages 134-152.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Florentin Gloetzl & Ernest Aigner, 2015. "Pluralism in the Market of Science? A citation network analysis of economic research at universities in Vienna," Ecological Economics Papers ieep5, Institute of Ecological Economics.
    2. Kamil Yilmaz, 2018. "Bank Volatility Connectedness in South East Asia," Koç University-TUSIAD Economic Research Forum Working Papers 1807, Koc University-TUSIAD Economic Research Forum.
    3. Costantini, Mauro & Maaitah, Ahmad & Mishra, Tapas & Sousa, Ricardo M., 2023. "Bitcoin market networks and cyberattacks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 630(C).
    4. Bilgin, Nuriye Melisa, 2025. "Inflation diffusion through supply chains," International Economics, Elsevier, vol. 184(C).
    5. Nassar S. Al-Nassar & Abdulrahman A. Albahouth, 2023. "Inflation Spillovers among Advanced and Emerging Economies: Evidence from the G20 Group," Economies, MDPI, vol. 11(4), pages 1-25, April.
    6. Yan, Weiwei & Zhang, Yin, 2018. "Research universities on the ResearchGate social networking site: An examination of institutional differences, research activity level, and social networks formed," Journal of Informetrics, Elsevier, vol. 12(1), pages 385-400.
    7. Mert Demirer & Francis X. Diebold & Laura Liu & Kamil Yilmaz, 2018. "Estimating global bank network connectedness," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 33(1), pages 1-15, January.
    8. Abricha, Amal & Ben Amar, Amine & Bellalah, Makram, 2024. "Commodity futures markets under stress and stress-free periods: Further insights from a quantile connectedness approach," The Quarterly Review of Economics and Finance, Elsevier, vol. 93(C), pages 229-246.
    9. Straulino, Daniel & Diodato, Dario & O’Clery, Neave, 2024. "Economic crisis, urban structural change and inter-sectoral labour mobility," Structural Change and Economic Dynamics, Elsevier, vol. 71(C), pages 135-144.
    10. Chen, Yanhua & Li, Youwei & Pantelous, Athanasios A. & Stanley, H. Eugene, 2022. "Short-run disequilibrium adjustment and long-run equilibrium in the international stock markets: A network-based approach," International Review of Financial Analysis, Elsevier, vol. 79(C).
    11. Davazdahemami, Behrooz & Kalgotra, Pankush & Zolbanin, Hamed M. & Delen, Dursun, 2023. "A developer-oriented recommender model for the app store: A predictive network analytics approach," Journal of Business Research, Elsevier, vol. 158(C).
    12. Cano-Marin, Enrique & Mora-Cantallops, Marçal & Sanchez-Alonso, Salvador, 2023. "The power of big data analytics over fake news: A scientometric review of Twitter as a predictive system in healthcare," Technological Forecasting and Social Change, Elsevier, vol. 190(C).
    13. Koch, Philipp & Schwarzbauer, Wolfgang, 2021. "Yet another space: Why the Industry Space adds value to the understanding of structural change and economic development," Structural Change and Economic Dynamics, Elsevier, vol. 59(C), pages 198-213.
    14. Mao Takongmo, Charles-O. & Touré, Adam, 2023. "Trade openness and connectedness of national productions: Do financial openness, economic specialization, and the size of the country matter?," Economic Modelling, Elsevier, vol. 125(C).
    15. Gomez-Gonzalez, Jose E. & Hirs-Garzon, Jorge & Uribe, Jorge M., 2022. "Spillovers beyond the variance: Exploring the higher order risk linkages between commodity markets and global financial markets," Journal of Commodity Markets, Elsevier, vol. 28(C).
    16. Colavizza, Giovanni & Franssen, Thomas & van Leeuwen, Thed, 2019. "An empirical investigation of the tribes and their territories: Are research specialisms rural and urban?," Journal of Informetrics, Elsevier, vol. 13(1), pages 105-117.
    17. Yongren Shi & Yisook Lim & Chan S Suh, 2018. "Innovation or deviation? The relationship between boundary crossing and audience evaluation in the music field," PLOS ONE, Public Library of Science, vol. 13(10), pages 1-17, October.
    18. Elsayed, Ahmed H. & Hammoudeh, Shawkat & Sousa, Ricardo M., 2021. "Inflation synchronization among the G7and China: The important role of oil inflation," Energy Economics, Elsevier, vol. 100(C).
    19. Gozuacik, Necip & Sakar, C. Okan & Ozcan, Sercan, 2023. "Technological forecasting based on estimation of word embedding matrix using LSTM networks," Technological Forecasting and Social Change, Elsevier, vol. 191(C).
    20. Luis de-Marcos & Manuel Goyanes & Adrián Domínguez-Díaz, 2024. "Mapping science through editorial board interlocking: connections and distance between fields of knowledge and institutional affiliations," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(6), pages 3385-3406, June.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nos:voprob:2016:i:4:p:106-122. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Marta Morozova (email available below). General contact details of provider: http://vo.hse.ru/en/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.