
Webometrics benefitting from web mining? An investigation of methods and applications of two research fields

Author

  • David Gunnarsson Lorentzen (University of Borås)

Abstract

Webometrics and web mining are two fields where research is focused on quantitative analyses of the web. This literature review outlines definitions of the fields, and then focuses on their methods and applications. It also discusses the potential of closer contact and collaboration between them. A key difference between the fields is that webometrics has focused on exploratory studies, whereas web mining has been dominated by studies focusing on development of methods and algorithms. Differences in type of data can also be seen, with webometrics more focused on analyses of the structure of the web and web mining more focused on web content and usage, even though both fields have been embracing the possibilities of user generated content. It is concluded that research problems where big data is needed can benefit from collaboration between webometricians, with their tradition of exploratory studies, and web miners, with their tradition of developing methods and algorithms.

Suggested Citation

  • David Gunnarsson Lorentzen, 2014. "Webometrics benefitting from web mining? An investigation of methods and applications of two research fields," Scientometrics, Springer;Akadémiai Kiadó, vol. 99(2), pages 409-445, May.
  • Handle: RePEc:spr:scient:v:99:y:2014:i:2:d:10.1007_s11192-013-1227-x
    DOI: 10.1007/s11192-013-1227-x

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-013-1227-x
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.



    Citations

    Cited by:

    1. Raheem Sarwar & Afifa Zia & Raheel Nawaz & Ayman Fayoumi & Naif Radi Aljohani & Saeed-Ul Hassan, 2021. "Webometrics: evolution of social media presence of universities," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(2), pages 951-967, February.
    2. George Masterton & Erik J. Olsson & Staffan Angere, 2016. "Linking as voting: how the Condorcet jury theorem in political science is relevant to webometrics," Scientometrics, Springer;Akadémiai Kiadó, vol. 106(3), pages 945-966, March.
    3. Ton Mooij, 2015. "Exploring a prototype framework of web-based and peer-reviewed “European Educational Research Quality Indicators” (EERQI)," Scientometrics, Springer;Akadémiai Kiadó, vol. 102(1), pages 1037-1055, January.
    4. Weiai Xu & I-Hsuan Chiu & Yixin Chen & Tanuka Mukherjee, 2015. "Twitter hashtags for health: applying network and content analyses to understand the health knowledge sharing in a Twitter-based community of practice," Quality & Quantity: International Journal of Methodology, Springer, vol. 49(4), pages 1361-1380, July.
    5. Jozef Kapusta & Michal Munk & Martin Drlik, 2018. "Website Structure Improvement Based on the Combination of Selected Web Structure and Web Usage Mining Methods," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 17(06), pages 1743-1776, November.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Pardeep Sud & Mike Thelwall, 2014. "Linked title mentions: a new automated link search candidate," Scientometrics, Springer;Akadémiai Kiadó, vol. 101(3), pages 1831-1849, December.
    2. Bar-Ilan, Judit, 2008. "Informetrics at the beginning of the 21st century—A review," Journal of Informetrics, Elsevier, vol. 2(1), pages 1-52.
    3. Liwen Vaughan, 2016. "Uncovering information from social media hyperlinks: An investigation of twitter," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 67(5), pages 1105-1120, May.
    4. José-Antonio Ontalba-Ruipérez & Enrique Orduna-Malea & Adolfo Alonso-Arroyo, 2016. "Identifying institutional relationships in a geographically distributed public health system using interlinking and co-authorship methods," Scientometrics, Springer;Akadémiai Kiadó, vol. 106(3), pages 1167-1191, March.
    5. Vaughan, Liwen & Yang, Rongbin, 2013. "Web traffic and organization performance measures: Relationships and data sources examined," Journal of Informetrics, Elsevier, vol. 7(3), pages 699-711.
    6. Enrique Orduña-Malea, 2021. "Dot-science top level domain: Academic websites or dumpsites?," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(4), pages 3565-3591, April.
    7. Lepori, Benedetto & Barberio, Vitaliano & Seeber, Marco & Aguillo, Isidro, 2013. "Core–periphery structures in national higher education systems. A cross-country analysis using interlinking data," Journal of Informetrics, Elsevier, vol. 7(3), pages 622-634.
    8. Enrique Orduna-Malea & Selenay Aytac, 2015. "Revealing the online network between university and industry: the case of Turkey," Scientometrics, Springer;Akadémiai Kiadó, vol. 105(3), pages 1849-1866, December.
    9. Patrick Kenekayoro & Kevan Buckley & Mike Thelwall, 2015. "Clustering research group website homepages," Scientometrics, Springer;Akadémiai Kiadó, vol. 102(3), pages 2023-2039, March.
    10. Benedetto Lepori & Isidro F. Aguillo & Marco Seeber, 2014. "Size of web domains and interlinking behavior of higher education institutions in Europe," Scientometrics, Springer;Akadémiai Kiadó, vol. 100(2), pages 497-518, August.
    11. Amalia Mas-Bleda & Mike Thelwall & Kayvan Kousha & Isidro F. Aguillo, 2014. "Do highly cited researchers successfully use the social web?," Scientometrics, Springer;Akadémiai Kiadó, vol. 101(1), pages 337-356, October.
    12. Ping-Yu Hsu & Hong-Tsuen Lei & Shih-Hsiang Huang & Teng Hao Liao & Yao-Chung Lo & Chin-Chun Lo, 2019. "Effects of sentiment on recommendations in social network," Electronic Markets, Springer;IIM University of St. Gallen, vol. 29(2), pages 253-262, June.
    13. Chung Joo Chung & Han Woo Park, 2012. "Web visibility of scholars in media and communication journals," Scientometrics, Springer;Akadémiai Kiadó, vol. 93(1), pages 207-215, October.
    14. Stefan Stieglitz & Christian Meske & Björn Ross & Milad Mirbabaie, 2020. "Going Back in Time to Predict the Future - The Complex Role of the Data Collection Period in Social Media Analytics," Information Systems Frontiers, Springer, vol. 22(2), pages 395-409, April.
    15. Mike Thelwall & Alesia Zuccala, 2008. "A university-centred European Union link analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 75(3), pages 407-420, June.
    16. George Masterton & Erik J. Olsson & Staffan Angere, 2016. "Linking as voting: how the Condorcet jury theorem in political science is relevant to webometrics," Scientometrics, Springer;Akadémiai Kiadó, vol. 106(3), pages 945-966, March.
    17. Young Mee Chung & So Young Yu & Yong Kwang Kim & Su Yeon Kim, 2009. "Characteristics and link structure of a national scholarly Web space: The case of South Korea," Scientometrics, Springer;Akadémiai Kiadó, vol. 80(3), pages 595-612, September.
    18. Sujin Choi & Ji-young Park & Han Woo Park, 2012. "Using social media data to explore communication processes within South Korean online innovation communities," Scientometrics, Springer;Akadémiai Kiadó, vol. 90(1), pages 43-56, January.
    19. Junwei Ma & Jianhua Wang & Philip Szmedra, 2019. "Sustainable Competitive Position of Mobile Communication Companies: Comprehensive Perspectives of Insiders and Outsiders," Sustainability, MDPI, vol. 11(7), pages 1-15, April.
    20. Liwen Vaughan & Esteban Romero-Frías, 2012. "Exploring Web keyword analysis as an alternative to link analysis: a multi-industry case," Scientometrics, Springer;Akadémiai Kiadó, vol. 93(1), pages 217-232, October.
