IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v102y2015i3d10.1007_s11192-014-1497-y.html
   My bibliography  Save this article

Clustering research group website homepages

Author

Listed:
  • Patrick Kenekayoro

    (University of Wolverhampton)

  • Kevan Buckley

    (University of Wolverhampton)

  • Mike Thelwall

    (University of Wolverhampton)

Abstract

The majority of early exploratory webometrics studies have typically used simple network methods or multi-dimensional scaling to identify hyperlink or text-based relationships between collections of related academic websites. This paper uses unsupervised machine learning techniques to identify groups of computer science departments with similar interests through co-word occurrences in the homepages of the departmental research groups. The clustering results reflect inter-department research similarity reasonably well, at least as reflected online. This clustering approach may be useful for policy makers in identifying future collaborators with similar research interests or for monitoring research fields.

Suggested Citation

  • Patrick Kenekayoro & Kevan Buckley & Mike Thelwall, 2015. "Clustering research group website homepages," Scientometrics, Springer;Akadémiai Kiadó, vol. 102(3), pages 2023-2039, March.
  • Handle: RePEc:spr:scient:v:102:y:2015:i:3:d:10.1007_s11192-014-1497-y
    DOI: 10.1007/s11192-014-1497-y
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-014-1497-y
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-014-1497-y?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Blaise Cronin & Herbert W. Snyder & Howard Rosenbaum & Anna Martinson & Ewa Callahan, 1998. "Invoked on the Web," Journal of the American Society for Information Science, Association for Information Science & Technology, vol. 49(14), pages 1319-1328.
    2. Peter van den Besselaar & Gaston Heimeriks, 2006. "Mapping research topics using word-reference co-occurrences: A method and an exploratory case study," Scientometrics, Springer;Akadémiai Kiadó, vol. 68(3), pages 377-393, September.
    3. Gohar Feroz Khan & Han Woo Park, 2011. "Measuring the triple helix on the web: Longitudinal trends in the university‐industry‐government relationship in Korea," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 62(12), pages 2443-2455, December.
    4. Franz Barjak & Mike Thelwall, 2008. "A statistical analysis of the web presences of European life sciences research teams," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 59(4), pages 628-643, February.
    5. Patrick Kenekayoro & Kevan Buckley & Mike Thelwall, 2014. "Automatic classification of academic web page types," Scientometrics, Springer;Akadémiai Kiadó, vol. 101(2), pages 1015-1026, November.
    6. Gohar Feroz Khan & Han Woo Park, 2011. "Measuring the triple helix on the web: Longitudinal trends in the university-industry-government relationship in Korea," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 62(12), pages 2443-2455, December.
    7. Otto Tuomaala & Kalervo Järvelin & Pertti Vakkari, 2014. "Evolution of library and information science, 1965–2005: Content analysis of journal articles," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 65(7), pages 1446-1462, July.
    8. Mike Thelwall & Antje Klitkou & Arnold Verbeek & David Stuart & Celine Vincent, 2010. "Policy-relevant Webometrics for individual scientific fields," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 61(7), pages 1464-1475, July.
    9. Seeber, Marco & Lepori, Benedetto & Lomi, Alessandro & Aguillo, Isidro & Barberio, Vitaliano, 2012. "Factors affecting web links between European higher education institutions," Journal of Informetrics, Elsevier, vol. 6(3), pages 435-447.
    10. Mike Thelwall, 2006. "Interpreting social science link analysis research: A theoretical framework," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 57(1), pages 60-68, January.
    11. Peters, H. P. F. & van Raan, A. F. J., 1993. "Co-word-based science maps of chemical engineering. Part I: Representations by direct multidimensional scaling," Research Policy, Elsevier, vol. 22(1), pages 23-45, February.
    12. Alesia Zuccala, 2006. "Author Cocitation Analysis is to intellectual structure as Web Colink Analysis is to …?," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 57(11), pages 1487-1502, September.
    13. Bart Thijs & Wolfgang Glänzel, 2010. "A structural analysis of collaboration between European research institutes," Research Evaluation, Oxford University Press, vol. 19(1), pages 55-65, March.
    14. Leydesdorff, Loet & Welbers, Kasper, 2011. "The semantic mapping of words and co-words in contexts," Journal of Informetrics, Elsevier, vol. 5(3), pages 469-475.
    15. Vaughan, Liwen & You, Justin, 2010. "Word co-occurrences on Webpages as a measure of the relatedness of organizations: A new Webometrics concept," Journal of Informetrics, Elsevier, vol. 4(4), pages 483-491.
    16. Schreiber, M. & Malesios, C.C. & Psarakis, S., 2012. "Exploratory factor analysis for the Hirsch index, 17 h-type variants, and some traditional bibliometric indicators," Journal of Informetrics, Elsevier, vol. 6(3), pages 347-358.
    17. Waltman, Ludo & van Eck, Nees Jan & Noyons, Ed C.M., 2010. "A unified approach to mapping and clustering of bibliometric networks," Journal of Informetrics, Elsevier, vol. 4(4), pages 629-635.
    18. Leydesdroff, Loet, 1989. "Words and co-words as indicators of intellectual organization," Research Policy, Elsevier, vol. 18(4), pages 209-223, August.
    19. Mike Thelwall & Alesia Zuccala, 2008. "A university-centred European Union link analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 75(3), pages 407-420, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Arash Hajikhani & Arho Suominen, 2022. "Mapping the sustainable development goals (SDGs) in science, technology and innovation: application of machine learning in SDG-oriented artefact detection," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(11), pages 6661-6693, November.
    2. Jungwon Yoon & Joshua SungWoo Yang & Han Woo Park, 2017. "Quintuple helix structure of Sino-Korean research collaboration in science," Scientometrics, Springer;Akadémiai Kiadó, vol. 113(1), pages 61-81, October.
    3. Gohar Feroz Khan & Sungjoon Lee & Ji Young Park & Han Woo Park, 2016. "Theories in communication science: a structural analysis using webometrics and social network approach," Scientometrics, Springer;Akadémiai Kiadó, vol. 108(2), pages 531-557, August.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. José-Antonio Ontalba-Ruipérez & Enrique Orduna-Malea & Adolfo Alonso-Arroyo, 2016. "Identifying institutional relationships in a geographically distributed public health system using interlinking and co-authorship methods," Scientometrics, Springer;Akadémiai Kiadó, vol. 106(3), pages 1167-1191, March.
    2. Rongying Zhao & Bikun Chen, 2014. "Applying author co-citation analysis to user interaction analysis: a case study on instant messaging groups," Scientometrics, Springer;Akadémiai Kiadó, vol. 101(2), pages 985-997, November.
    3. David Gunnarsson Lorentzen, 2014. "Webometrics benefitting from web mining? An investigation of methods and applications of two research fields," Scientometrics, Springer;Akadémiai Kiadó, vol. 99(2), pages 409-445, May.
    4. Muhammad Omar & Arif Mehmood & Gyu Sang Choi & Han Woo Park, 2017. "Global mapping of artificial intelligence in Google and Google Scholar," Scientometrics, Springer;Akadémiai Kiadó, vol. 113(3), pages 1269-1305, December.
    5. Pardeep Sud & Mike Thelwall, 2014. "Linked title mentions: a new automated link search candidate," Scientometrics, Springer;Akadémiai Kiadó, vol. 101(3), pages 1831-1849, December.
    6. Enrique Orduña-Malea & Adolfo Alonso-Arroyo & José-Antonio Ontalba-Ruipérez & Ferrán Catalá-López, 2023. "Evaluating the online impact of reporting guidelines for randomised trial reports and protocols: a cross-sectional web-based data analysis of CONSORT and SPIRIT initiatives," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(1), pages 407-440, January.
    7. Boschma, Ron & Heimeriks, Gaston & Balland, Pierre-Alexandre, 2014. "Scientific knowledge dynamics and relatedness in biotech cities," Research Policy, Elsevier, vol. 43(1), pages 107-114.
    8. Igors Skute & Kasia Zalewska-Kurek & Isabella Hatak & Petra Weerd-Nederhof, 2019. "Mapping the field: a bibliometric analysis of the literature on university–industry collaborations," The Journal of Technology Transfer, Springer, vol. 44(3), pages 916-947, June.
    9. Gohar Feroz Khan & Han Woo Park, 2012. "Editorial: Triple Helix and innovation in Asia using scientometrics, webometrics, and informetrics," Scientometrics, Springer;Akadémiai Kiadó, vol. 90(1), pages 1-7, January.
    10. Matthew A. Shapiro & Han Woo Park, 2012. "Regional development in South Korea: accounting for research area in centrality and networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 90(1), pages 271-287, January.
    11. Minjeong Kim & Han Woo Park, 2012. "Measuring Twitter-based political participation and deliberation in the South Korean context by using social network and Triple Helix indicators," Scientometrics, Springer;Akadémiai Kiadó, vol. 90(1), pages 121-140, January.
    12. Edwards-Schachter,Mónica & Wallace,Matthew, 2015. "âShaken, but not stirredâ: six decades defining social innovation," INGENIO (CSIC-UPV) Working Paper Series 201504, INGENIO (CSIC-UPV).
    13. Mêgnigbêto, Eustache, 2014. "Efficiency, unused capacity and transmission power as indicators of the Triple Helix of university–industry–government relationships," Journal of Informetrics, Elsevier, vol. 8(1), pages 284-294.
    14. Sabrina L. Woltmann & Lars Alkærsig, 2018. "Tracing university–industry knowledge transfer through a text mining approach," Scientometrics, Springer;Akadémiai Kiadó, vol. 117(1), pages 449-472, October.
    15. Enrique Orduña-Malea & José-Antonio Ontalba-Ruipérez, 2013. "Selective linking from social platforms to university websites: a case study of the Spanish academic system," Scientometrics, Springer;Akadémiai Kiadó, vol. 95(2), pages 593-614, May.
    16. Han Woo Park, 2014. "Mapping election campaigns through negative entropy: Triple and Quadruple Helix approach to South Korea’s 2012 presidential election," Scientometrics, Springer;Akadémiai Kiadó, vol. 99(1), pages 187-197, April.
    17. Moosung Lee & Han Woo Park, 2012. "Exploring the web visibility of world-class universities," Scientometrics, Springer;Akadémiai Kiadó, vol. 90(1), pages 201-218, January.
    18. Bar-Ilan, Judit, 2008. "Informetrics at the beginning of the 21st century—A review," Journal of Informetrics, Elsevier, vol. 2(1), pages 1-52.
    19. Jungwon Yoon, 2015. "The evolution of South Korea’s innovation system: moving towards the triple helix model?," Scientometrics, Springer;Akadémiai Kiadó, vol. 104(1), pages 265-293, July.
    20. Seeber, Marco & Lepori, Benedetto & Lomi, Alessandro & Aguillo, Isidro & Barberio, Vitaliano, 2012. "Factors affecting web links between European higher education institutions," Journal of Informetrics, Elsevier, vol. 6(3), pages 435-447.

    More about this item

    Keywords

    Webometrics; Unsupervised learning; Cluster analysis; Co-word analysis; Research group; Self-organising maps;
    All these keywords.

    JEL classification:

    • C63 - Mathematical and Quantitative Methods - - Mathematical Methods; Programming Models; Mathematical and Simulation Modeling - - - Computational Techniques
    • C80 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - General

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:102:y:2015:i:3:d:10.1007_s11192-014-1497-y. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.