IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v58y2003i3d10.1023_bscie.0000006884.08036.73.html
   My bibliography  Save this article

Data mining in a closed Web environment

Author

Listed:
  • Cristina Faba-Pérez

    (University of Extremadura Alcazaba de Badajoz (Antiguo Hospital Militar))

  • Vicente P. Guerrero-Bote

    (University of Extremadura Alcazaba de Badajoz (Antiguo Hospital Militar))

  • Félix De Moya-Anegón

    (University of Granada Campus Cartuja, Colegio Máximo)

Abstract

The need to understand the fabric of relationships that are building up on the World Wide Web calls for the application of tools that allow one to extract the underlying knowledge. Some of the most interesting relationships are those that are brought to light by co-linking analysis (the Web analogue of cocitation analysis). We here propose such an analysis based on the co-links that are generated within a closed web environment, using multivariate statistics (Principal Component Analysis, and Multidimensional Scaling) and a connection-based technique (Kohonen's Self-Organizing Maps). An application was made to a generic thematic environment, and the underlying relationships and structures were manifest in the interpretation of the results.

Suggested Citation

  • Cristina Faba-Pérez & Vicente P. Guerrero-Bote & Félix De Moya-Anegón, 2003. "Data mining in a closed Web environment," Scientometrics, Springer;Akadémiai Kiadó, vol. 58(3), pages 623-640, November.
  • Handle: RePEc:spr:scient:v:58:y:2003:i:3:d:10.1023_b:scie.0000006884.08036.73
    DOI: 10.1023/B:SCIE.0000006884.08036.73
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1023/B:SCIE.0000006884.08036.73
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1023/B:SCIE.0000006884.08036.73?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Hsinchun Chen & Andrea L. Houston & Robin R. Sewell & Bruce R. Schatz, 1998. "Internet browsing and searching: User evaluations of category map and concept space techniques," Journal of the American Society for Information Science, Association for Information Science & Technology, vol. 49(7), pages 582-603, May.
    2. Hak Joon Kim, 2000. "Motivations for hyperlinking in scholarly electronic articles: A qualitative study," Journal of the American Society for Information Science, Association for Information Science & Technology, vol. 51(10), pages 887-899.
    3. Howard D. White, 1981. "Cocited author retrieval online: An experiment with the social indicators literature," Journal of the American Society for Information Science, Association for Information Science & Technology, vol. 32(1), pages 16-21, January.
    4. Henry Small, 1973. "Co‐citation in the scientific literature: A new measure of the relationship between two documents," Journal of the American Society for Information Science, Association for Information Science & Technology, vol. 24(4), pages 265-269, July.
    5. Lennart Björneborn & Peter Ingwersen, 2001. "Perspective of webometrics," Scientometrics, Springer;Akadémiai Kiadó, vol. 50(1), pages 65-82, January.
    6. Larsen, Jan & Hansen, Lars Kai & Have, Anna Szymkowiak & Christiansen, Torben & Kolenda, Thomas, 2002. "Webmining: learning from the world wide web," Computational Statistics & Data Analysis, Elsevier, vol. 38(4), pages 517-532, February.
    7. Hui‐Min Chen & Michael D. Cooper, 2001. "Using clustering techniques to detect usage patterns in a Web‐based information system," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 52(11), pages 888-904.
    8. Vicente P. Guerrero & Félix de Moya Anegón, 2001. "Reduction of the dimension of a document space using the fuzzified output of a Kohonen network," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 52(14), pages 1234-1241.
    9. Ying Ding & Gobinda G. Chowdhury & Schubert Foo, 2000. "Journal as Markers of Intellectual Space: Journal Co-Citation Analysis of Information Retrieval Area, 1987–1997," Scientometrics, Springer;Akadémiai Kiadó, vol. 47(1), pages 55-73, January.
    10. Howard D. White & Belver C. Griffith, 1981. "Author cocitation: A literature measure of intellectual structure," Journal of the American Society for Information Science, Association for Information Science & Technology, vol. 32(3), pages 163-171, May.
    11. Anthony F.J. van Raan, 2001. "Bibliometrics and internet: Some observations and expectations," Scientometrics, Springer;Akadémiai Kiadó, vol. 50(1), pages 59-63, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Junping Qiu & Yejun Li & Jiang Li & Quane Ren, 2008. "An exploratory study on substantive co-link analysis: A modification to total co-link analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 76(2), pages 327-341, August.
    2. Feng Zhou & Huai-Cheng Guo & Yuh-Shan Ho & Chao-Zhong Wu, 2007. "Scientometric analysis of geostatistics using multivariate methods," Scientometrics, Springer;Akadémiai Kiadó, vol. 73(3), pages 265-279, December.
    3. Enrique Orduna-Malea & Selenay Aytac, 2015. "Revealing the online network between university and industry: the case of Turkey," Scientometrics, Springer;Akadémiai Kiadó, vol. 105(3), pages 1849-1866, December.
    4. Qiang Yao & Peng-Hui Lyu & Lian-Ping Yang & Lan Yao & Zhi-Yong Liu, 2014. "Current performance and future trends in health care sciences and services research," Scientometrics, Springer;Akadémiai Kiadó, vol. 101(1), pages 751-779, October.
    5. Peng Hui Lv & Gui-Fang Wang & Yong Wan & Jia Liu & Qing Liu & Fei-cheng Ma, 2011. "Bibliometric trend analysis on global graphene research," Scientometrics, Springer;Akadémiai Kiadó, vol. 88(2), pages 399-419, August.
    6. Fei-Cheng Ma & Peng-Hui Lyu & Qiang Yao & Lan Yao & Shi-Jing Zhang, 2014. "Publication trends and knowledge maps of global translational medicine research," Scientometrics, Springer;Akadémiai Kiadó, vol. 98(1), pages 221-246, January.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Markus Gmür, 2003. "Co-citation analysis and the search for invisible colleges: A methodological evaluation," Scientometrics, Springer;Akadémiai Kiadó, vol. 57(1), pages 27-57, January.
    2. Worapan Kusakunniran & Amit Singh Dahal & Wantanee Viriyasitavat, 2018. "Journal Co-Citation Analysis for Identifying Trends of Inter-Disciplinary Research: An Exploratory Case Study in a University," Journal of Information & Knowledge Management (JIKM), World Scientific Publishing Co. Pte. Ltd., vol. 17(04), pages 1-22, December.
    3. Li, Kai & Yan, Erjia, 2018. "Co-mention network of R packages: Scientific impact and clustering structure," Journal of Informetrics, Elsevier, vol. 12(1), pages 87-100.
    4. Pin Li & Guoli Yang & Chuanqi Wang, 2019. "Visual topical analysis of library and information science," Scientometrics, Springer;Akadémiai Kiadó, vol. 121(3), pages 1753-1791, December.
    5. Tsung Teng Chen, 2012. "The development and empirical study of a literature review aiding system," Scientometrics, Springer;Akadémiai Kiadó, vol. 92(1), pages 105-116, July.
    6. Gaviria-Marin, Magaly & Merigó, José M. & Baier-Fuentes, Hugo, 2019. "Knowledge management: A global examination based on bibliometric analysis," Technological Forecasting and Social Change, Elsevier, vol. 140(C), pages 194-220.
    7. Pamela E. Sandstrom, 2001. "Scholarly communication as a socioecological system," Scientometrics, Springer;Akadémiai Kiadó, vol. 51(3), pages 573-605, July.
    8. Dixit, Aasheesh & Jakhar, Suresh Kumar, 2021. "Airport capacity management: A review and bibliometric analysis," Journal of Air Transport Management, Elsevier, vol. 91(C).
    9. Zhong, Xiang & Liu, Jiajun & Gao, Yong & Wu, Lun, 2017. "Analysis of co-occurrence toponyms in web pages based on complex networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 466(C), pages 462-475.
    10. Jianhua Hou, 2017. "Exploration into the evolution and historical roots of citation analysis by referenced publication year spectroscopy," Scientometrics, Springer;Akadémiai Kiadó, vol. 110(3), pages 1437-1452, March.
    11. Yu-Wei Chang & Mu-Hsuan Huang & Chiao-Wen Lin, 2015. "Evolution of research subjects in library and information science based on keyword, bibliographical coupling, and co-citation analyses," Scientometrics, Springer;Akadémiai Kiadó, vol. 105(3), pages 2071-2087, December.
    12. Ying Huang & Wolfgang Glänzel & Lin Zhang, 2021. "Tracing the development of mapping knowledge domains," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(7), pages 6201-6224, July.
    13. João J. M. Ferreira & Cristina I. Fernandes & Sascha Kraus, 2019. "Entrepreneurship research: mapping intellectual structures and research trends," Review of Managerial Science, Springer, vol. 13(1), pages 181-205, February.
    14. Carlos Olmeda-Gómez & Maria-Antonia Ovalle-Perandones & Antonio Perianes-Rodríguez, 2017. "Co-word analysis and thematic landscapes in Spanish information science literature, 1985–2014," Scientometrics, Springer;Akadémiai Kiadó, vol. 113(1), pages 195-217, October.
    15. Masaki Eto, 2013. "Evaluations of context-based co-citation searching," Scientometrics, Springer;Akadémiai Kiadó, vol. 94(2), pages 651-673, February.
    16. Jun-Ping Qiu & Ke Dong & Hou-Qiang Yu, 2014. "Comparative study on structure and correlation among author co-occurrence networks in bibliometrics," Scientometrics, Springer;Akadémiai Kiadó, vol. 101(2), pages 1345-1360, November.
    17. Georg Groh & Christoph Fuchs, 2011. "Multi-modal social networks for modeling scientific fields," Scientometrics, Springer;Akadémiai Kiadó, vol. 89(2), pages 569-590, November.
    18. Pamela E. Sandstrom, 2001. "Scholarly communication as a socioecological system," Scientometrics, Springer;Akadémiai Kiadó, vol. 50(3), pages 573-605, January.
    19. Kim, Ha Jin & Jeong, Yoo Kyung & Song, Min, 2016. "Content- and proximity-based author co-citation analysis using citation sentences," Journal of Informetrics, Elsevier, vol. 10(4), pages 954-966.
    20. Xuerong Li & Han Qiao & Shouyang Wang, 2017. "Exploring evolution and emerging trends in business model study: a co-citation analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(2), pages 869-887, May.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:58:y:2003:i:3:d:10.1023_b:scie.0000006884.08036.73. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.