IDEAS home Printed from https://ideas.repec.org/a/eee/infome/v9y2015i2p273-284.html
   My bibliography  Save this article

Problems and challenges of information resources producers’ clustering

Author

Listed:
  • Cena, Anna
  • Gagolewski, Marek
  • Mesiar, Radko

Abstract

Classically, unsupervised machine learning techniques are applied on data sets with fixed number of attributes (variables). However, many problems encountered in the field of informetrics face us with the need to extend these kinds of methods in a way such that they may be computed over a set of nonincreasingly ordered vectors of unequal lengths. Thus, in this paper, some new dissimilarity measures (metrics) are introduced and studied. Owing to that we may use, e.g. hierarchical clustering algorithms in order to determine an input data set's partition consisting of sets of producers that are homogeneous not only with respect to the quality of information resources, but also their quantity.

Suggested Citation

  • Cena, Anna & Gagolewski, Marek & Mesiar, Radko, 2015. "Problems and challenges of information resources producers’ clustering," Journal of Informetrics, Elsevier, vol. 9(2), pages 273-284.
  • Handle: RePEc:eee:infome:v:9:y:2015:i:2:p:273-284
    DOI: 10.1016/j.joi.2015.02.005
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S1751157715000231
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.joi.2015.02.005?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Ortega, José Luis & López-Romero, Elena & Fernández, Inés, 2011. "Multivariate approach to classify research institutes according to their outputs: The case of the CSIC's institutes," Journal of Informetrics, Elsevier, vol. 5(3), pages 323-332.
    2. Woeginger, Gerhard J., 2008. "An axiomatic characterization of the Hirsch-index," Mathematical Social Sciences, Elsevier, vol. 56(2), pages 224-232, September.
    3. Rodrigo Costas & Thed N. van Leeuwen & María Bordons, 2010. "A bibliometric classificatory approach for the study and assessment of research performance at the individual level: The effects of age on productivity and impact," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 61(8), pages 1564-1581, August.
    4. Gagolewski, Marek & Mesiar, Radko, 2012. "Aggregating different paper quality measures with a generalized h-index," Journal of Informetrics, Elsevier, vol. 6(4), pages 566-579.
    5. Gagolewski, Marek, 2013. "Scientific impact assessment cannot be fair," Journal of Informetrics, Elsevier, vol. 7(4), pages 792-802.
    6. Ying Cheng & Nian Cai Liu, 2006. "A first approach to the classification of the top 500 world universities by their disciplinary characteristics using scientometrics," Scientometrics, Springer;Akadémiai Kiadó, vol. 68(1), pages 135-150, July.
    7. Gagolewski, Marek, 2011. "Bibliometric impact assessment with R and the CITAN package," Journal of Informetrics, Elsevier, vol. 5(4), pages 678-692.
    8. Rodrigo Costas & Thed N. van Leeuwen & María Bordons, 2010. "A bibliometric classificatory approach for the study and assessment of research performance at the individual level: The effects of age on productivity and impact," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 61(8), pages 1564-1581, August.
    9. Leo Egghe, 2006. "Theory and practise of the g-index," Scientometrics, Springer;Akadémiai Kiadó, vol. 69(1), pages 131-152, October.
    10. Nieminen, Paavo & Pölönen, Ilkka & Sipola, Tuomo, 2013. "Research literature clustering using diffusion maps," Journal of Informetrics, Elsevier, vol. 7(4), pages 874-886.
    11. Schreiber, Michael, 2013. "A case study of the arbitrariness of the h-index and the highly-cited-publications indicator," Journal of Informetrics, Elsevier, vol. 7(2), pages 379-387.
    12. Glenn Milligan, 1979. "Ultrametric hierarchical clustering algorithms," Psychometrika, Springer;The Psychometric Society, vol. 44(3), pages 343-346, September.
    13. Waltman, Ludo & van Eck, Nees Jan & Noyons, Ed C.M., 2010. "A unified approach to mapping and clustering of bibliometric networks," Journal of Informetrics, Elsevier, vol. 4(4), pages 629-635.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lutz Bornmann & Werner Marx, 2014. "How to evaluate individual researchers working in the natural and life sciences meaningfully? A proposal of methods based on percentiles of citations," Scientometrics, Springer;Akadémiai Kiadó, vol. 98(1), pages 487-509, January.
    2. Wildgaard, Lorna, 2016. "A critical cluster analysis of 44 indicators of author-level performance," Journal of Informetrics, Elsevier, vol. 10(4), pages 1055-1078.
    3. Bouyssou, Denis & Marchant, Thierry, 2014. "An axiomatic approach to bibliometric rankings and indices," Journal of Informetrics, Elsevier, vol. 8(3), pages 449-477.
    4. Leydesdorff, Loet & Bornmann, Lutz & Zhou, Ping, 2016. "Construction of a pragmatic base line for journal classifications and maps based on aggregated journal-journal citation relations," Journal of Informetrics, Elsevier, vol. 10(4), pages 902-918.
    5. Díaz-Faes, Adrián A. & Costas, Rodrigo & Galindo, M. Purificación & Bordons, María, 2015. "Unravelling the performance of individual scholars: Use of Canonical Biplot analysis to explore the performance of scientists by academic rank and scientific field," Journal of Informetrics, Elsevier, vol. 9(4), pages 722-733.
    6. Marek Gagolewski & Barbara Żogała-Siudem & Grzegorz Siudem & Anna Cena, 2022. "Ockham’s index of citation impact," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(5), pages 2829-2845, May.
    7. Rodrigo Costas & Thomas Franssen, 2018. "Reflections around ‘the cautionary use’ of the h-index: response to Teixeira da Silva and Dobránszki," Scientometrics, Springer;Akadémiai Kiadó, vol. 115(2), pages 1125-1130, May.
    8. Żogała-Siudem, Barbara & Cena, Anna & Siudem, Grzegorz & Gagolewski, Marek, 2023. "Interpretable reparameterisations of citation models," Journal of Informetrics, Elsevier, vol. 17(1).
    9. Vîiu, Gabriel-Alexandru, 2017. "Disaggregated research evaluation through median-based characteristic scores and scales: a comparison with the mean-based approach," Journal of Informetrics, Elsevier, vol. 11(3), pages 748-765.
    10. Gabriel-Alexandru Vîiu & Mihai Păunescu & Adrian Miroiu, 2016. "Research-driven classification and ranking in higher education: an empirical appraisal of a Romanian policy experience," Scientometrics, Springer;Akadémiai Kiadó, vol. 107(2), pages 785-805, May.
    11. Gagolewski, Marek & Mesiar, Radko, 2012. "Aggregating different paper quality measures with a generalized h-index," Journal of Informetrics, Elsevier, vol. 6(4), pages 566-579.
    12. Vîiu, Gabriel-Alexandru, 2016. "A theoretical evaluation of Hirsch-type bibliometric indicators confronted with extreme self-citation," Journal of Informetrics, Elsevier, vol. 10(2), pages 552-566.
    13. Marek Kwiek & Wojciech Roszka, 2022. "Academic vs. biological age in research on academic careers: a large-scale study with implications for scientifically developing systems," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(6), pages 3543-3575, June.
    14. Ash Mohammad Abbas, 2011. "Weighted indices for evaluating the quality of research with multiple authorship," Scientometrics, Springer;Akadémiai Kiadó, vol. 88(1), pages 107-131, July.
    15. Rodrigo Costas & María Bordons, 2011. "Do age and professional rank influence the order of authorship in scientific publications? Some evidence from a micro-level perspective," Scientometrics, Springer;Akadémiai Kiadó, vol. 88(1), pages 145-161, July.
    16. Claus-Christian Carbon, 2011. "The Carbon_h-Factor: Predicting Individuals' Research Impact at Early Stages of Their Career," PLOS ONE, Public Library of Science, vol. 6(12), pages 1-7, December.
    17. Rodrigo Costas & Thed N. Leeuwen & María Bordons, 2012. "Referencing patterns of individual researchers: Do top scientists rely on more extensive information sources?," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 63(12), pages 2433-2450, December.
    18. Ying Huang & Wolfgang Glänzel & Lin Zhang, 2021. "Tracing the development of mapping knowledge domains," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(7), pages 6201-6224, July.
    19. Giancarlo Ruocco & Cinzia Daraio, 2013. "An empirical approach to compare the performance of heterogeneous academic fields," Scientometrics, Springer;Akadémiai Kiadó, vol. 97(3), pages 601-625, December.
    20. Giovanni Abramo & Ciriaco Andrea D’Angelo, 2011. "National-scale research performance assessment at the individual level," Scientometrics, Springer;Akadémiai Kiadó, vol. 86(2), pages 347-364, February.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:infome:v:9:y:2015:i:2:p:273-284. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/joi .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.