IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v95y2013i3d10.1007_s11192-012-0871-x.html
   My bibliography  Save this article

The effect of database dirty data on h-index calculation

Author

Listed:
  • Fiorenzo Franceschini

    (Politecnico di Torino)

  • Domenico Maisano

    (Politecnico di Torino)

  • Luca Mastrogiacomo

    (Politecnico di Torino)

Abstract

As all databases, the bibliometric ones (e.g. Scopus, Web of Knowledge and Google Scholar) are not exempt from errors, such as missing or wrong records, which may obviously affect publication/citation statistics and—more in general—the resulting bibliometric indicators. This paper tries to answer to the question “What is the effect of database uncertainty on the evaluation of the h-index?”, breaking the paradigm of deterministic database analysis and treating responses to database queries as random variables. Precisely an informetric model of the h-index is used to quantify the variability of this indicator with respect to the variability stemming from errors in database records. Some preliminary results are presented and discussed.

Suggested Citation

  • Fiorenzo Franceschini & Domenico Maisano & Luca Mastrogiacomo, 2013. "The effect of database dirty data on h-index calculation," Scientometrics, Springer;Akadémiai Kiadó, vol. 95(3), pages 1179-1188, June.
  • Handle: RePEc:spr:scient:v:95:y:2013:i:3:d:10.1007_s11192-012-0871-x
    DOI: 10.1007/s11192-012-0871-x
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-012-0871-x
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-012-0871-x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. repec:ebl:ecbull:v:3:y:2008:i:78:p:1-9 is not listed on IDEAS
    2. Franceschini, Fiorenzo & Maisano, Domenico A., 2010. "Analysis of the Hirsch index's operational properties," European Journal of Operational Research, Elsevier, vol. 203(2), pages 494-504, June.
    3. Leo Egghe & Ronald Rousseau, 2006. "An informetric model for the Hirsch-index," Scientometrics, Springer;Akadémiai Kiadó, vol. 69(1), pages 121-129, October.
    4. Anthony F. J. Raan, 2006. "Comparison of the Hirsch-index with standard bibliometric indicators and with peer judgment for 147 chemistry research groups," Scientometrics, Springer;Akadémiai Kiadó, vol. 67(3), pages 491-502, June.
    5. Fiorenzo Franceschini & Maurizio Galetto & Domenico Maisano & Luca Mastrogiacomo, 2012. "The success-index: an alternative approach to the h-index for evaluating an individual’s research output," Scientometrics, Springer;Akadémiai Kiadó, vol. 92(3), pages 621-641, September.
    6. Monika Henzinger & Jacob Suñol & Ingmar Weber, 2010. "The stability of the h-index," Scientometrics, Springer;Akadémiai Kiadó, vol. 84(2), pages 465-479, August.
    7. Jean-Michel Courtault & Naïla Hayek, 2008. "On the Robustness of the h-index: a mathematical approach," Economics Bulletin, AccessEcon, vol. 3(78), pages 1-9.
    8. Bar-Ilan, Judit & Levene, Mark & Lin, Ayelet, 2007. "Some measures for comparing citation databases," Journal of Informetrics, Elsevier, vol. 1(1), pages 26-34.
    9. Tibor Braun & Wolfgang Glänzel & András Schubert, 2006. "A Hirsch-type index for journals," Scientometrics, Springer;Akadémiai Kiadó, vol. 69(1), pages 169-173, October.
    10. Lutz Bornmann & Hans-Dieter Daniel, 2005. "Does the h-index for ranking of scientists really work?," Scientometrics, Springer;Akadémiai Kiadó, vol. 65(3), pages 391-392, December.
    11. Alonso, S. & Cabrerizo, F.J. & Herrera-Viedma, E. & Herrera, F., 2009. "h-Index: A review focused in its variants, computation and standardization for different scientific fields," Journal of Informetrics, Elsevier, vol. 3(4), pages 273-289.
    12. Franceschini, Fiorenzo & Maisano, Domenico, 2010. "The Hirsch spectrum: A novel tool for analyzing scientific journals," Journal of Informetrics, Elsevier, vol. 4(1), pages 64-73.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Fiorenzo Franceschini & Domenico Maisano & Luca Mastrogiacomo, 2015. "Errors in DOI indexing by bibliometric databases," Scientometrics, Springer;Akadémiai Kiadó, vol. 102(3), pages 2181-2186, March.
    2. Malesios, C., 2016. "Measuring the robustness of the journal h-index with respect to publication and citation values: A Bayesian sensitivity analysis," Journal of Informetrics, Elsevier, vol. 10(3), pages 719-731.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Bornmann, Lutz & Marx, Werner, 2012. "HistCite analysis of papers constituting the h index research front," Journal of Informetrics, Elsevier, vol. 6(2), pages 285-288.
    2. Fiorenzo Franceschini & Domenico Maisano, 2011. "Bibliometric positioning of scientific manufacturing journals: a comparative analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 86(2), pages 463-485, February.
    3. Franceschini, Fiorenzo & Maisano, Domenico, 2010. "The citation triad: An overview of a scientist's publication output based on Ferrers diagrams," Journal of Informetrics, Elsevier, vol. 4(4), pages 503-511.
    4. Deming Lin & Tianhui Gong & Wenbin Liu & Martin Meyer, 2020. "An entropy-based measure for the evolution of h index research," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(3), pages 2283-2298, December.
    5. Franceschini, Fiorenzo & Maisano, Domenico, 2011. "Structured evaluation of the scientific output of academic research groups by recent h-based indicators," Journal of Informetrics, Elsevier, vol. 5(1), pages 64-74.
    6. L. Egghe, 2011. "The influence of random removal of sources and items on the h-index," Scientometrics, Springer;Akadémiai Kiadó, vol. 88(2), pages 363-370, August.
    7. Anna Tietze & Philip Hofmann, 2019. "The h-index and multi-author hm-index for individual researchers in condensed matter physics," Scientometrics, Springer;Akadémiai Kiadó, vol. 119(1), pages 171-185, April.
    8. Antonis Sidiropoulos & Dimitrios Katsaros & Yannis Manolopoulos, 2007. "Generalized Hirsch h-index for disclosing latent facts in citation networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 72(2), pages 253-280, August.
    9. Johannes Hönekopp & Julie Khan, 2012. "Future publication success in science is better predicted by traditional measures than by the h index," Scientometrics, Springer;Akadémiai Kiadó, vol. 90(3), pages 843-853, March.
    10. Fiorenzo Franceschini & Domenico Maisano & Anna Perotti & Andrea Proto, 2010. "Analysis of the ch-index: an indicator to evaluate the diffusion of scientific research output by citers," Scientometrics, Springer;Akadémiai Kiadó, vol. 85(1), pages 203-217, October.
    11. Leo Egghe & Ronald Rousseau, 2021. "The h-index formalism," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(7), pages 6137-6145, July.
    12. Fiorenzo Franceschini & Domenico Maisano, 2011. "Proposals for evaluating the regularity of a scientist’s research output," Scientometrics, Springer;Akadémiai Kiadó, vol. 88(1), pages 279-295, July.
    13. Judit Bar-Ilan, 2008. "The h-index of h-index and of other informetric topics," Scientometrics, Springer;Akadémiai Kiadó, vol. 75(3), pages 591-605, June.
    14. Judit Bar-Ilan, 2008. "Which h-index? — A comparison of WoS, Scopus and Google Scholar," Scientometrics, Springer;Akadémiai Kiadó, vol. 74(2), pages 257-271, February.
    15. Alonso, S. & Cabrerizo, F.J. & Herrera-Viedma, E. & Herrera, F., 2009. "h-Index: A review focused in its variants, computation and standardization for different scientific fields," Journal of Informetrics, Elsevier, vol. 3(4), pages 273-289.
    16. Fiorenzo Franceschini & Maurizio Galetto & Domenico Maisano & Luca Mastrogiacomo, 2012. "The success-index: an alternative approach to the h-index for evaluating an individual’s research output," Scientometrics, Springer;Akadémiai Kiadó, vol. 92(3), pages 621-641, September.
    17. Maziar Montazerian & Edgar Dutra Zanotto & Hellmut Eckert, 2019. "A new parameter for (normalized) evaluation of H-index: countries as a case study," Scientometrics, Springer;Akadémiai Kiadó, vol. 118(3), pages 1065-1078, March.
    18. Parul Khurana & Kiran Sharma, 2022. "Impact of h-index on author’s rankings: an improvement to the h-index for lower-ranked authors," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(8), pages 4483-4498, August.
    19. Xia Gao & Jiancheng Guan, 2012. "Network model of knowledge diffusion," Scientometrics, Springer;Akadémiai Kiadó, vol. 90(3), pages 749-762, March.
    20. Mingers, John & Yang, Liying, 2017. "Evaluating journal quality: A review of journal citation indicators and ranking in business and management," European Journal of Operational Research, Elsevier, vol. 257(1), pages 323-337.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:95:y:2013:i:3:d:10.1007_s11192-012-0871-x. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.