
Do Scopus and WoS correct “old” omitted citations?

Author

Listed:
  • Fiorenzo Franceschini

    (Politecnico di Torino)

  • Domenico Maisano

    (Politecnico di Torino)

  • Luca Mastrogiacomo

    (Politecnico di Torino)

Abstract

Omitted citations, i.e., missing links between a cited paper and the corresponding citing papers, are a consequence of several bibliometric-database errors. To reduce these errors, databases may take two actions: (1) improving the control of the (new) papers to be indexed, i.e., limiting the introduction of “new” dirty data, and (2) detecting and correcting errors in the papers already indexed by the database, i.e., cleaning “old” dirty data. The latter action is probably more complicated, as it requires applying suitable error-detection procedures to a huge amount of data. Based on an extensive sample of scientific papers in the Engineering-Manufacturing field, this study focuses on old dirty data in the Scopus and WoS databases. For this purpose, a recent automated algorithm for estimating the omitted-citation rate of databases is applied to the same sample of papers in three sessions held at different times. A database’s ability to clean old dirty data is evaluated by considering the variations in the omitted-citation rate from session to session. The major outcomes of this study are that (1) both databases slowly correct old omitted citations, and (2) a small portion of initially corrected citations can, surprisingly, disappear from the databases over time.
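
The session-to-session comparison described in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm, which the abstract does not specify. It assumes that a set of expected citation links is already known for the sample, and that each session yields the set of links the database actually records. The names omitted_rate and session_changes and all data are hypothetical.

```python
from typing import Dict, Set, Tuple

# A citation link is modelled as (citing_paper_id, cited_paper_id).
# The identifiers below are made up for illustration only.
Citation = Tuple[str, str]


def omitted_rate(expected: Set[Citation], recorded: Set[Citation]) -> float:
    """Share of expected citation links that the database does not record."""
    if not expected:
        return 0.0
    return len(expected - recorded) / len(expected)


def session_changes(
    expected: Set[Citation], earlier: Set[Citation], later: Set[Citation]
) -> Dict[str, Set[Citation]]:
    """Compare two sessions for the same sample of papers.

    corrected: links omitted in the earlier session but present in the later one.
    lost: links present in the earlier session but missing in the later one.
    """
    omitted_before = expected - earlier
    corrected = omitted_before & later
    lost = (expected & earlier) - later
    return {"corrected": corrected, "lost": lost}


# Hypothetical sample: four expected links, observed in three sessions.
expected = {("A", "X"), ("B", "X"), ("C", "X"), ("D", "Y")}
sessions = {
    1: {("A", "X"), ("D", "Y")},
    2: {("A", "X"), ("B", "X"), ("D", "Y")},
    3: {("A", "X"), ("B", "X")},
}

# Omitted-citation rate per session.
rates = {k: omitted_rate(expected, recorded) for k, recorded in sessions.items()}

# Links corrected or lost between consecutive sessions.
changes_1_2 = session_changes(expected, sessions[1], sessions[2])
changes_2_3 = session_changes(expected, sessions[2], sessions[3])
```

In this toy data the rate falls between the first and second sessions because one link is corrected, then rises again because a previously recorded link disappears. That mirrors the two kinds of variation the abstract reports, under the assumptions stated above.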

Suggested Citation

  • Fiorenzo Franceschini & Domenico Maisano & Luca Mastrogiacomo, 2016. "Do Scopus and WoS correct “old” omitted citations?," Scientometrics, Springer;Akadémiai Kiadó, vol. 107(2), pages 321-335, May.
  • Handle: RePEc:spr:scient:v:107:y:2016:i:2:d:10.1007_s11192-016-1867-8
    DOI: 10.1007/s11192-016-1867-8

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-016-1867-8
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.


    References listed on IDEAS

    1. Schenker N. & Gentleman J. F., 2001. "On Judging the Significance of Differences by Examining the Overlap Between Confidence Intervals," The American Statistician, American Statistical Association, vol. 55, pages 182-186, August.
    2. Marlies Olensky & Marion Schmidt & Nees Jan van Eck, 2016. "Evaluation of the citation matching algorithms of CWTS and iFQ in comparison to the Web of Science," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 67(10), pages 2550-2564, October.
    3. Franceschini, Fiorenzo & Maisano, Domenico & Mastrogiacomo, Luca, 2016. "The museum of errors/horrors in Scopus," Journal of Informetrics, Elsevier, vol. 10(1), pages 174-182.
    4. Franceschini, Fiorenzo & Maisano, Domenico & Mastrogiacomo, Luca, 2014. "Scientific journal publishers and omitted citations in bibliometric databases: Any relationship?," Journal of Informetrics, Elsevier, vol. 8(3), pages 751-765.
    5. Valderrama-Zurián, Juan-Carlos & Aguilar-Moya, Remedios & Melero-Fuentes, David & Aleixandre-Benavent, Rafael, 2015. "A systematic analysis of duplicate records in Scopus," Journal of Informetrics, Elsevier, vol. 9(3), pages 570-576.

    Citations


    Cited by:

    1. Thelwall, Mike, 2018. "Microsoft Academic automatic document searches: Accuracy for journal articles and suitability for citation analysis," Journal of Informetrics, Elsevier, vol. 12(1), pages 1-9.
    2. Shirley Ainsworth & Jane M. Russell, 2018. "Has hosting on science direct improved the visibility of Latin American scholarly journals? A preliminary analysis of data quality," Scientometrics, Springer;Akadémiai Kiadó, vol. 115(3), pages 1463-1484, June.
    3. Franceschini, Fiorenzo & Maisano, Domenico & Mastrogiacomo, Luca, 2016. "Empirical analysis and classification of database errors in Scopus and Web of Science," Journal of Informetrics, Elsevier, vol. 10(4), pages 933-953.
    4. Houqiang Yu & Xueting Cao & Tingting Xiao & Zhenyi Yang, 2020. "How accurate are policy document mentions? A first look at the role of altmetrics database," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(2), pages 1517-1540, November.
    5. Waltman, Ludo, 2016. "A review of the literature on citation impact indicators," Journal of Informetrics, Elsevier, vol. 10(2), pages 365-391.
    6. Mariana-Daniela González-Zamar & Emilio Abad-Segura & Eloy López-Meneses & José Gómez-Galán, 2020. "Managing ICT for Sustainable Education: Research Analysis in the Context of Higher Education," Sustainability, MDPI, vol. 12(19), pages 1-25, October.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Franceschini, Fiorenzo & Maisano, Domenico & Mastrogiacomo, Luca, 2016. "Empirical analysis and classification of database errors in Scopus and Web of Science," Journal of Informetrics, Elsevier, vol. 10(4), pages 933-953.
    2. Waltman, Ludo, 2016. "A review of the literature on citation impact indicators," Journal of Informetrics, Elsevier, vol. 10(2), pages 365-391.
    3. Shuo Xu & Liyuan Hao & Xin An & Dongsheng Zhai & Hongshen Pang, 2019. "Types of DOI errors of cited references in Web of Science with a cleaning method," Scientometrics, Springer;Akadémiai Kiadó, vol. 120(3), pages 1427-1437, September.
    4. Shirley Ainsworth & Jane M. Russell, 2018. "Has hosting on science direct improved the visibility of Latin American scholarly journals? A preliminary analysis of data quality," Scientometrics, Springer;Akadémiai Kiadó, vol. 115(3), pages 1463-1484, June.
    5. Thelwall, Mike, 2018. "Microsoft Academic automatic document searches: Accuracy for journal articles and suitability for citation analysis," Journal of Informetrics, Elsevier, vol. 12(1), pages 1-9.
    6. Alessia Cioffi & Sara Coppini & Arcangelo Massari & Arianna Moretti & Silvio Peroni & Cristian Santini & Nooshin Shahidzadeh Asadi, 2022. "Identifying and correcting invalid citations due to DOI errors in Crossref data," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(6), pages 3593-3612, June.
    7. Fiorenzo Franceschini & Domenico Maisano & Luca Mastrogiacomo, 2015. "Influence of omitted citations on the bibliometric statistics of the major Manufacturing journals," Scientometrics, Springer;Akadémiai Kiadó, vol. 103(3), pages 1083-1122, June.
    8. Sergio Copiello, 2019. "The open access citation premium may depend on the openness and inclusiveness of the indexing database, but the relationship is controversial because it is ambiguous where the open access boundary lies," Scientometrics, Springer;Akadémiai Kiadó, vol. 121(2), pages 995-1018, November.
    9. Houqiang Yu & Xueting Cao & Tingting Xiao & Zhenyi Yang, 2020. "How accurate are policy document mentions? A first look at the role of altmetrics database," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(2), pages 1517-1540, November.
    10. Franceschini, Fiorenzo & Maisano, Domenico & Mastrogiacomo, Luca, 2016. "The museum of errors/horrors in Scopus," Journal of Informetrics, Elsevier, vol. 10(1), pages 174-182.
    11. Domínguez-Torreiro, Marcos & Soliño, Mario, 2011. "Provided and perceived status quo in choice experiments: Implications for valuing the outputs of multifunctional rural areas," Ecological Economics, Elsevier, vol. 70(12), pages 2523-2531.
    12. Christophe Boudry & Ghislaine Chartron, 2017. "Availability of digital object identifiers in publications archived by PubMed," Scientometrics, Springer;Akadémiai Kiadó, vol. 110(3), pages 1453-1469, March.
    13. Tim Goedemé & Karel Van den Bosch & Lina Salanauskaite & Gerlinde Verbist, 2013. "Testing the Statistical Significance of Microsimulation Results: Often Easier than You Think. A Technical Note," ImPRovE Working Papers 13/10, Herman Deleeck Centre for Social Policy, University of Antwerp.
    14. Carmen de la Cruz-Lovera & Alberto-Jesus Perea-Moreno & José Luis de la Cruz-Fernández & Francisco G. Montoya & Alfredo Alcayde & Francisco Manzano-Agugliaro, 2019. "Analysis of Research Topics and Scientific Collaborations in Energy Saving Using Bibliometric Techniques and Community Detection," Energies, MDPI, vol. 12(10), pages 1-23, May.
    15. Matthieu Ballandonne & Igor Cersosimo, 2021. "A note on reference publication year spectroscopy with incomplete information," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(6), pages 4927-4939, June.
    16. Amy E. Wagler, 2014. "Confidence Intervals for Assessing Heterogeneity in Generalized Linear Mixed Models," Journal of Educational and Behavioral Statistics, vol. 39(3), pages 167-179, June.
    17. Laura Lindberg & Kathryn Kost & Isaac Maddow-Zimet & Sheila Desai & Mia Zolna, 2020. "Abortion Reporting in the United States: An Assessment of Three National Fertility Surveys," Demography, Springer;Population Association of America (PAA), vol. 57(3), pages 899-925, June.
    18. Fabio S. V. Silva & Peter A. Schulz & Everard C. M. Noyons, 2019. "Co-authorship networks and research impact in large research facilities: benchmarking internal reports and bibliometric databases," Scientometrics, Springer;Akadémiai Kiadó, vol. 118(1), pages 93-108, January.
    19. Yuqing Feng & Jing Wei & Maogui Hu & Chengdong Xu & Tao Li & Jinfeng Wang & Wei Chen, 2022. "Lagged Effects of Exposure to Air Pollutants on the Risk of Pulmonary Tuberculosis in a Highly Polluted Region," IJERPH, MDPI, vol. 19(9), pages 1-13, May.
    20. Sabine D Klein & Loredana Torchetti & Martin Frei-Erb & Ursula Wolf, 2015. "Usage of Complementary Medicine in Switzerland: Results of the Swiss Health Survey 2012 and Development Since 2007," PLOS ONE, Public Library of Science, vol. 10(10), pages 1-10, October.
