Printed from https://ideas.repec.org/a/spr/scient/v93y2012i3d10.1007_s11192-012-0810-x.html

Ten challenges in modeling bibliographic data for bibliometric analysis

Author

Listed:
  • Alfio Ferrara (Università degli Studi di Milano)

  • Silvia Salini (Università degli Studi di Milano)

Abstract

The complexity and variety of bibliographic data are growing, and efforts to define new methodologies and techniques for bibliometric analysis are intensifying. In this complex scenario, one of the most crucial issues is the quality of data and the capability of bibliometric analysis to cope with multiple data dimensions. Although the problem of enforcing a multidimensional approach to the analysis and management of bibliographic data is not new, a reference design pattern and a specific conceptual model for multidimensional analysis of bibliographic data are still missing. In this paper, we discuss ten of the most relevant challenges for bibliometric analysis when dealing with multidimensional data, and we propose a reference data model that, according to different goals, can help analysis designers and bibliographic experts in working with large collections of bibliographic data.

Suggested Citation

  • Alfio Ferrara & Silvia Salini, 2012. "Ten challenges in modeling bibliographic data for bibliometric analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 93(3), pages 765-785, December.
  • Handle: RePEc:spr:scient:v:93:y:2012:i:3:d:10.1007_s11192-012-0810-x
    DOI: 10.1007/s11192-012-0810-x

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-012-0810-x
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.



    Citations



    Cited by:

    1. Jeong, Yujin & Park, Inchae & Yoon, Byungun, 2019. "Identifying emerging Research and Business Development (R&BD) areas based on topic modeling and visualization with intellectual property right data," Technological Forecasting and Social Change, Elsevier, vol. 146(C), pages 655-672.
    2. Chyi-Kwei Yau & Alan Porter & Nils Newman & Arho Suominen, 2014. "Clustering scientific documents with topic modeling," Scientometrics, Springer;Akadémiai Kiadó, vol. 100(3), pages 767-786, September.
    3. Sabine Loudcher & Wararat Jakawat & Edmundo Pavel Soriano Morales & Cécile Favre, 2015. "Combining OLAP and information networks for bibliographic data analysis: a survey," Scientometrics, Springer;Akadémiai Kiadó, vol. 103(2), pages 471-487, May.
    4. Massimo FLORIO & Francesco GIFFONI, 2019. "L’impatto sociale della produzione di scienza su larga scala: come governarlo?," Departmental Working Papers 2019-05, Department of Economics, Management and Quantitative Methods at Università degli Studi di Milano.
    5. Francesca Battisti & Alfio Ferrara & Silvia Salini, 2015. "A decade of research in statistics: a topic model approach," Scientometrics, Springer;Akadémiai Kiadó, vol. 103(2), pages 413-433, May.
    6. Bornmann, Lutz, 2019. "Does the normalized citation impact of universities profit from certain properties of their published documents – such as the number of authors and the impact factor of the publishing journals? A mult," Journal of Informetrics, Elsevier, vol. 13(1), pages 170-184.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Guillaume Cabanac, 2012. "Shaping the landscape of research in information systems from the perspective of editorial boards: A scientometric study of 77 leading journals," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 63(5), pages 977-996, May.
    2. Guillaume Cabanac, 2013. "Experimenting with the partnership ability φ-index on a million computer scientists," Scientometrics, Springer;Akadémiai Kiadó, vol. 96(1), pages 1-9, July.
    3. Gagolewski, Marek, 2011. "Bibliometric impact assessment with R and the CITAN package," Journal of Informetrics, Elsevier, vol. 5(4), pages 678-692.
    4. Guillaume Cabanac, 2012. "Shaping the landscape of research in information systems from the perspective of editorial boards: A scientometric study of 77 leading journals," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 63(5), pages 977-996, May.
    5. Ruiz, Francisco & El Gibari, Samira & Cabello, José M. & Gómez, Trinidad, 2020. "MRP-WSCI: Multiple reference point based weak and strong composite indicators," Omega, Elsevier, vol. 95(C).
    6. Fernanda Morillo & Ignacio Santabárbara & Javier Aparicio, 2013. "The automatic normalisation challenge: detailed addresses identification," Scientometrics, Springer;Akadémiai Kiadó, vol. 95(3), pages 953-966, June.
    7. Cova, Tânia F.G.G. & Jarmelo, Susana & Formosinho, Sebastião J. & de Melo, J. Sérgio Seixas & Pais, Alberto A.C.C., 2015. "Unsupervised characterization of research institutions with task-force estimation," Journal of Informetrics, Elsevier, vol. 9(1), pages 59-68.
    8. Yang Ding & Yelin Fu & Kin Keung Lai & W. K. John Leung, 2018. "Using Ranked Weights and Acceptability Analysis to Construct Composite Indicators: A Case Study of Regional Sustainable Society Index," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 139(3), pages 871-885, October.
    9. Magnone, Edoardo, 2013. "A scientometric look at calendar events," Journal of Informetrics, Elsevier, vol. 7(1), pages 101-108.
    10. M. Ryan Haley, 2020. "Combining the weighted and unweighted Euclidean indices: a graphical approach," Scientometrics, Springer;Akadémiai Kiadó, vol. 123(1), pages 103-111, April.
    11. Víctor Giménez & Claudio Thieme & Diego Prior & Emili Tortosa-Ausina, 2020. "Evaluation and determinants of pre-school effectiveness in Chile," Working Papers 2020/02, Economics Department, Universitat Jaume I, Castellón (Spain).
    12. Carolin Michels & Ulrich Schmoch, 2012. "The growth of science and database coverage," Scientometrics, Springer;Akadémiai Kiadó, vol. 93(3), pages 831-846, December.
    13. Yelin Fu & Xiongtianrui Kong & Hao Luo & Lean Yu, 0. "Constructing Composite Indicators with Collective Choice and Interval-Valued TOPSIS: The Case of Value Measure," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 0, pages 1-19.
    14. Sicilia, Miguel-Angel & Sánchez-Alonso, Salvador & García-Barriocanal, Elena, 2011. "Comparing impact factors from two different citation databases: The case of Computer Science," Journal of Informetrics, Elsevier, vol. 5(4), pages 698-704.
    15. Yelin Fu & Kong Xiangtianrui & Hao Luo & Lean Yu, 2020. "Constructing Composite Indicators with Collective Choice and Interval-Valued TOPSIS: The Case of Value Measure," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 152(1), pages 117-135, November.
    16. Lin Zhang & Ronald Rousseau & Wolfgang Glänzel, 2016. "Diversity of references as an indicator of the interdisciplinarity of journals: Taking similarity between subject fields into account," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 67(5), pages 1257-1265, May.
    17. Domenico Piccolo & Rosaria Simone, 2019. "The class of cub models: statistical foundations, inferential issues and empirical evidence," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 28(3), pages 389-435, September.
    18. Eric Beh & Luigi D’Ambra, 2009. "Some Interpretative Tools for Non-Symmetrical Correspondence Analysis," Journal of Classification, Springer;The Classification Society, vol. 26(1), pages 55-76, April.
    19. Pilar García Gómez & Ángel López Nicolás, 2005. "Socio-economic inequalities in health in Catalonia," Hacienda Pública Española / Review of Public Economics, IEF, vol. 175(4), pages 103-121, December.
    20. Redivo, Edoardo & Nguyen, Hien D. & Gupta, Mayetri, 2020. "Bayesian clustering of skewed and multimodal data using geometric skewed normal distributions," Computational Statistics & Data Analysis, Elsevier, vol. 152(C).
