IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v127y2022i3d10.1007_s11192-022-04266-0.html
   My bibliography  Save this article

Accounting for quality in data integration systems: a completeness-aware integration approach

Author

Listed:
  • Cinzia Daraio

    (Sapienza University of Rome)

  • Simone Leo

    (Sapienza University of Rome)

  • Monica Scannapieco

    (ISTAT)

Abstract

Ensuring the quality of integrated data is undoubtedly one of the main problems of integrated data systems. When focusing on multi-national and historical data integration systems, where the “space” and “time” dimensions play a relevant role, it is very much important to build the integration layer in such a way that the final user accesses a layer that is “by design” as much complete as possible. In this paper, we propose a method for accessing data in multipurpose data infrastructures, like data integration systems, which has the properties of (i) relieving the final user from the need to access single data sources while, at the same time, (ii) ensuring to maximize the amount of the information available for the user at the integration layer. Our approach is based on a completeness-aware integration approach which allows the user to have ready available all the maximum information that can get out of the integrated data system without having to carry out the preliminary data quality analysis on each of the databases included in the system. Our proposal of providing data quality information at the integrated level extends then the functions of the individual data sources, opening the data infrastructure to additional uses. This may be a first step to move from data infrastructures towards knowledge infrastructures. A case study on the research infrastructure for the science and innovation studies shows the usefulness of the proposed approach.

Suggested Citation

  • Cinzia Daraio & Simone Leo & Monica Scannapieco, 2022. "Accounting for quality in data integration systems: a completeness-aware integration approach," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(3), pages 1465-1490, March.
  • Handle: RePEc:spr:scient:v:127:y:2022:i:3:d:10.1007_s11192-022-04266-0
    DOI: 10.1007/s11192-022-04266-0
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-022-04266-0
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-022-04266-0?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Cinzia Daraio & Andrea Bonaccorsi, 2017. "Beyond university rankings? Generating new indicators on universities by linking data in open platforms," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 68(2), pages 508-529, February.
    2. Cinzia Daraio, 2017. "A framework for the Assessment of Research and its impacts," DIAG Technical Reports 2017-04, Department of Computer, Control and Management Engineering, Universita' degli Studi di Roma "La Sapienza".
    3. Marco Angelini & Cinzia Daraio & Maurizio Lenzerini & Francesco Leotta & Giuseppe Santucci, 2020. "Performance model’s development: a novel approach encompassing ontology-based data access and visual analytics," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(2), pages 865-892, November.
    4. Cinzia Daraio & Maurizio Lenzerini & Claudio Leporelli & Henk F. Moed & Paolo Naggar & Andrea Bonaccorsi & Alessandro Bartolucci, 2016. "Data integration for research and innovation policy: an Ontology-Based Data Management approach," Scientometrics, Springer;Akadémiai Kiadó, vol. 106(2), pages 857-871, February.
    5. Atanu Sengupta & Sanjoy De, 2020. "Review of Literature," India Studies in Business and Economics, in: Assessing Performance of Banks in India Fifty Years After Nationalization, chapter 0, pages 15-30, Springer.
    6. Hamid Ekbia & Michael Mattioli & Inna Kouper & G. Arave & Ali Ghazinejad & Timothy Bowman & Venkata Ratandeep Suri & Andrew Tsou & Scott Weingart & Cassidy R. Sugimoto, 2015. "Big data, bigger dilemmas: A critical review," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 66(8), pages 1523-1545, August.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Marco Angelini & Cinzia Daraio & Maurizio Lenzerini & Francesco Leotta & Giuseppe Santucci, 2019. "Performance Model’s development: A Novel Approach encompassing Ontology-Based Data Access and Visual Analytics," DIAG Technical Reports 2019-11, Department of Computer, Control and Management Engineering, Universita' degli Studi di Roma "La Sapienza".
    2. Marco Angelini & Cinzia Daraio & Maurizio Lenzerini & Francesco Leotta & Giuseppe Santucci, 2020. "Performance model’s development: a novel approach encompassing ontology-based data access and visual analytics," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(2), pages 865-892, November.
    3. Cinzia Daraio, 2017. "A framework for the Assessment of Research and its impacts," DIAG Technical Reports 2017-04, Department of Computer, Control and Management Engineering, Universita' degli Studi di Roma "La Sapienza".
    4. José María López-Sanz & Azucena Penelas-Leguía & Pablo Gutiérrez-Rodríguez & Pedro Cuesta-Valiño, 2021. "Sustainable Development and Consumer Behavior in Rural Tourism—The Importance of Image and Loyalty for Host Communities," Sustainability, MDPI, vol. 13(9), pages 1-20, April.
    5. Cristina Blasi Casagran & Colleen Boland & Elena Sánchez-Montijano & Eva Vilà Sanchez, 2021. "The Role of Emerging Predictive IT Tools in Effective Migration Governance," Politics and Governance, Cogitatio Press, vol. 9(4), pages 133-145.
    6. Maria Maddalena Sirufo & Francesca De Pietro & Alessandra Catalogna & Lia Ginaldi & Massimo De Martinis, 2021. "The Microbiota-Bone-Allergy Interplay," IJERPH, MDPI, vol. 19(1), pages 1-14, December.
    7. Oleh Pasko & Mykola Hordiyenko & Fuli Chen & Yarmila Tkal & Yulia Abraham, 2021. "Mapping Global Research on International Financial Reporting Standards: A Scientometric Review," International Journal of Financial Research, International Journal of Financial Research, Sciedu Press, vol. 12(3), pages 116-134, May.
    8. Zhang, Tianyu & Dong, Peiwu & Zeng, Yongchao & Ju, Yanbing, 2022. "Analyzing the diffusion of competitive smart wearable devices: An agent-based multi-dimensional relative agreement model," Journal of Business Research, Elsevier, vol. 139(C), pages 90-105.
    9. Vitor Hugo Ferreira & André da Costa Pinho & Dickson Silva de Souza & Bárbara Siqueira Rodrigues, 2021. "A New Clustering Approach for Automatic Oscillographic Records Segmentation," Energies, MDPI, vol. 14(20), pages 1-18, October.
    10. Maurizio Massaro & Francesca Dal Mas & Charbel Jose Chiappetta Jabbour & Carlo Bagnoli, 2020. "Crypto‐economy and new sustainable business models: Reflections and projections using a case study analysis," Corporate Social Responsibility and Environmental Management, John Wiley & Sons, vol. 27(5), pages 2150-2160, September.
    11. Ines A. Ferreira & Rachel M. Gisselquist & Finn Tarp, 2021. "On the impact of inequality on growth, human development, and governance," WIDER Working Paper Series wp-2021-34, World Institute for Development Economic Research (UNU-WIDER).
    12. He Tingting, 2021. "Comparing Money and Time Donation: What Do Experiments Tell Us?," Marketing of Scientific and Research Organizations, Sciendo, vol. 41(3), pages 65-94, September.
    13. Beatriz Calzada Olvera & Mario Gonzalez-Sauri & Federico Louvin & David-Alexander Harings Moya, 2021. "COVID-19 in Central America: effects of firm resilience and policy responses on employment," WIDER Working Paper Series wp-2021-166, World Institute for Development Economic Research (UNU-WIDER).
    14. Alberto Cerezo-Narváez & Andrés Pastor-Fernández & Manuel Otero-Mateo & Pablo Ballesteros-Pérez, 2022. "The Influence of Knowledge on Managing Risk for the Success in Complex Construction Projects: The IPMA Approach," Sustainability, MDPI, vol. 14(15), pages 1-30, August.
    15. Iversen, Sara V. & Naomi, van der Velden & Convery, Ian & Mansfield, Lois & Holt, Claire D.S., 2022. "Why understanding stakeholder perspectives and emotions is important in upland woodland creation – A case study from Cumbria, UK," Land Use Policy, Elsevier, vol. 114(C).
    16. Kik, M.C. & Claassen, G.D.H. & Meuwissen, M.P.M. & Smit, A.B. & Saatkamp, H.W., 2021. "Actor analysis for sustainable soil management – A case study from the Netherlands," Land Use Policy, Elsevier, vol. 107(C).
    17. Rafidah Md Noor & Nadia Bella Gustiani Rasyidi & Tarak Nandy & Raenu Kolandaisamy, 2020. "Campus Shuttle Bus Route Optimization Using Machine Learning Predictive Analysis: A Case Study," Sustainability, MDPI, vol. 13(1), pages 1-24, December.
    18. Dominika Ehrenbergerová & Martin Hodula & Zuzana Gric, 2022. "Does capital-based regulation affect bank pricing policy?," Journal of Regulatory Economics, Springer, vol. 61(2), pages 135-167, April.
    19. Kenneth David Strang & Zhaohao Sun, 2017. "Big Data Paradigm: What is the Status of Privacy and Security?," Annals of Data Science, Springer, vol. 4(1), pages 1-17, March.
    20. Daraio, Cinzia & Simar, Leopold & Wilson, Paul, 2019. "Quality and its impact on efficiency," LIDAM Discussion Papers ISBA 2019004, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:127:y:2022:i:3:d:10.1007_s11192-022-04266-0. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.