IDEAS home Printed from https://ideas.repec.org/a/spr/infosf/v20y2018i3d10.1007_s10796-017-9822-7.html
   My bibliography  Save this article

A Process Pattern Model for Tackling and Improving Big Data Quality

Author

Listed:
  • Agung Wahyudi

    (Delft University of Technology)

  • George Kuk

    (Nottingham Trent University)

  • Marijn Janssen

    (Delft University of Technology)

Abstract

Data seldom create value by themselves. They need to be linked and combined from multiple sources, which can often come with variable data quality. The task of improving data quality is a recurring challenge. In this paper, we use a case study of a large telecom company to develop a generic process pattern model for improving data quality. The process pattern model is defined as a proven series of activities, aimed at improving the data quality given a certain context, a particular objective, and a specific set of initial conditions. Four different patterns are derived to deal with the variations in data quality of datasets. Instead of having to find the way to improve the quality of big data for each situation, the process model provides data users with generic patterns, which can be used as a reference model to improve big data quality.

Suggested Citation

  • Agung Wahyudi & George Kuk & Marijn Janssen, 2018. "A Process Pattern Model for Tackling and Improving Big Data Quality," Information Systems Frontiers, Springer, vol. 20(3), pages 457-469, June.
  • Handle: RePEc:spr:infosf:v:20:y:2018:i:3:d:10.1007_s10796-017-9822-7
    DOI: 10.1007/s10796-017-9822-7
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10796-017-9822-7
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10796-017-9822-7?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Yogesh K. Dwivedi & Marijn Janssen & Emma L. Slade & Nripendra P. Rana & Vishanth Weerakkody & Jeremy Millard & Jan Hidders & Dhoya Snijders, 2017. "Driving innovation through big open linked data (BOLD): Exploring antecedents using interpretive structural modelling," Information Systems Frontiers, Springer, vol. 19(2), pages 197-212, April.
    2. Rajiv Nag & Donald C. Hambrick & Ming‐Jer Chen, 2007. "What is strategic management, really? Inductive derivation of a consensus definition of the field," Strategic Management Journal, Wiley Blackwell, vol. 28(9), pages 935-955, September.
    3. Nicky J. Welton & Howard H. Z. Thom, 2015. "Value of Information," Medical Decision Making, , vol. 35(5), pages 564-566, July.
    4. Soe-Tsyr Daphne Yuan & Shiou-Tian Hsu, 2017. "Enhancing service system design: An entity interaction pattern approach," Information Systems Frontiers, Springer, vol. 19(3), pages 481-507, June.
    5. Shih-Chia Huang & Suzanne McIntosh & Stanislav Sobolevsky & Patrick C. K. Hung, 2017. "Big Data Analytics and Business Intelligence in Industry," Information Systems Frontiers, Springer, vol. 19(6), pages 1229-1232, December.
    6. Soe-Tsyr Daphne Yuan & Shiou-Tian Hsu, 0. "Enhancing service system design: An entity interaction pattern approach," Information Systems Frontiers, Springer, vol. 0, pages 1-27.
    7. Yogesh K. Dwivedi & Marijn Janssen & Emma L. Slade & Nripendra P. Rana & Vishanth Weerakkody & Jeremy Millard & Jan Hidders & Dhoya Snijders, 0. "Driving innovation through big open linked data (BOLD): Exploring antecedents using interpretive structural modelling," Information Systems Frontiers, Springer, vol. 0, pages 1-16.
    8. Jörg Becker & Patrick Delfmann & Hanns-Alexander Dietrich & Matthias Steinhorst & Mathias Eggert, 2016. "Business process compliance checking – applying and evaluating a generic pattern matching approach for conceptual models in the financial sector," Information Systems Frontiers, Springer, vol. 18(2), pages 359-405, April.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Stefan Stieglitz & Christian Meske & Björn Ross & Milad Mirbabaie, 2020. "Going Back in Time to Predict the Future - The Complex Role of the Data Collection Period in Social Media Analytics," Information Systems Frontiers, Springer, vol. 22(2), pages 395-409, April.
    2. Qi Liu & Gengzhong Feng & Giri Kumar Tayi & Jun Tian, 2021. "Managing Data Quality of the Data Warehouse: A Chance-Constrained Programming Approach," Information Systems Frontiers, Springer, vol. 23(2), pages 375-389, April.
    3. Roman Lukyanenko & Andrea Wiggins & Holly K. Rosser, 0. "Citizen Science: An Information Quality Research Frontier," Information Systems Frontiers, Springer, vol. 0, pages 1-23.
    4. Amit V. Deokar & Jie Tao, 0. "OrgMiner: A Framework for Discovering User-Related Process Intelligence from Event Logs," Information Systems Frontiers, Springer, vol. 0, pages 1-20.
    5. Yogesh K. Dwivedi & Gerald Kelly & Marijn Janssen & Nripendra P. Rana & Emma L. Slade & Marc Clement, 2018. "Social Media: The Good, the Bad, and the Ugly," Information Systems Frontiers, Springer, vol. 20(3), pages 419-423, June.
    6. Amit V. Deokar & Jie Tao, 2021. "OrgMiner: A Framework for Discovering User-Related Process Intelligence from Event Logs," Information Systems Frontiers, Springer, vol. 23(3), pages 753-772, June.
    7. Roman Lukyanenko & Andrea Wiggins & Holly K. Rosser, 2020. "Citizen Science: An Information Quality Research Frontier," Information Systems Frontiers, Springer, vol. 22(4), pages 961-983, August.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. de Camargo Fiorini, Paula & Roman Pais Seles, Bruno Michel & Chiappetta Jabbour, Charbel Jose & Barberio Mariano, Enzo & de Sousa Jabbour, Ana Beatriz Lopes, 2018. "Management theory and big data literature: From a review to a research agenda," International Journal of Information Management, Elsevier, vol. 43(C), pages 112-129.
    2. Federico Cabitza & Angela Locoro & Carlo Batini, 2020. "Making Open Data more Personal Through a Social Value Perspective: a Methodological Approach," Information Systems Frontiers, Springer, vol. 22(1), pages 131-148, February.
    3. Gupta, Shivam & Kar, Arpan Kumar & Baabdullah, Abdullah & Al-Khowaiter, Wassan A.A., 2018. "Big data with cognitive computing: A review for the future," International Journal of Information Management, Elsevier, vol. 42(C), pages 78-89.
    4. Maciej Grzenda & Jaroslaw Legierski, 2021. "Towards Increased Understanding of Open Data Use for Software Development," Information Systems Frontiers, Springer, vol. 23(2), pages 495-513, April.
    5. Morteza Ghobakhloo & Mohammad Iranmanesh & Andrius Grybauskas & Mantas Vilkas & Monika Petraitė, 2021. "Industry 4.0, innovation, and sustainable development: A systematic review and a roadmap to sustainable innovation," Business Strategy and the Environment, Wiley Blackwell, vol. 30(8), pages 4237-4257, December.
    6. Shivam Gupta & Vinayak A. Drave & Surajit Bag & Zongwei Luo, 2019. "Leveraging Smart Supply Chain and Information System Agility for Supply Chain Flexibility," Information Systems Frontiers, Springer, vol. 21(3), pages 547-564, June.
    7. Marijn Janssen & David Konopnicki & Jane L. Snowdon & Adegboyega Ojo, 2017. "Driving public sector innovation using big and open linked data (BOLD)," Information Systems Frontiers, Springer, vol. 19(2), pages 189-195, April.
    8. Marijn Janssen & David Konopnicki & Jane L. Snowdon & Adegboyega Ojo, 0. "Driving public sector innovation using big and open linked data (BOLD)," Information Systems Frontiers, Springer, vol. 0, pages 1-7.
    9. Lnenicka, Martin & Komarkova, Jitka, 2019. "Developing a government enterprise architecture framework to support the requirements of big and open linked data with the use of cloud computing," International Journal of Information Management, Elsevier, vol. 46(C), pages 124-141.
    10. Neeraj Yadav & Ravi Shankar & Surya Prakash Singh, 2021. "Hierarchy of Critical Success Factors (CSF) for Lean Six Sigma (LSS) in Quality 4.0," International Journal of Global Business and Competitiveness, Springer, vol. 16(1), pages 1-14, June.
    11. Abdullah Mohammed Baabdullah, 2020. "Factors Influencing Adoption of Mobile Social Network Games (M-SNGs): The Role of Awareness," Information Systems Frontiers, Springer, vol. 22(2), pages 411-427, April.
    12. Arenas, Alvaro E. & Goh, Jie Mein & Urueña, Alberto, 2019. "How does IT affect design centricity approaches: Evidence from Spain’s smart tourism ecosystem," International Journal of Information Management, Elsevier, vol. 45(C), pages 149-162.
    13. Joash Mageto, 2021. "Big Data Analytics in Sustainable Supply Chain Management: A Focus on Manufacturing Supply Chains," Sustainability, MDPI, vol. 13(13), pages 1-22, June.
    14. Rana, Nripendra P. & Barnard, Daniel J. & Baabdullah, Abdullah M.A. & Rees, Daniel & Roderick, Sian, 2019. "Exploring barriers of m-commerce adoption in SMEs in the UK: Developing a framework using ISM," International Journal of Information Management, Elsevier, vol. 44(C), pages 141-153.
    15. Olivia Benfeldt & John Stouby Persson & Sabine Madsen, 2020. "Data Governance as a Collective Action Problem," Information Systems Frontiers, Springer, vol. 22(2), pages 299-313, April.
    16. Olivia Benfeldt & John Stouby Persson & Sabine Madsen, 0. "Data Governance as a Collective Action Problem," Information Systems Frontiers, Springer, vol. 0, pages 1-15.
    17. Nripendra P. Rana & Sunil Luthra & Sachin Kumar Mangla & Rubina Islam & Sian Roderick & Yogesh K. Dwivedi, 2019. "Barriers to the Development of Smart Cities in Indian Context," Information Systems Frontiers, Springer, vol. 21(3), pages 503-525, June.
    18. Calvard, Thomas Stephen & Jeske, Debora, 2018. "Developing human resource data risk management in the age of big data," International Journal of Information Management, Elsevier, vol. 43(C), pages 159-164.
    19. Sushil,, 2018. "Incorporating polarity of relationships in ISM and TISM for theory building in information and organization management," International Journal of Information Management, Elsevier, vol. 43(C), pages 38-51.
    20. Paul Brous & Marijn Janssen, 2020. "Trusted Decision-Making: Data Governance for Creating Trust in Data Science Decision Outcomes," Administrative Sciences, MDPI, vol. 10(4), pages 1-19, October.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:infosf:v:20:y:2018:i:3:d:10.1007_s10796-017-9822-7. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.