IDEAS home Printed from https://ideas.repec.org/h/zbw/hiclch/267187.html
   My bibliography  Save this book chapter

Preliminary analysis on data quality for ML applications

In: Changing Tides: The New Role of Resilience and Sustainability in Logistics and Supply Chain Management – Innovative Approaches for the Shift to a New Era. Proceedings of the Hamburg International Conference of Logistics (HICL), Vol. 33

Author

Listed:
  • Kiebler, Lorenz
  • Moroff, Nikolas Ulrich
  • Jacobsen, Jens Jakob

Abstract

Purpose: This publication investigates preliminary data quality analyses to estimate the efforts and expected results of the use of data sets for ML solutions already in the data understanding phase of an implementation. Knowledge about the necessary data cleaning efforts and result qualities allows potentials to be estimated early in the process. Methodology: Through a literature research, characteristics of a time series as well as methods of data cleaning are analysed. Based on the results, a test environment is implemented in Python, enabling the evaluation of individual methods using sample data sets from the process industry and comparing them with different error analyses. Findings: The publication describes a detailed overview of data cleaning procedures and addresses a first Indication of a connection between the final achievable forecast quality and the degree of error of the original data set. Insights into the influence of the choice of preprocessing method on the achievable quality of the AI-based forecast can be concluded. Originality: Within the publication, the link between data characteristics in time series and preprocessing methods is established to draw conclusions in advance about the quality improvement to be expected from selected data cleaning methods and to provide decision support for the selection of the method.

Suggested Citation

  • Kiebler, Lorenz & Moroff, Nikolas Ulrich & Jacobsen, Jens Jakob, 2022. "Preliminary analysis on data quality for ML applications," Chapters from the Proceedings of the Hamburg International Conference of Logistics (HICL), in: Kersten, Wolfgang & Jahn, Carlos & Blecker, Thorsten & Ringle, Christian M. (ed.), Changing Tides: The New Role of Resilience and Sustainability in Logistics and Supply Chain Management – Innovative Approaches for the Shift to a New , volume 33, pages 207-236, Hamburg University of Technology (TUHH), Institute of Business Logistics and General Management.
  • Handle: RePEc:zbw:hiclch:267187
    DOI: 10.15480/882.4693
    as

    Download full text from publisher

    File URL: https://www.econstor.eu/bitstream/10419/267187/1/hicl-2021-33-207.pdf
    Download Restriction: no

    File URL: https://libkey.io/10.15480/882.4693?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Bole, Uroš & Popovič, Aleš & Žabkar, Jure & Papa, Gregor & Jaklič, Jurij, 2015. "A case analysis of embryonic data mining success," International Journal of Information Management, Elsevier, vol. 35(2), pages 253-259.
    2. D. Laurie Hughes & Nripendra P. Rana & Yogesh K. Dwivedi, 2020. "Elucidation of IS project success factors: an interpretive structural modelling approach," Annals of Operations Research, Springer, vol. 285(1), pages 35-66, February.
    3. Ralf T. Kreutzer & Marie Sirrenberg, 2019. "Künstliche Intelligenz verstehen," Springer Books, Springer, number 978-3-658-25561-9, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Rajiv Kumar Sharma, 2022. "Examining interaction among supplier selection strategies in an outsourcing environment using ISM and fuzzy logic approach," International Journal of System Assurance Engineering and Management, Springer;The Society for Reliability, Engineering Quality and Operations Management (SREQOM),India, and Division of Operation and Maintenance, Lulea University of Technology, Sweden, vol. 13(5), pages 2175-2194, October.
    2. Roll, Oliver & Loh, Patrick, 2020. "Der Einfluss der Digitalisierung auf das Preismanagement – Ansatzpunkte, Modelle und Methoden," Die Unternehmung - Swiss Journal of Business Research and Practice, Nomos Verlagsgesellschaft mbH & Co. KG, vol. 74(4), pages 334-348.
    3. Garg, Aashish & Goel, Pooja & Sharma, Anuj & Rana, Nripendra P., 2022. "As you sow, so shall you reap: Assessing drivers of socially responsible investment attitude and intention," Technological Forecasting and Social Change, Elsevier, vol. 184(C).
    4. Elvira Ismagilova & Laurie Hughes & Nripendra P. Rana & Yogesh K. Dwivedi, 2022. "Security, Privacy and Risks Within Smart Cities: Literature Review and Development of a Smart City Interaction Framework," Information Systems Frontiers, Springer, vol. 24(2), pages 393-414, April.
    5. M. Suguna & Bhavin Shah & S. Karthik Raj & M. Suresh, 2022. "A study on the influential factors of the last mile delivery projects during Covid-19 era," Operations Management Research, Springer, vol. 15(1), pages 399-412, June.
    6. Mahmud A. Shareef & Yogesh K. Dwivedi & Vinod Kumar & D. Laurie Hughes & Ramakrishnan Raman, 2022. "Sustainable supply chain for disaster management: structural dynamics and disruptive risks," Annals of Operations Research, Springer, vol. 319(1), pages 1451-1475, December.
    7. Sorooshian, Shahryar & Tavana, Madjid & Ribeiro-Navarrete, Samuel, 2023. "From classical interpretive structural modeling to total interpretive structural modeling and beyond: A half-century of business research," Journal of Business Research, Elsevier, vol. 157(C).
    8. A. M. Alamdari & Y. Jabarzadeh & B. Adams & D. Samson & S. Khanmohammadi, 2023. "An analytic network process model to prioritize supply chain risks in green residential megaprojects," Operations Management Research, Springer, vol. 16(1), pages 141-163, March.
    9. Cruz-Jesus, Frederico & Figueira-Alves, Hugo & Tam, Carlos & Pinto, Diego Costa & Oliveira, Tiago & Venkatesh, Viswanath, 2023. "Pragmatic and idealistic reasons: What drives electric vehicle drivers' satisfaction and continuance intention?," Transportation Research Part A: Policy and Practice, Elsevier, vol. 170(C).
    10. Yigang Jiang & Guanxin Yao & Jing Xu & Yue Tian, 2021. "Study in Driving Strategy and Analysis of Sustainable and Symbiosis Development Relationship between Agricultural Industrial Clusters and Agricultural Logistics Industry," Sustainability, MDPI, vol. 13(24), pages 1-21, December.

    More about this item

    Keywords

    Artificial Intelligence; Blockchain;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:zbw:hiclch:267187. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ZBW - Leibniz Information Centre for Economics (email available below). General contact details of provider: https://hicl.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.