IDEAS home Printed from https://ideas.repec.org/a/spr/ijsaem/v8y2017i2d10.1007_s13198-016-0508-1.html
   My bibliography  Save this article

Empirical analysis of metrics for object oriented multidimensional model of data warehouse using unsupervised machine learning techniques

Author

Listed:
  • Sangeeta Sabharwal

    (NSIT)

  • Sushama Nagpal

    (NSIT)

  • Gargi Aggarwal

    (NSIT)

Abstract

Data Warehouse provides the foundation for businesses to take informed decisions for day to day operations and making future strategy. Since the role is so pivotal to the growth and success of the business, its quality is very critical. Conceptual models of data warehouses give us a great insight into the quality of the developed system during the early stages of the design process. Researchers have proposed a number of metrics to evaluate the quality of these object oriented multidimensional models. Further, for these metrics to be used in practice, empirical evaluation is crucial. There are a number of propositions in literature that work towards empirical validation of metrics. But most of them are either restricted to statistical techniques or supervised machine learning techniques. In order to empirically validate the metrics, we need to get user responses for a number of schemas and take down observations to quantify model quality aspects like understandability, efficiency etc. This can result in personal biases, errors and random outliers which impacts the evaluation model. In this paper, we have made a first attempt to assess the relationship between the object oriented multidimensional data warehouse structural metrics and understandability of its models by using unsupervised machine learning techniques with the aid of a data warehouse quality expert. The results indicate that the proposed metrics have a strong relationship with understandability and inturn quality of the data warehouse conceptual models and the unsupervised techniques are able to identify this relationship with high degree of accuracy.

Suggested Citation

  • Sangeeta Sabharwal & Sushama Nagpal & Gargi Aggarwal, 2017. "Empirical analysis of metrics for object oriented multidimensional model of data warehouse using unsupervised machine learning techniques," International Journal of System Assurance Engineering and Management, Springer;The Society for Reliability, Engineering Quality and Operations Management (SREQOM),India, and Division of Operation and Maintenance, Lulea University of Technology, Sweden, vol. 8(2), pages 703-715, November.
  • Handle: RePEc:spr:ijsaem:v:8:y:2017:i:2:d:10.1007_s13198-016-0508-1
    DOI: 10.1007/s13198-016-0508-1
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s13198-016-0508-1
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s13198-016-0508-1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Manuel Serrano & Coral Calero & Mario Piattini, 2005. "An Experimental Replication With Data Warehouse Metrics," International Journal of Data Warehousing and Mining (IJDWM), IGI Global, vol. 1(4), pages 1-21, October.
    2. Anjana Gosain & Sangeeta Sabharwal & Sushama Nagpal, 2012. "Predicting quality of data warehouse using fuzzy logic," International Journal of Business and Systems Research, Inderscience Enterprises Ltd, vol. 6(3), pages 255-268.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Anjana Gosain & Jaspreeti Singh, 2017. "Quality metrics emphasizing dimension hierarchy sharing in multidimensional models for data warehouse: a theoretical and empirical evaluation," International Journal of System Assurance Engineering and Management, Springer;The Society for Reliability, Engineering Quality and Operations Management (SREQOM),India, and Division of Operation and Maintenance, Lulea University of Technology, Sweden, vol. 8(2), pages 1672-1688, November.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:ijsaem:v:8:y:2017:i:2:d:10.1007_s13198-016-0508-1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.