IDEAS home Printed from https://ideas.repec.org/a/spr/infosf/v15y2013i3d10.1007_s10796-012-9405-6.html
   My bibliography  Save this article

Active XML-based Web data integration

Author

Listed:
  • Rashed Salem

    (Université de Lyon (ERIC Lyon 2))

  • Omar Boussaïd

    (Université de Lyon (ERIC Lyon 2))

  • Jérôme Darmont

    (Université de Lyon (ERIC Lyon 2))

Abstract

Today, the Web is the largest source of information worldwide. There is currently a strong trend for decision-making applications such as Data Warehousing (DW) and Business Intelligence (BI) to move onto the Web, especially in the cloud. Integrating data into DW/BI applications is a critical and time-consuming task. To make better decisions in DW/BI applications, next generation data integration poses new requirements to data integration systems, over those posed by traditional data integration. In this paper, we propose a generic, metadata-based, service-oriented, and event-driven approach for integrating Web data timely and autonomously. Beside handling data heterogeneity, distribution and interoperability, our approach satisfies near real-time requirements and realize active data integration. For this sake, we design and develop a framework that utilizes Web standards (e.g., XML and Web services) for tackling data heterogeneity, distribution and interoperability issues. Moreover, our framework utilizes Active XML (AXML) to warehouse passive data as well as services to integrate active and dynamic data on-the-fly. AXML embedded services and changes detection services ensure near real-time data integration. Furthermore, the idea of integrating Web data actively and autonomously revolves around mining events logged by the data integration environment. Therefore, we propose an incremental XML-based algorithm for mining association rules from logged events. Then, we define active rules dynamically upon mined data to automate and reactivate integration tasks. Finally, as a proof of concept, we implement a framework prototype as a Web application using open-source tools.

Suggested Citation

  • Rashed Salem & Omar Boussaïd & Jérôme Darmont, 2013. "Active XML-based Web data integration," Information Systems Frontiers, Springer, vol. 15(3), pages 371-398, July.
  • Handle: RePEc:spr:infosf:v:15:y:2013:i:3:d:10.1007_s10796-012-9405-6
    DOI: 10.1007/s10796-012-9405-6
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10796-012-9405-6
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10796-012-9405-6?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. M. Asif Naeem & Gillian Dobbie & Gerald Weber, 2011. "HYBRIDJOIN for Near-Real-Time Data Warehousing," International Journal of Data Warehousing and Mining (IJDWM), IGI Global, vol. 7(4), pages 21-42, October.
    2. Vicky Nassis & Rajugan Rajagopalapillai & Tharam S. Dillon & Wenny Rahayu, 2005. "Conceptual and Systematic Design Approach for XML Document Warehouses," International Journal of Data Warehousing and Mining (IJDWM), IGI Global, vol. 1(3), pages 63-87, July.
    3. Laura Irina Rusu & J. Wenny Rahayu & David Taniar, 2005. "A Methodology for Building XML Data Warehouses," International Journal of Data Warehousing and Mining (IJDWM), IGI Global, vol. 1(2), pages 23-48, April.
    4. Benedikt Martens & Frank Teuteberg, 2012. "Decision-making in cloud computing environments: A cost and risk based approach," Information Systems Frontiers, Springer, vol. 14(4), pages 871-893, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Malu Castellanos & Florian Daniel & Irene Garrigós & Jose-Norberto Mazón, 2013. "Business Intelligence and the Web," Information Systems Frontiers, Springer, vol. 15(3), pages 307-309, July.
    2. Jakub Malý & Martin Nečaský, 2015. "Model-driven approach to modeling and validating integrity constraints for XML with OCL and Schematron," Information Systems Frontiers, Springer, vol. 17(4), pages 917-946, August.
    3. Chichang Jou, 2019. "Schema Extraction for Deep Web Query Interfaces Using Heuristics Rules," Information Systems Frontiers, Springer, vol. 21(1), pages 163-174, February.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mingwen Yang & Varghese S. Jacob & Srinivasan Raghunathan, 2021. "Cloud Service Model’s Role in Provider and User Security Investment Incentives," Production and Operations Management, Production and Operations Management Society, vol. 30(2), pages 419-437, February.
    2. John Oredo & Denis Dennehy, 2023. "Exploring the Role of Organizational Mindfulness on Cloud Computing and Firm Performance: The Case of Kenyan Organizations," Information Systems Frontiers, Springer, vol. 25(5), pages 2029-2050, October.
    3. Jewan Singh & Vibhakar Mansotra, 2019. "Towards Development of an Integrated Cloud-Computing Adoption Framework — A Case of Indian School Education System," International Journal of Innovation and Technology Management (IJITM), World Scientific Publishing Co. Pte. Ltd., vol. 16(02), pages 1-27, April.
    4. M. Asif Naeem, 2019. "Optimization and Extension of Stream-Relation Joins," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 18(04), pages 1289-1315, July.
    5. Shui-Lien Chen & June-Hong Chen & Shiou Chi Chang, 2017. "Understanding the Antecedents of Individuals Intention of Using Cloud Services," Journal of Economics and Management, College of Business, Feng Chia University, Taiwan, vol. 13(2), pages 139-166, August.
    6. Shui-Lien Chen & June-Hong Chen & Yung Hsin Lee, 2018. "A Comparison of Competing Models for Understanding Industrial Organization’s Acceptance of Cloud Services," Sustainability, MDPI, vol. 10(3), pages 1-20, March.
    7. Jason J. Jung & Yue-Shan Chang & Ying Liu & Chao-Chin Wu, 2012. "Advances in intelligent grid and cloud computing," Information Systems Frontiers, Springer, vol. 14(4), pages 823-825, September.
    8. Ping Wang & Kuo-Ming Chao & Chi-Chun Lo, 2015. "Satisfaction-based Web service discovery and selection scheme utilizing vague sets theory," Information Systems Frontiers, Springer, vol. 17(4), pages 827-844, August.
    9. Wafa Bouaynaya, 2020. "Characterization of Cloud Computing Reversibility as Explored by the DELPHI Method," Information Systems Frontiers, Springer, vol. 22(6), pages 1505-1518, December.
    10. Xiaolong Zheng & Daniel Zeng & Fei-Yue Wang, 2015. "Social balance in signed networks," Information Systems Frontiers, Springer, vol. 17(5), pages 1077-1095, October.
    11. Shuai Yuan & Sanjukta Das & Ram Ramesh & Chunming Qiao, 2023. "Availability-Aware Virtual Resource Provisioning for Infrastructure Service Agreements in the Cloud," Information Systems Frontiers, Springer, vol. 25(4), pages 1495-1512, August.
    12. Haoyi Xiong & Daqing Zhang & Daqiang Zhang & Vincent Gauthier & Kun Yang & Monique Becker, 2014. "MPaaS: Mobility prediction as a service in telecom cloud," Information Systems Frontiers, Springer, vol. 16(1), pages 59-75, March.
    13. Nguyen Hoang Thuan & Pedro Antunes & David Johnstone, 2016. "Factors influencing the decision to crowdsource: A systematic literature review," Information Systems Frontiers, Springer, vol. 18(1), pages 47-68, February.
    14. Mitra, Amit & O'Regan, Nicholas & Sarpong, David, 2018. "Cloud resource adaptation: A resource based perspective on value creation for corporate growth," Technological Forecasting and Social Change, Elsevier, vol. 130(C), pages 28-38.
    15. Masky Mackita & Soo-Young Shin & Tae-Young Choe, 2019. "ERMOCTAVE: A Risk Management Framework for IT Systems Which Adopt Cloud Computing," Future Internet, MDPI, vol. 11(9), pages 1-21, September.
    16. Venkataraghavan Krishnaswamy & R. P. Sundarraj, 2017. "Organizational implications of a comprehensive approach for cloud-storage sourcing," Information Systems Frontiers, Springer, vol. 19(1), pages 57-73, February.
    17. Chen, Li-Ming & Chang, Wei-Lun, 2020. "Under what conditions can an application service firm with in-house computing benefit from cloudbursting?," European Journal of Operational Research, Elsevier, vol. 282(1), pages 71-80.
    18. Jeng-Chieh Cheng & Jeen-Fong Li & Chi-Yo Huang, 2023. "Enablers for Adopting Restriction of Hazardous Substances Directives by Electronic Manufacturing Service Providers," Sustainability, MDPI, vol. 15(16), pages 1-45, August.
    19. Adele Caldarelli & Luca Ferri & Marco Maffei, 2016. "I rischi derivanti dall?implementazione del cloud computing: un?indagine empirica nelle PMI Italiane," MANAGEMENT CONTROL, FrancoAngeli Editore, vol. 2016(3), pages 27-48.
    20. Chulhwan Chris Bang, 2015. "Information systems frontiers: Keyword analysis and classification," Information Systems Frontiers, Springer, vol. 17(1), pages 217-237, February.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:infosf:v:15:y:2013:i:3:d:10.1007_s10796-012-9405-6. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.