IDEAS home Printed from https://ideas.repec.org/a/wsi/ijitdm/v16y2017i06ns0219622017500341.html
   My bibliography  Save this article

Mining Weighted Frequent Itemsets without Candidate Generation in Uncertain Databases

Author

Listed:
  • Jerry Chun-Wei Lin

    (School of Computer Science and Technology, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, P. R. China)

  • Wensheng Gan

    (School of Computer Science and Technology, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, P. R. China)

  • Philippe Fournier-Viger

    (#x2020;School of Natural Sciences and Humanities, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, P. R. China)

  • Tzung-Pei Hong

    (#x2021;Department of Computer Science and Information Engineering, National University of Kaohsiung, Kaohsiung, Taiwan§Department of Computer Science and Engineering, National Sun Yat-sen University, Kaohsiung, Taiwan)

  • Han-Chieh Chao

    (#xB6;Department of Computer Science and Information Engineering, National Dong Hwa University, Hualien, Taiwan)

Abstract

Frequent itemset mining (FIM) is a fundamental set of techniques used to discover useful and meaningful relationships between items in transaction databases. In recent decades, extensions of FIM such as weighted frequent itemset mining (WFIM) and frequent itemset mining in uncertain databases (UFIM) have been proposed. WFIM considers that items may have different weight/importance. It can thus discover itemsets that are more useful and meaningful by ignoring irrelevant itemsets with lower weights. UFIM takes into account that data collected in a real-life environment may often be inaccurate, imprecise, or incomplete. Recently, these two ideas have been combined in the HEWI-Uapriori algorithm. This latter considers both item weights and transaction uncertainty to mine the high expected weighted itemsets (HEWIs) using a two-phase Apriori-based approach. Although the upper-bound proposed in HEWI-Uapriori can reduce the size of the search space, it still generates a large amount of candidates and uses a level-wise search. In this paper, a more efficient algorithm named HEWI-Utree is developed to efficiently mine HEWIs without performing multiple database scans and without generating candidates. This algorithm relies on three novel structures named element (E)-table, weighted-probability (WP)-table and WP-tree to maintain the information required for identifying and pruning unpromising itemsets early. Experimental results show that the proposed algorithm is generally much more efficient than traditional methods for WFIM and UFIM, as well as the state-of-the-art HEWI-Uapriori algorithm, in terms of runtime, memory consumption, and scalability.

Suggested Citation

  • Jerry Chun-Wei Lin & Wensheng Gan & Philippe Fournier-Viger & Tzung-Pei Hong & Han-Chieh Chao, 2017. "Mining Weighted Frequent Itemsets without Candidate Generation in Uncertain Databases," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 16(06), pages 1549-1579, November.
  • Handle: RePEc:wsi:ijitdm:v:16:y:2017:i:06:n:s0219622017500341
    DOI: 10.1142/S0219622017500341
    as

    Download full text from publisher

    File URL: http://www.worldscientific.com/doi/abs/10.1142/S0219622017500341
    Download Restriction: Access to full text is restricted to subscribers

    File URL: https://libkey.io/10.1142/S0219622017500341?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Gang Kou & Yanqun Lu & Yi Peng & Yong Shi, 2012. "Evaluation Of Classification Algorithms Using Mcdm And Rank Correlation," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 11(01), pages 197-225.
    2. Binbin Zhang & Jerry Chun-Wei Lin & Philippe Fournier-Viger & Ting Li, 2017. "Mining of high utility-probability sequential patterns from uncertain databases," PLOS ONE, Public Library of Science, vol. 12(7), pages 1-21, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Fábio T. F. Silva & Alexandre Szklo & Amanda Vinhoza & Ana Célia Nogueira & André F. P. Lucena & Antônio Marcos Mendonça & Camilla Marcolino & Felipe Nunes & Francielle M. Carvalho & Isabela Tagomori , 2022. "Inter-sectoral prioritization of climate technologies: insights from a Technology Needs Assessment for mitigation in Brazil," Mitigation and Adaptation Strategies for Global Change, Springer, vol. 27(7), pages 1-39, October.
    2. Asongu, Simplice A. & Odhiambo, Nicholas M., 2021. "Inequality, finance and renewable energy consumption in Sub-Saharan Africa," Renewable Energy, Elsevier, vol. 165(P1), pages 678-688.
    3. Abiodun Ogunyemi & Kevin Johnston, 2017. "Is Server Virtualization Implementation in Business and Public Organizations a Worthwhile Investment?," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 16(03), pages 711-736, May.
    4. Wenyi Zeng & Deqing Li & Peizhuang Wang, 2016. "Variable Weight Decision Making and Balance Function Analysis Based on Factor Space," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 15(05), pages 999-1014, September.
    5. Kun Chen & Gang Kou & J. Michael Tarn & Yan Song, 2015. "Bridging the gap between missing and inconsistent values in eliciting preference from pairwise comparison matrices," Annals of Operations Research, Springer, vol. 235(1), pages 155-175, December.
    6. Kuang-Hua Hu & Wei Jianguo & Gwo-Hshiung Tzeng, 2017. "Risk Factor Assessment Improvement for China’s Cloud Computing Auditing Using a New Hybrid MADM Model," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 16(03), pages 737-777, May.
    7. Simplice A. Asongu & Nicholas M.Odhiambo, "undated". "Governance and Renewable Energy Consumption in sub-Saharan Africa," Working Papers AESRIWP11, African Economic and Social Research Institute (AESRI).
    8. Ying Li & Yung-Ho Chiu & Tai-Yu Lin & Tzu-Han Chang, 2020. "Pre-Evaluating the Technical Efficiency Gains from Potential Mergers and Acquisitions in the IC Design Industry," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 19(02), pages 525-559, April.
    9. Asongu, Simplice A. & Nnanna, Joseph & Acha-Anyi, Paul N., 2020. "Finance, inequality and inclusive education in Sub-Saharan Africa," Economic Analysis and Policy, Elsevier, vol. 67(C), pages 162-177.
    10. Mahmood Safaei & Elankovan A. Sundararajan & Shahla Asadi & Mehrbakhsh Nilashi & Mohd Juzaiddin Ab Aziz & M. S. Saravanan & Maha Abdelhaq & Raed Alsaqour, 2022. "A Hybrid MCDM Approach Based on Fuzzy-Logic and DEMATEL to Evaluate Adult Obesity," IJERPH, MDPI, vol. 19(23), pages 1-21, November.
    11. Ernesto Tarantino & Ivanoe De Falco & Umberto Scafuri, 2019. "A mobile personalized tourist guide and its user evaluation," Information Technology & Tourism, Springer, vol. 21(3), pages 413-455, September.
    12. Eleonora Bottani & Piera Centobelli & Teresa Murino & Ehsan Shekarian, 2018. "A QFD-ANP Method for Supplier Selection with Benefits, Opportunities, Costs and Risks Considerations," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 17(03), pages 911-939, May.
    13. Xunjie Gou & Zeshui Xu & Huchang Liao, 2019. "Hesitant Fuzzy Linguistic Possibility Degree-Based Linear Assignment Method for Multiple Criteria Decision-Making," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 18(01), pages 35-63, January.
    14. Simplice A. Asongu & Joseph Nnanna, 2020. "Governance and the Capital Flight Trap in Africa," Working Papers of the African Governance and Development Institute. 20/024, African Governance and Development Institute..
    15. Zheng Yuan & Baohua Wen & Cheng He & Jin Zhou & Zhonghua Zhou & Feng Xu, 2022. "Application of Multi-Criteria Decision-Making Analysis to Rural Spatial Sustainability Evaluation: A Systematic Review," IJERPH, MDPI, vol. 19(11), pages 1-31, May.
    16. Iheonu, Chimere & Asongu, Simplice & Odo, Kingsley & Ojiem, Patrick, 2020. "Financial Sector Development and Investment in Selected ECOWAS Countries: Empirical Evidence using Heterogeneous Panel Data Method," MPRA Paper 107102, University Library of Munich, Germany.
    17. Jozef Kapusta & Michal Munk & Martin Drlik, 2018. "Website Structure Improvement Based on the Combination of Selected Web Structure and Web Usage Mining Methods," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 17(06), pages 1743-1776, November.
    18. Viral Gupta & P. K. Kapur & Deepak Kumar, 2019. "Prioritizing and Optimizing Disaster Recovery Solution using Analytic Network Process and Multi Attribute Utility Theory," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 18(01), pages 171-207, January.
    19. Chun-Hao Chen & Tzung-Pei Hong & Yeong-Chyi Lee & Vincent S. Tseng, 2015. "Finding Active Membership Functions for Genetic-Fuzzy Data Mining," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 14(06), pages 1215-1242, November.
    20. Josep Domenech & Raul Peña-Ortiz & Jose A. Gil & Ana Pont, 2016. "A Methodology for Economic Evaluation of Cloud-Based Web Applications," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 15(06), pages 1555-1578, November.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wsi:ijitdm:v:16:y:2017:i:06:n:s0219622017500341. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Tai Tone Lim (email available below). General contact details of provider: http://www.worldscinet.com/ijitdm/ijitdm.shtml .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.