IDEAS home Printed from https://ideas.repec.org/a/wsi/ijitdm/v16y2017i06ns0219622017500341.html
   My bibliography  Save this article

Mining Weighted Frequent Itemsets without Candidate Generation in Uncertain Databases

Author

Listed:
  • Jerry Chun-Wei Lin

    (School of Computer Science and Technology, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, P. R. China)

  • Wensheng Gan

    (School of Computer Science and Technology, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, P. R. China)

  • Philippe Fournier-Viger

    (#x2020;School of Natural Sciences and Humanities, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, P. R. China)

  • Tzung-Pei Hong

    (#x2021;Department of Computer Science and Information Engineering, National University of Kaohsiung, Kaohsiung, Taiwan§Department of Computer Science and Engineering, National Sun Yat-sen University, Kaohsiung, Taiwan)

  • Han-Chieh Chao

    (#xB6;Department of Computer Science and Information Engineering, National Dong Hwa University, Hualien, Taiwan)

Abstract

Frequent itemset mining (FIM) is a fundamental set of techniques used to discover useful and meaningful relationships between items in transaction databases. In recent decades, extensions of FIM such as weighted frequent itemset mining (WFIM) and frequent itemset mining in uncertain databases (UFIM) have been proposed. WFIM considers that items may have different weight/importance. It can thus discover itemsets that are more useful and meaningful by ignoring irrelevant itemsets with lower weights. UFIM takes into account that data collected in a real-life environment may often be inaccurate, imprecise, or incomplete. Recently, these two ideas have been combined in the HEWI-Uapriori algorithm. This latter considers both item weights and transaction uncertainty to mine the high expected weighted itemsets (HEWIs) using a two-phase Apriori-based approach. Although the upper-bound proposed in HEWI-Uapriori can reduce the size of the search space, it still generates a large amount of candidates and uses a level-wise search. In this paper, a more efficient algorithm named HEWI-Utree is developed to efficiently mine HEWIs without performing multiple database scans and without generating candidates. This algorithm relies on three novel structures named element (E)-table, weighted-probability (WP)-table and WP-tree to maintain the information required for identifying and pruning unpromising itemsets early. Experimental results show that the proposed algorithm is generally much more efficient than traditional methods for WFIM and UFIM, as well as the state-of-the-art HEWI-Uapriori algorithm, in terms of runtime, memory consumption, and scalability.

Suggested Citation

  • Jerry Chun-Wei Lin & Wensheng Gan & Philippe Fournier-Viger & Tzung-Pei Hong & Han-Chieh Chao, 2017. "Mining Weighted Frequent Itemsets without Candidate Generation in Uncertain Databases," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 16(06), pages 1549-1579, November.
  • Handle: RePEc:wsi:ijitdm:v:16:y:2017:i:06:n:s0219622017500341
    DOI: 10.1142/S0219622017500341
    as

    Download full text from publisher

    File URL: http://www.worldscientific.com/doi/abs/10.1142/S0219622017500341
    Download Restriction: Access to full text is restricted to subscribers

    File URL: https://libkey.io/10.1142/S0219622017500341?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Gang Kou & Yanqun Lu & Yi Peng & Yong Shi, 2012. "Evaluation Of Classification Algorithms Using Mcdm And Rank Correlation," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 11(01), pages 197-225.
    2. Binbin Zhang & Jerry Chun-Wei Lin & Philippe Fournier-Viger & Ting Li, 2017. "Mining of high utility-probability sequential patterns from uncertain databases," PLOS ONE, Public Library of Science, vol. 12(7), pages 1-21, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Fábio T. F. Silva & Alexandre Szklo & Amanda Vinhoza & Ana Célia Nogueira & André F. P. Lucena & Antônio Marcos Mendonça & Camilla Marcolino & Felipe Nunes & Francielle M. Carvalho & Isabela Tagomori , 2022. "Inter-sectoral prioritization of climate technologies: insights from a Technology Needs Assessment for mitigation in Brazil," Mitigation and Adaptation Strategies for Global Change, Springer, vol. 27(7), pages 1-39, October.
    2. Abiodun Ogunyemi & Kevin Johnston, 2017. "Is Server Virtualization Implementation in Business and Public Organizations a Worthwhile Investment?," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 16(03), pages 711-736, May.
    3. Wenyi Zeng & Deqing Li & Peizhuang Wang, 2016. "Variable Weight Decision Making and Balance Function Analysis Based on Factor Space," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 15(05), pages 999-1014, September.
    4. Ying Li & Yung-Ho Chiu & Tai-Yu Lin & Tzu-Han Chang, 2020. "Pre-Evaluating the Technical Efficiency Gains from Potential Mergers and Acquisitions in the IC Design Industry," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 19(02), pages 525-559, April.
    5. Asongu, Simplice A. & Nnanna, Joseph & Acha-Anyi, Paul N., 2020. "Finance, inequality and inclusive education in Sub-Saharan Africa," Economic Analysis and Policy, Elsevier, vol. 67(C), pages 162-177.
    6. Xunjie Gou & Zeshui Xu & Huchang Liao, 2019. "Hesitant Fuzzy Linguistic Possibility Degree-Based Linear Assignment Method for Multiple Criteria Decision-Making," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 18(01), pages 35-63, January.
    7. Simplice A. Asongu & Joseph Nnanna, 2020. "Governance and the Capital Flight Trap in Africa," Working Papers of the African Governance and Development Institute. 20/024, African Governance and Development Institute..
    8. Zheng Yuan & Baohua Wen & Cheng He & Jin Zhou & Zhonghua Zhou & Feng Xu, 2022. "Application of Multi-Criteria Decision-Making Analysis to Rural Spatial Sustainability Evaluation: A Systematic Review," IJERPH, MDPI, vol. 19(11), pages 1-31, May.
    9. Chimere O. Iheonu & Simplice A. Asongu & Kingsley O. Odo & Patrick K. Ojiem, 2020. "Financial Sector Development and Investment in Selected ECOWAS Countries: Empirical Evidence using Heterogeneous Panel Data Method," Working Papers of the African Governance and Development Institute. 20/045, African Governance and Development Institute..
    10. Jozef Kapusta & Michal Munk & Martin Drlik, 2018. "Website Structure Improvement Based on the Combination of Selected Web Structure and Web Usage Mining Methods," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 17(06), pages 1743-1776, November.
    11. Viral Gupta & P. K. Kapur & Deepak Kumar, 2019. "Prioritizing and Optimizing Disaster Recovery Solution using Analytic Network Process and Multi Attribute Utility Theory," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 18(01), pages 171-207, January.
    12. Chun-Hao Chen & Tzung-Pei Hong & Yeong-Chyi Lee & Vincent S. Tseng, 2015. "Finding Active Membership Functions for Genetic-Fuzzy Data Mining," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 14(06), pages 1215-1242, November.
    13. Mahmut Baydaş & Orhan Emre Elma & Željko Stević, 2024. "Proposal of an innovative MCDA evaluation methodology: knowledge discovery through rank reversal, standard deviation, and relationship with stock return," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 10(1), pages 1-35, December.
    14. Jing Wang & Jian-Qiang Wang & Hong-Yu Zhang & Xiao-Hong Chen, 2017. "Distance-Based Multi-Criteria Group Decision-Making Approaches with Multi-Hesitant Fuzzy Linguistic Information," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 16(04), pages 1069-1099, July.
    15. Asongu, Simplice A. & Nnanna, Joseph & Acha-Anyi, Paul N., 2020. "Inequality and gender economic inclusion: The moderating role of financial access in Sub-Saharan Africa," Economic Analysis and Policy, Elsevier, vol. 65(C), pages 173-185.
    16. Qian Qian & Yang Yang & Zong-Fang Zhou, 2019. "Research on Trade Credit Spreading and Credit Risk within the Supply Chain," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 18(01), pages 389-411, January.
    17. Simplice A. Asongu & Nicholas M. Odhiambo, 2019. "Enhancing ICT for insurance in Africa," Review of Development Finance Journal, Chartered Institute of Development Finance, vol. 9(2), pages 16-27.
    18. Burcu Yılmaz Kaya & Aylin Adem & Metin Dağdeviren, 2020. "A DSS-Based Novel Approach Proposition Employing Decision Techniques for System Design," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 19(02), pages 413-445, March.
    19. Lei Wang & Qing Liu & Tongle Yin, 2018. "Decision-making of investment in navigation safety improving schemes with application of cumulative prospect theory," Journal of Risk and Reliability, , vol. 232(6), pages 710-724, December.
    20. Gia Sirbiladze & Irina Khutsishvili & Otar Badagadze & Mikheil Kapanadze, 2016. "More Precise Decision-Making Methodology in the Temporalized Body of Evidence. Application in the Information Technology Management," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 15(06), pages 1469-1502, November.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wsi:ijitdm:v:16:y:2017:i:06:n:s0219622017500341. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Tai Tone Lim (email available below). General contact details of provider: http://www.worldscinet.com/ijitdm/ijitdm.shtml .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.