A Parallel Incremental Frequent Itemsets Mining IFIN+: Improvement and Extensive Evaluation

Research output: Chapter in Book/Report/Conference proceedingConference proceedings

Abstract

In this paper, we propose a shared-memory parallelization solution for the Frequent Itemsets Mining algorithm IFIN, called IFIN+. The motivation for our work is that commodity processors, nowadays, are enhanced with many physical computational units, and exploiting full advantage of this is a potential solution to improve computational performance in single-machine environments. The portions in the serial version are improved in means which increases efficiency and computational independence for convenience in designing parallel computation with Work-Pool model, be known as a good model for load balance. We conducted extensive experiments on both synthetic and real datasets to evaluate IFIN+ against its serial version IFIN, the well-known algorithm FP-Growth and other two state-of-the-art ones, FIN and PrePost+. The experimental results show that the running time of IFIN+ is the most efficient, especially in the case of mining at different support thresholds within the same running session. Compare to its serial version, IFIN+ performance is improved significantly.
Original languageEnglish
Title of host publicationTransactions on Large-Scale Data- and Knowledge-Centered Systems XLI. Special Issue on Data and Security Engineering
EditorsTran Khanh Dang, Roland Wagner, Abdelkader Hameurlain
PublisherSpringer
Pages78-106
Number of pages28
Volume11390
DOIs
Publication statusPublished - Feb 2019

Publication series

NameLecture Notes in Computer Science (LNCS)

Fields of science

  • 202007 Computer integrated manufacturing (CIM)
  • 102001 Artificial intelligence
  • 102006 Computer supported cooperative work (CSCW)
  • 102010 Database systems
  • 102014 Information design
  • 102015 Information systems
  • 102016 IT security
  • 102019 Machine learning
  • 102022 Software development
  • 102025 Distributed systems
  • 102028 Knowledge engineering
  • 102033 Data mining
  • 102035 Data science
  • 502007 E-commerce
  • 505002 Data protection
  • 506002 E-government
  • 509018 Knowledge management

JKU Focus areas

  • Digital Transformation

Cite this