Skip to main navigation Skip to search Skip to main content

A Parallel Incremental Frequent Itemsets Mining IFIN+: Improvement and Extensive Evaluation

Research output: Chapter in Book/Report/Conference proceedingConference proceedings

Abstract

In this paper, we propose a shared-memory parallelization solution for the Frequent Itemsets Mining algorithm IFIN, called IFIN+. The motivation for our work is that commodity processors, nowadays, are enhanced with many physical computational units, and exploiting full advantage of this is a potential solution to improve computational performance in single-machine environments. The portions in the serial version are improved in means which increases efficiency and computational independence for convenience in designing parallel computation with Work-Pool model, be known as a good model for load balance. We conducted extensive experiments on both synthetic and real datasets to evaluate IFIN+ against its serial version IFIN, the well-known algorithm FP-Growth and other two state-of-the-art ones, FIN and PrePost+. The experimental results show that the running time of IFIN+ is the most efficient, especially in the case of mining at different support thresholds within the same running session. Compare to its serial version, IFIN+ performance is improved significantly.
Original languageEnglish
Title of host publicationTransactions on Large-Scale Data- and Knowledge-Centered Systems XLI. Special Issue on Data and Security Engineering
EditorsTran Khanh Dang, Roland Wagner, Abdelkader Hameurlain
PublisherSpringer
Pages78-106
Number of pages29
Volume11390
ISBN (Print)9783662588079
DOIs
Publication statusPublished - Feb 2019

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11390 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Fields of science

  • 202007 Computer integrated manufacturing (CIM)
  • 102001 Artificial intelligence
  • 102006 Computer supported cooperative work (CSCW)
  • 102010 Database systems
  • 102014 Information design
  • 102015 Information systems
  • 102016 IT security
  • 102019 Machine learning
  • 102022 Software development
  • 102025 Distributed systems
  • 102028 Knowledge engineering
  • 102033 Data mining
  • 102035 Data science
  • 502007 E-commerce
  • 505002 Data protection
  • 506002 E-government
  • 509018 Knowledge management

JKU Focus areas

  • Digital Transformation

Cite this